Scalability of the Example Transformation

The example transformation has been tested in the Amazon Cloud environment with the following conditions for all executions:

We tested "node allocation" cardinalities ranging from a single node up to 8 nodes.

The following figure shows how the runtime depends on the number of nodes in the cluster:

Figure 29.7. Cluster Scalability


The following figure shows how the "speedup factor" depends on the number of nodes in the cluster. The speedup factor is the ratio of the average runtime with one cluster node to the average runtime with x cluster nodes. Thus:

speedupFactor = avgRuntime(1 node) / avgRuntime(x nodes)
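The formula above can be sketched in a few lines of Python. This is only an illustration: the function name `speedup_factor` and the variable names are assumptions made for this example, and the input values are the three measured runtimes for 1 and 2 nodes taken from the table of measured runtimes in this section.

```python
from statistics import mean

def speedup_factor(runtimes_one_node, runtimes_x_nodes):
    """Ratio of the average runtime on 1 node to the average runtime on x nodes."""
    return mean(runtimes_one_node) / mean(runtimes_x_nodes)

# Measured runtimes in seconds, copied from the table of measured runtimes.
one_node = [861, 861, 861]
two_nodes = [467, 465, 466]

print(round(speedup_factor(one_node, two_nodes), 2))  # prints 1.85
```

A value of 1.85 means that the two-node cluster finished the transformation in a little more than half the single-node time.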

We can see that the results are favourable up to 4 nodes. Each additional node still improves cluster performance, but the improvement gets smaller: going from 4 to 8 nodes raises the speedup factor only from 3.68 to 5.24. Nine or more nodes in the cluster may even have a negative effect, because their performance benefit may be lost in the overhead of managing these nodes.

These results are specific to this transformation. Another transformation may have a much better or possibly worse curve.

Figure 29.8. Speedup factor


Table of measured runtimes:

nodes  runtime 1 [s]  runtime 2 [s]  runtime 3 [s]  average runtime [s]  speedup factor
1      861            861            861            861                  1
2      467            465            466            466                  1.85
3      317            319            314            316.67               2.72
4      236            233            233            234                  3.68
5      208            204            204            205.33               4.19
6      181            182            182            181.67               4.74
7      168            168            168            168                  5.13
8      172            159            162            164.33               5.24
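
The diminishing effect of each additional node can be read from the speedup factor column. The following minimal sketch computes how much the speedup factor grows with each added node. The helper name `marginal_gains` is an assumption made for this example, and the input values are the speedup factors reported in the table above.

```python
# Speedup factors as reported in the table (number of nodes -> speedup factor).
speedup = {1: 1.0, 2: 1.85, 3: 2.72, 4: 3.68, 5: 4.19, 6: 4.74, 7: 5.13, 8: 5.24}

def marginal_gains(speedup_by_nodes):
    """Increase of the speedup factor contributed by each added node."""
    nodes = sorted(speedup_by_nodes)
    return {
        n: round(speedup_by_nodes[n] - speedup_by_nodes[prev], 2)
        for prev, n in zip(nodes, nodes[1:])
    }

gains = marginal_gains(speedup)
for n, gain in gains.items():
    print(f"{n - 1} -> {n} nodes: +{gain:.2f}")
```

With these measurements, each node up to the fourth adds between 0.85 and 0.96 to the speedup factor, while the eighth node adds only 0.11.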