Benchmarks

Graphalytics Benchmarks

This benchmark tests the performance of the GraphFrames algorithms, not of Apache Spark itself. All graphs are therefore read from disk and persisted in memory in serialized format before timing starts. As a result, only the running time of the GraphFrames algorithms is measured; the time spent reading the CSV, serializing, and persisting the data is not included.
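The measurement rule described above (prepare the data untimed, then time only the algorithm) can be sketched as a small harness. This is a minimal illustration in plain Python, not the benchmark's actual code: `measure`, `load_edges`, and `count_out_degrees` are hypothetical names, and the stand-in workload replaces the real Spark job. In Spark itself, the preparation step would also need an action such as `count()` to materialize the persisted data, because persisting is lazy.

```python
import time


def measure(prepare, algorithm, repetitions=3):
    """Run `prepare` once untimed, then time `algorithm` on its result."""
    data = prepare()  # untimed: stands in for reading the CSV and persisting the graph
    timings = []
    for _ in range(repetitions):
        start = time.perf_counter()
        algorithm(data)  # only this call is inside the timed region
        timings.append(time.perf_counter() - start)
    return timings


def load_edges():
    """Hypothetical stand-in for loading and persisting a graph."""
    time.sleep(0.3)  # simulates slow I/O that must not be counted
    return [(i, i + 1) for i in range(1000)]


def count_out_degrees(edges):
    """Hypothetical stand-in for a graph algorithm."""
    degrees = {}
    for src, _dst in edges:
        degrees[src] = degrees.get(src, 0) + 1
    return degrees


if __name__ == "__main__":
    for run, seconds in enumerate(measure(load_edges, count_out_degrees), start=1):
        print(f"run {run}: {seconds:.6f} s")
```

The harness returns the raw per-run timings rather than a single number, since how repeated measurements are aggregated is a separate choice.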

Configurations

Graph: wiki-Talk

| Algorithm | Measurements | Time (s) |
|---|---|---|
| Shortest Paths GraphFrames | 3 | 74.8877 |
| Shortest Paths GraphX | 3 | 15.8809 |
| Connected Components GraphFrames | 3 | 35.3588 |
| Connected Components GraphX | 3 | 15.4621 |
| Label Propagation GraphFrames | 3 | 94.4113 |
| Label Propagation GraphX | 3 | 82.0218 |
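
To compare the two implementations directly, the times above can be turned into ratios. The short sketch below only does arithmetic on the reported numbers; the `timings` dictionary and the `slowdown` helper are illustrative names, and the source does not say how the three measurements per row were aggregated into the reported time.

```python
# Times in seconds, copied from the wiki-Talk results above.
timings = {
    "Shortest Paths": {"GraphFrames": 74.8877, "GraphX": 15.8809},
    "Connected Components": {"GraphFrames": 35.3588, "GraphX": 15.4621},
    "Label Propagation": {"GraphFrames": 94.4113, "GraphX": 82.0218},
}


def slowdown(algorithm):
    """Return the GraphFrames time divided by the GraphX time."""
    row = timings[algorithm]
    return row["GraphFrames"] / row["GraphX"]


if __name__ == "__main__":
    for name in timings:
        print(f"{name}: {slowdown(name):.2f}x")
```

On these numbers, the GraphFrames time is roughly 4.7 times the GraphX time for Shortest Paths, about 2.3 times for Connected Components, and about 1.15 times for Label Propagation.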