TrillionG: A trillion-scale synthetic graph generator using a recursive vector model

Cited 0 time in webofscience Cited 24 time in scopus
  • Hit : 165
  • Download : 0
As many applications encounter exponential growth in graph sizes, a fast and scalable graph generator has become more important than ever before due to lack of large-scale realistic graphs for evaluating the performance of graph processing methods. Although there have been proposed a number of methods to generate synthetic graphs, they are not very efficient in terms of space and time complexities, and so, cannot generate even trillion-scale graphs using a moderate size cluster of commodity machines. Here, we propose an efficient and scalable disk-based graph generator, TrillionG that can generate massive graphs in a short time only using a small amount of memory. It can generate a graph of a trillion edges following the RMAT or Kronecker models within two hours only using 10 PCs. We first generalize existing graph generation models to the scope-based generation model, where RMAT and Kronecker correspond to two extremes. Then, we propose a new graph generation model called the recursive vector model, which compromises two extremes, and so, solves the space and time complexity problems existing in RMAT and Kronecker. We also extend the recursive vector model so as to generate a semantically richer graph database. Through extensive experiments, we have demonstrated that TrillionG outperforms the state-of-the-art graph generators by up to orders of magnitude. ? 2017 ACM.
Publisher
ACM Special Interest Group on Management of Data (SIGMOD)
Issue Date
2017-05
Language
English
Citation

2017 ACM SIGMOD International Conference on Management of Data, SIGMOD 2017, pp.913 - 928

ISSN
0730-8078
DOI
10.1145/3035918.3064014
URI
http://hdl.handle.net/10203/274447
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0