Parallel community detection on large graphs with MapReduce and GraphChi

Cited 26 time in webofscience Cited 0 time in scopus
  • Hit : 719
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorMoon, Seunghyeonko
dc.contributor.authorLee, Jae-Gilko
dc.contributor.authorKang, Minseoko
dc.contributor.authorChoy, Minsooko
dc.contributor.authorLee, Jin Wooko
dc.date.accessioned2016-10-07T09:32:55Z-
dc.date.available2016-10-07T09:32:55Z-
dc.date.created2014-12-24-
dc.date.created2014-12-24-
dc.date.created2014-12-24-
dc.date.issued2016-07-
dc.identifier.citationDATA & KNOWLEDGE ENGINEERING, v.104, pp.17 - 31-
dc.identifier.issn0169-023X-
dc.identifier.urihttp://hdl.handle.net/10203/213274-
dc.description.abstractCommunity detection from social network data gains much attention from academia and industry since it has many real-world applications. The Girvan-Newman (GN) algorithm is a divisive hierarchical clustering algorithm for community detection, which is regarded as one of the most popular algorithms. It exploits the concept of edge betweenness to divide a network into multiple communities. Though it is being widely used, it has limitations in supporting large-scale networks since it needs to calculate the shortest path between every pair of vertices in a network. In this paper, we develop two parallel versions of the GN algorithm to support large-scale networks. First, we propose a new algorithm, which we call Shortest Path Betweenness MapReduce Algorithm (SPB-MRA), that utilizes the MapReduce model. Second, we propose another new algorithm, which we call Shortest Path Betweenness Vertex-Centric Algorithm (SPB-VCA), that utilizes the vertex-centric model. An approximation technique is also developed to further speed up community detection processes. We implemented SPB-MRA using Hadoop and SPB-VCA using GraphChi, and then evaluated the performance of SPB-MRA on Amazon EC2 instances and that of SPB-VCA on a single commodity PC. The evaluation results showed that the elapsed time of SPB-MRA decreased almost linearly as the number of reducers increased, SPB-VCA outperformed SPB-MRA just on a single PC by 4-6 times, and the approximation technique introduced negligible errors. (C) 2015 Elsevier B.V. All rights reserved.-
dc.languageEnglish-
dc.publisherELSEVIER SCIENCE BV-
dc.titleParallel community detection on large graphs with MapReduce and GraphChi-
dc.typeArticle-
dc.identifier.wosid000381589800003-
dc.identifier.scopusid2-s2.0-84929256294-
dc.type.rimsART-
dc.citation.volume104-
dc.citation.beginningpage17-
dc.citation.endingpage31-
dc.citation.publicationnameDATA & KNOWLEDGE ENGINEERING-
dc.identifier.doi10.1016/j.datak.2015.05.001-
dc.contributor.localauthorLee, Jae-Gil-
dc.contributor.nonIdAuthorMoon, Seunghyeon-
dc.type.journalArticleArticle; Proceedings Paper-
dc.subject.keywordAuthorClustering, classification, and association rules-
dc.subject.keywordAuthorMining methods and algorithms-
dc.subject.keywordAuthorCommunity detection-
dc.subject.keywordAuthorSocial networks-
dc.subject.keywordAuthorMapReduce-
dc.subject.keywordAuthorVertex-centric model-
dc.subject.keywordPlusEDGE-BETWEENNESS-
dc.subject.keywordPlusNETWORKS-
Appears in Collection
IE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 26 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0