An Experimental Comparison of Iterative MapReduce Frameworks

Cited 8 time in webofscience Cited 0 time in scopus
  • Hit : 354
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLee, Haejoonko
dc.contributor.authorKang, Minseoko
dc.contributor.authorYoun, Sun-Bumko
dc.contributor.authorLee, Jae-Gilko
dc.contributor.authorKwon, YongChulko
dc.date.accessioned2017-01-03T07:36:03Z-
dc.date.available2017-01-03T07:36:03Z-
dc.date.created2016-11-16-
dc.date.created2016-11-16-
dc.date.created2016-11-16-
dc.date.issued2016-10-26-
dc.identifier.citation25th ACM Int'l on Conf. on Information and Knowledge Management (CIKM), pp.2089 - 2094-
dc.identifier.urihttp://hdl.handle.net/10203/215623-
dc.description.abstractMapReduce has become a dominant framework in big data analysis, and thus there have been significant efforts to implement various data analysis algorithms in MapReduce. Many data analysis algorithms are inherently iterative, repeating the same set of tasks until a convergence. To efficiently support iterative algorithms at scale, a few variants of Hadoop and new platforms have been proposed and actively developed in both academia and industry. Representative systems include HaLoop, iMapReduce, Twister, and Spark. In this paper, we experimentally compare Hadoop and the aforementioned systems using various workloads and metrics. The five systems are compared through four iterative algorithms-PageRank, recursive query, k-means, and logistic regression-on 50 Amazon EC2 machines (200 cores in total). We thoroughly explore the effectiveness of their new caching, communication, and scheduling mechanisms in support of iterative computation. Our evaluation also shows the performance depending on data skew-ness and memory residency. Overall, we believe that our evaluation and interpretation will be useful for designing a new framework or improving the existing ones.-
dc.languageEnglish-
dc.publisherACM Special Interest Group on Information Retrieval (SIGIR)-
dc.titleAn Experimental Comparison of Iterative MapReduce Frameworks-
dc.typeConference-
dc.identifier.wosid000390890800245-
dc.identifier.scopusid2-s2.0-84996550782-
dc.type.rimsCONF-
dc.citation.beginningpage2089-
dc.citation.endingpage2094-
dc.citation.publicationname25th ACM Int'l on Conf. on Information and Knowledge Management (CIKM)-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationIndianapolis, IN, USA-
dc.identifier.doi10.1145/2983323.2983647-
dc.contributor.localauthorLee, Jae-Gil-
dc.contributor.nonIdAuthorLee, Haejoon-
dc.contributor.nonIdAuthorKang, Minseo-
dc.contributor.nonIdAuthorYoun, Sun-Bum-
dc.contributor.nonIdAuthorKwon, YongChul-
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 8 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0