DSpace at KOASAS: An Experimental Analysis of Limitations of MapReduce for Iterative Algorithms on Spark

DSpace at KOASAS

College of Engineering(공과대학)Dept. of Industrial and Systems Engineering(산업및시스템공학과)IE-Journal Papers(저널논문)

An Experimental Analysis of Limitations of MapReduce for Iterative Algorithms on Spark

Cited 6 time in

Cited 0 time in

Hit : 696
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Kang, Minseo	ko
dc.contributor.author	Lee, Jae-Gil	ko
dc.date.accessioned	2017-12-05T02:08:17Z	-
dc.date.available	2017-12-05T02:08:17Z	-
dc.date.created	2017-09-06	-
dc.date.created	2017-09-06	-
dc.date.issued	2017-12	-
dc.identifier.citation	CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, v.20, no.4, pp.3593 - 3604	-
dc.identifier.issn	1386-7857	-
dc.identifier.uri	http://hdl.handle.net/10203/227498	-
dc.description.abstract	MapReduce is the most popular framework for distributed processing. Recently, the scalability of data mining and machine learning algorithms has significantly improved with help from MapReduce. However, MapReduce does not handle iterative algorithms very efficiently. The problem is that many data mining and machine learning algorithms are iterative by nature. In order to overcome the limitations of MapReduce, many advanced distributed systems have been developed, including HaLoop, iMapReduce, Twister, and Spark. In this paper, we identify and categorize the limitations of MapReduce in handling iterative algorithms, and then, experimentally investigate the consequences of these limitations by using the most flexible and stable distributed system, Spark. According to our experiment results, the network I/O overhead was the primary factor that affected system performance the most. The disk I/O overhead also affected system performance, but it was less significant than the network I/O overhead. For the synchronization overhead, it affected system performance only when the static data was not cached.	-
dc.language	English	-
dc.publisher	SPRINGER	-
dc.title	An Experimental Analysis of Limitations of MapReduce for Iterative Algorithms on Spark	-
dc.type	Article	-
dc.identifier.wosid	000414780400063	-
dc.identifier.scopusid	2-s2.0-85029577286	-
dc.type.rims	ART	-
dc.citation.volume	20	-
dc.citation.issue	4	-
dc.citation.beginningpage	3593	-
dc.citation.endingpage	3604	-
dc.citation.publicationname	CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS	-
dc.identifier.doi	10.1007/s10586-017-1167-y	-
dc.contributor.localauthor	Lee, Jae-Gil	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Iterative algorithms	-
dc.subject.keywordAuthor	Hadoop	-
dc.subject.keywordAuthor	Spark	-
dc.subject.keywordAuthor	HaLoop	-
dc.subject.keywordAuthor	MapReduce	-
dc.subject.keywordAuthor	Twister	-

Appears in Collection: IE-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 6 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

An Experimental Analysis of Limitations of MapReduce for Iterative Algorithms on Spark

This item is cited by other documents in WoS

KOASAS

Communities & Collections