A parallel query processing system based on graph-based database partitioning

Cited 4 time in webofscience Cited 7 time in scopus
  • Hit : 480
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorNam, Yoon-Minko
dc.contributor.authorHan, Donghyoungko
dc.contributor.authorKim, Min-Sooko
dc.date.accessioned2020-03-19T02:24:55Z-
dc.date.available2020-03-19T02:24:55Z-
dc.date.created2020-03-10-
dc.date.created2020-03-10-
dc.date.created2020-03-10-
dc.date.created2020-03-10-
dc.date.created2020-03-10-
dc.date.issued2019-04-
dc.identifier.citationINFORMATION SCIENCES, v.480, pp.237 - 260-
dc.identifier.issn0020-0255-
dc.identifier.urihttp://hdl.handle.net/10203/272649-
dc.description.abstractAs parallel database systems have large amounts of data to process, it is important to utilize a scalable and efficient horizontal database partitioning method. The existing partitioning methods have major drawbacks that not only cause large amounts of data redundancy but also still require expensive shuffle operations for join queries in many cases-despite their high data redundancy. We elucidate upon the drawbacks originating from the tree-based partitioning schemes and propose a novel graph-based database partitioning method called GPT that both improves the query performance and reduces data redundancy. We integrate the proposed GPT method into a parallel query processing system, Spark SQL, across all the relevant layers and modules, including the query plan generator and the scan operator. Through extensive experiments using three benchmarks, TPC-DS, IMDB and BioWarehouse, we show that GPT significantly outperforms the state-of-the-art method in terms of both storage overhead and query performance.-
dc.languageEnglish-
dc.publisherELSEVIER SCIENCE INC-
dc.titleA parallel query processing system based on graph-based database partitioning-
dc.typeArticle-
dc.identifier.wosid000459644700015-
dc.identifier.scopusid2-s2.0-85059005654-
dc.type.rimsART-
dc.citation.volume480-
dc.citation.beginningpage237-
dc.citation.endingpage260-
dc.citation.publicationnameINFORMATION SCIENCES-
dc.identifier.doi10.1016/j.ins.2018.12.031-
dc.contributor.localauthorKim, Min-Soo-
dc.contributor.nonIdAuthorNam, Yoon-Min-
dc.contributor.nonIdAuthorHan, Donghyoung-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorHorizontal database partitioning-
dc.subject.keywordAuthorGraph-based partitioning-
dc.subject.keywordAuthorParallel query processing-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 4 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0