Load balancing for distributed processing of spatial data stream공간 데이터 스트림 분산 처리에서의 부하 균형화 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 697
  • Download : 0
A spatial big data stream is an unbounded and continuous data stream with geospatial features such as point, line, and polygon. With the evolution of Internet-of-Things technology, it has become much easier to acquire spatial big data streams from various sources such as smart cars, road network sensors, smart phones, and geo-tagged Social Network Services. Distributed and real-time processing are required to effectively process spatial big data streams; however, existing approaches such as spatial databases or spatial Hadoop are unable to meet these requirements. Distributed Stream Processing System implemented with spatial operations may be one of the most plausible solutions, but it also has a load imbalance issue. The spatial data streams inherently have a skewed and dynamically changing distribution; therefore, the system may display poor performance unless the entire load is efficiently distributed among workers. Previous studies to solve the load imbalance problem is not directly applicable to processing spatial queries since it does not consider the spatial locality of data. In this paper, we propose a load balancing algorithm, Adaptive Spatial Key Grouping (ASKG), which is specialized for spatial data stream. The main idea of ASKG is to learn the previous distribution of a data stream and utilize it to suggest a new grouping scheme that evenly distributes the incoming data stream into workers. By repeating the procedure periodically, the system can balance loads among workers adaptively to the spatial distribution of data stream. To evaluate the validity of our proposed algorithm in a variety of environments, we performed an evaluation with real datasets by varying the number of workers, input rate, and processing overhead. Compared to two other alternative algorithms, ASKG reduced load imbalance up to around 95%, leading directly to an increase in throughput of up to 147%, and to a decrease in latency of up to 99%.
Advisors
Lee, Jae-Gilresearcher이재길researcher
Description
한국과학기술원 :지식서비스공학대학원,
Publisher
한국과학기술원
Issue Date
2017
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 지식서비스공학대학원, 2017.2,[iv, 35 p. :]

Keywords

spatial data stream; load balancing; stream processing; distributed processing; spatial data processing; 공간 데이터 스트림; 부하 균형; 스트림 처리; 분산 처리; 공간 데이터 처리

URI
http://hdl.handle.net/10203/243517
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=675510&flag=dissertation
Appears in Collection
KSE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0