Multi-dimensional selectivity estimation using compressed information압축된 정보를 사용한 다차원 선택율 추정

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 469
  • Download : 0
For multimedia databases, the fuzzy query consists of a logical combination of content based similarity queries on features such as the color and the shape which are represented in continuous dimensions. Since the similarity query is intrinsically multi-dimensional, the multi-dimensional selectivity estimation is required in order to optimize a fuzzy query. For traditional databases, the query optimizer requires the multi-dimensional selectivity estimation for queries referencing multiple attributes from the same relation. The histogram is popularly used for the selectivity estimation. But the histogram has shortcomings as follows: First, it is difficult to estimate the selectivity of a similarity query using the histogram, since the typical similarity query has the shape of the hyper sphere and the range of features is continuous. Second, the histogram is efficient and practically the most preferable at a low dimensionality, particularly 1-dimension. However it suffers from the `dimensionality curses` at a high dimensionality, since the number of buckets is increased explosively. This causes severe problems of the high storage overhead and the high error rate at a high dimensionality. Third, most histograms do not support dynamic updates. This causes the periodical reconstruction of the histogram. With above motivations, we propose a novel approach for the multi-dimensional selectivity estimation of a query in continuous dimensions. Compressed information from a large number of small-sized buckets is maintained using the discrete cosine transform (DCT). This enables low error rates and low storage overheads even in a high dimensionality. Using the continuity property of the DCT we can make rectangles of any sizes in the data space in order to get approximate selectivity answers of queries of any shapes. In addition, this approach supports dynamic data updates, therefore it has the advantage of eliminating the overhead of periodical reconstructions of the compr...
Advisors
Chung, Chin-Wanresearcher정진완researcher
Description
한국과학기술원 : 컴퓨터공학전공,
Publisher
한국과학기술원
Issue Date
2001
Identifier
166554/325007 / 000939056
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 컴퓨터공학전공, 2001.2, [ viii, 106 p. ]

Keywords

Selectivity Estimation; Discrete Cosine Transform; Data COmpression; 선택율추정; 질의최적화; 데이타베이스; 이산여현변환; 데이타압축; Database; Query Optimization

URI
http://hdl.handle.net/10203/52050
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=166554&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0