n-GRAM Indexstruktur mit zwei Ebenen und Verfahren zur Indexerstellung2단계 n-gram 역색인 구조 및 그 구성 방법과 질의 처리 방법 및 그 색인 도출 방법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 321
  • Download : 0
Disclosed relates to a structure of two-level n-gram inverted index and methods of building the same, processing queries and deriving the index that reduce the size of n-gram inverted index and improves the query performance by eliminating the redundancy of the position information that exists in the n-gram inverted index. The inverted index of the present invention comprises a back-end inverted index using subsequences extracted from documents as a term and a front-end inverted index using n-grams extracted from the subsequences as a term. The back-end inverted index uses the subsequences of a specific length extracted from the documents to be overlapped with each other by n−1 (n: the length of n-gram) as a term and stores position information of the subsequences occurring in the documents in a posting list for the respective subsequences. The front-end inverted index uses the n-grams of a specific length extracted from the subsequences using a 1-sliding technique as a term and stores position information of the n-grams occurring in the subsequences in a posting list for the respective n-grams.
Assignee
KAIST
Country
GE (Georgia)
Application Date
2006-08-23
Application Number
102006039484.4
Registration Date
2009-12-03
Registration Number
102006039484
URI
http://hdl.handle.net/10203/303582
Appears in Collection
CS-Patent(특허)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0