Efficient text categorization using reference information for knowledge portal = 지식 포탈에서 인용정보를 이용한 효율적인 문서 자동분류에 관한 연구

As the volume of information available on the Internet and corporate intranets continues to increase, there is a growing need for the text categorization in knowledge portal helping people better find, filter, and manage these resources. But the conventional text categorization has some limitation. First of all, it is hard to keep paces with Interest of experts changed dynamically as time goes by. Second, the keyword-based method provokes problems such as very high dimensionality in the documents and sparsely resided feature space, gap between the unrestricted vocabularies of documents and the restricted vocabularies of indexing categories, and computational Complexity. In this thesis, to overcome these limitations, new text categorization method using reference information is presented. In science and research domains, most documents have reference information that contains a list of related documents and their publication information. The authors write the references that are mostly concentrated on one or two major categories. So text categorization using reference information can reflect a list of related documents and their publication information of authors and reduce dimensionality due to sparse feature set. The performance of suggested method is verified by conducting an experiment.
Advisors
Park, Sung-Jooresearcher박성주researcher
Publisher
한국과학기술원
Issue Date
2002
Identifier
173802/325007 / 020003601
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 경영공학전공, 2002.2, [ iv, 52 p. ]

Keywords

Reference Information; Text Categorization; Knowledge Portal; 지식포탈; 인용정보; 문서자동분류

URI
http://hdl.handle.net/10203/53727
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=173802&flag=t
Appears in Collection
KGSM-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.
  • Hit : 92
  • Download : 0
  • Cited 0 times in thomson ci

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0