DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Choi, Ho-Jin | - |
dc.contributor.advisor | 최호진 | - |
dc.contributor.author | Kang, Dong-Yeop | - |
dc.contributor.author | 강동엽 | - |
dc.date.accessioned | 2011-12-13T06:09:22Z | - |
dc.date.available | 2011-12-13T06:09:22Z | - |
dc.date.issued | 2010 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=455254&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/34945 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전산학과, 2010.08, [ v, 31 p. ] | - |
dc.description.abstract | In addition to search queries and the corresponding click-through information, search engine logs record multidimensional information about user search activities, such as search time, location, vertical, and search device. Multidimensional mining of search logs can provide novel insights and useful knowledge for both search engine users and developers. How can we develop a search engine service to support multidimensional mining of search logs effectively and efficiently? In this paper, we describe our topic-concept cube project which addresses the business need and answers several challenges. First, to semantically summarize a set of search queries and click-through data, we develop a novel topic-concept model which learns a hierarchy of concepts and topics automatically from search logs. Second, to handle a huge amount of log data, we develop distributed algorithms for learning model parameters efficiently. Third, we present alternative approaches for computing a topic-concept cube which supports multidimensional mining of search log data online. Last, we report an empirical study verifying the effectiveness and efficiency of our approach on a real data set of 1.96 billion queries and 2.73 billion clicks. | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | data mining | - |
dc.subject | data cube | - |
dc.subject | 데이터 큐브 | - |
dc.subject | 데이터마이닝 | - |
dc.title | Multidimensional mining of search logs based on topic-concept cube approach | - |
dc.title.alternative | 주제-개념 큐브 접근법에 기반한 검색 로그의 다차원 마이닝 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 455254/325007 | - |
dc.description.department | 한국과학기술원 : 전산학과, | - |
dc.identifier.uid | 020074340 | - |
dc.contributor.localauthor | Choi, Ho-Jin | - |
dc.contributor.localauthor | 최호진 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.