Wikipedia-based query phrase expansion in patent class search

Cited 21 time in webofscience Cited 23 time in scopus
  • Hit : 776
  • Download : 4
DC FieldValueLanguage
dc.contributor.authorAl-Shboul, Basharko
dc.contributor.authorMyaeng, Sung Hyonko
dc.date.accessioned2014-12-16T01:08:08Z-
dc.date.available2014-12-16T01:08:08Z-
dc.date.created2014-06-30-
dc.date.created2014-06-30-
dc.date.issued2014-10-
dc.identifier.citationINFORMATION RETRIEVAL, v.17, no.5-6, pp.430 - 451-
dc.identifier.issn1386-4564-
dc.identifier.urihttp://hdl.handle.net/10203/192763-
dc.description.abstractRelevance feedback methods generally suffer from topic drift caused by word ambiguities and synonymous uses of words. Topic drift is an important issue in patent information retrieval as people tend to use different expressions describing similar concepts causing low precision and recall at the same time. Furthermore, failing to retrieve relevant patents to an application during the examination process may cause legal problems caused by granting an existing invention. A possible cause of topic drift is utilizing a relevance feedback-based search method. As a way to alleviate the inherent problem, we propose a novel query phrase expansion approach utilizing semantic annotations in Wikipedia pages, trying to enrich queries with phrases disambiguating the original query words. The idea was implemented for patent search where patents are classified into a hierarchy of categories, and the analyses of the experimental results showed not only the positive roles of phrases and words in retrieving additional relevant documents through query expansion but also their contributions to alleviating the query drift problem. More specifically, our query expansion method was compared against relevance-based language model, a state-of-the-art query expansion method, to show its superiority in terms of MAP on all levels of the classification hierarchy.-
dc.languageEnglish-
dc.publisherSPRINGER-
dc.subjectINFORMATION-RETRIEVAL-
dc.subjectLEXICAL COHESION-
dc.subjectTERMS-
dc.titleWikipedia-based query phrase expansion in patent class search-
dc.typeArticle-
dc.identifier.wosid000342411200003-
dc.identifier.scopusid2-s2.0-84943589091-
dc.type.rimsART-
dc.citation.volume17-
dc.citation.issue5-6-
dc.citation.beginningpage430-
dc.citation.endingpage451-
dc.citation.publicationnameINFORMATION RETRIEVAL-
dc.identifier.doi10.1007/s10791-013-9233-4-
dc.embargo.liftdate9999-12-31-
dc.embargo.terms9999-12-31-
dc.contributor.localauthorMyaeng, Sung Hyon-
dc.contributor.nonIdAuthorAl-Shboul, Bashar-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorPatent search-
dc.subject.keywordAuthorPhrase-based query expansion-
dc.subject.keywordAuthorWikipedia categories-
dc.subject.keywordAuthorClarity-
dc.subject.keywordAuthorRetrievability-
dc.subject.keywordPlusINFORMATION-RETRIEVAL-
dc.subject.keywordPlusLEXICAL COHESION-
dc.subject.keywordPlusTERMS-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 21 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0