Fake review detection: Understanding how deception is expressed in writing

DC Field | Value | Language
dc.contributor.advisor | Myaeng, Sung-Hyon | -
dc.contributor.advisor | 맹성현 | -
dc.contributor.author | Lee, Kyungyup Daniel | -
dc.contributor.author | 이경엽 | -
dc.date.accessioned | 2015-04-23T06:15:57Z | -
dc.date.available | 2015-04-23T06:15:57Z | -
dc.date.issued | 2014 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=592450&flag=dissertation | -
dc.identifier.uri | http://hdl.handle.net/10203/196839 | -
dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : Department of Computer Science, 2014.8, [ v, 32 p. ] | -
dc.description.abstract | User-generated online reviews for products and services are becoming increasingly important for potential customers making purchase decisions. At the same time, some online reviews are not trustworthy because some business owners hire people to generate fake reviews, making automatic sentiment analysis and summarization meaningless. Fake review detection, however, is not easy even for humans, and previous approaches to automatic detection have therefore had only limited success. Noting from a previous study that people show factitious writing behaviors when writing deceptive reviews, which may cause a word selection process different from that used for writing truthful reviews, we propose a novel approach to fake review detection that employs a generative model in which word selections in writing documents are assumed to be affected by the topics selected by the writer. In other words, we assume that the distinct features of fake reviews come from “topic” distributions that differ from those of truthful ones, and we attempt to detect fake reviews by comparing two topic distributions generated by LDA from truthful and fake review document sets. Using an evaluation corpus constructed from Yelp reviews in seven categories, such as ‘hotels’ and ‘restaurants’, we show that our method outperforms a previously proposed word-based method by a significant margin and has little category dependency. We also offer a semantic interpretation of the topic modeling results. | eng
dc.language | eng | -
dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | -
dc.subject | Fake review | -
dc.subject | User performance | -
dc.subject | Human performance | -
dc.subject | Topic distribution | -
dc.subject | Category dependency | -
dc.title | Fake review detection: Understanding how deception is expressed in writing | -
dc.title.alternative | Fake review detection: Understanding how deception is expressed in writing | -
dc.type | Thesis(Master) | -
dc.identifier.CNRN | 592450/325007 | -
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST) : Department of Computer Science | -
dc.identifier.uid | 020124480 | -
dc.contributor.localauthor | Myaeng, Sung-Hyon | -
dc.contributor.localauthor | 맹성현 | -
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.
