Inferring user profile using textual content on twitter단어를 이용한 트위터 상의 사용자 프로파일 유추에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 1359
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorMoon, Sue-Bok-
dc.contributor.advisor문수복-
dc.contributor.authorRyu, Kyoung-Min-
dc.contributor.author류경민-
dc.date.accessioned2015-04-23T07:06:30Z-
dc.date.available2015-04-23T07:06:30Z-
dc.date.issued2014-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=592369&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/197114-
dc.description학위논문(석사) - 한국과학기술원 : 웹사이언스공학전공, 2014.8, [ iv, 26 p. ]-
dc.description.abstractIn the past few years online social media have risen as a key venue for communicating with the public and monitoring public opinions. People talk about movies they watch, restaurants they visit, and views they enjoy, insinuating their whereabouts. In order to weigh in the public opinions expressed on such social media as much as traditional poll results or to optimize businesses for specific class of users, the representativeness of the opinions has to be accounted for. A profile such as age, gender, and location of users is one of the key factors in the representativeness, but are not available by default in online social networking platform. The number of users who make their profiles public is relatively small, compared to the huge number of users in online social networking services and social media platforms. Besides, there are several studies inferring user profile on various social networking services, but none of them apply their methods on Korean Twitter users. In this work we propose a new framework to infer a Korean user`s main location of activities, age, and gender in Twitter using their textual contents. Our approach is based on a probabilistic generative model that filters local words, employs data binning for scalability, and applies a map projection technique for performance in inferring user’s main location. Also, we use classifier for inferring user’s age and gender and apply feature selection for filtering relevant features to classes. We evaluate our method with users who have focused GPS-tagged tweets or with manually annotated users who use profile-relevant words in their description data. For inferring Korean user’s location, we report that 60% of users are identified within 10km of their locations, a significant improvement over existing approaches. And for inferring user’s age and gender, we report that 75% and 88% of users are correctly identified.eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectprofile-
dc.subjectdata mining-
dc.subject소셜미디어-
dc.subject프로필-
dc.subjectsocial media-
dc.subject데이터마이닝-
dc.titleInferring user profile using textual content on twitter-
dc.title.alternative단어를 이용한 트위터 상의 사용자 프로파일 유추에 관한 연구-
dc.typeThesis(Master)-
dc.identifier.CNRN592369/325007 -
dc.description.department한국과학기술원 : 웹사이언스공학전공, -
dc.identifier.uid020124398-
dc.contributor.localauthorMoon, Sue-Bok-
dc.contributor.localauthor문수복-
Appears in Collection
WST-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0