Topic chains for understanding a news corpus

The Web is a vast resource and archive of news articles from around the world. We present a framework, based on probabilistic topic modeling, for uncovering the meaningful structure and trends of important topics and issues hidden within news archives on the Web. Central to the framework is the topic chain, a temporal organization of similar topics. We experimented with various topic similarity metrics and present our insights on how best to construct topic chains. We discuss how to interpret topic chains to understand a news corpus by examining long-term topics, temporary issues, and shifts of focus within the chains. We applied our framework to a nine-month corpus of Korean Web news and present our findings.
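To make the idea of a topic chain concrete, the sketch below links similar topics across consecutive time slices and groups the linked topics into chains. It is only an illustration under stated assumptions, not the paper's implementation: the abstract does not say which similarity metric, threshold, or time window worked best, so cosine similarity, the threshold of 0.5, the window of one slice, the function names (cosine, link_topics, build_chains), and the toy topic distributions are all hypothetical choices made here.

```python
from math import sqrt


def cosine(p, q):
    """Cosine similarity between two sparse topic-word distributions (dict: word -> weight)."""
    dot = sum(w * q.get(word, 0.0) for word, w in p.items())
    norm_p = sqrt(sum(w * w for w in p.values()))
    norm_q = sqrt(sum(w * w for w in q.values()))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def link_topics(slices, threshold=0.5, window=1):
    """Link each topic to similar topics in the preceding `window` time slices.

    `slices` is a list of time slices; each slice is a list of topics.
    Returns edges of the form ((earlier_slice, topic_idx), (later_slice, topic_idx), similarity).
    The threshold and window defaults are assumptions, not values from the paper.
    """
    edges = []
    for t in range(1, len(slices)):
        for j, topic in enumerate(slices[t]):
            for back in range(1, window + 1):
                s = t - back
                if s < 0:
                    break
                for i, earlier in enumerate(slices[s]):
                    sim = cosine(earlier, topic)
                    if sim >= threshold:
                        edges.append(((s, i), (t, j), sim))
    return edges


def build_chains(slices, edges):
    """Group topics into chains, taken here as connected components of the link graph."""
    parent = {(t, i): (t, i) for t, topics in enumerate(slices) for i in range(len(topics))}

    def find(node):
        # Union-find lookup with path halving.
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for a, b, _ in edges:
        parent[find(a)] = find(b)

    groups = {}
    for node in parent:
        groups.setdefault(find(node), []).append(node)
    return sorted(sorted(group) for group in groups.values())


# Toy data: three time slices of made-up topics (hypothetical, not from the corpus).
slices = [
    [{"election": 0.6, "vote": 0.4}, {"soccer": 0.7, "goal": 0.3}],
    [{"election": 0.5, "vote": 0.3, "poll": 0.2}, {"flu": 0.8, "vaccine": 0.2}],
    [{"election": 0.4, "poll": 0.6}],
]

edges = link_topics(slices)
for chain in build_chains(slices, edges):
    print(chain)
```

Under these assumptions, the election topics form one chain spanning all three slices, which could loosely be read as a long-term topic, while the soccer and flu topics remain single-slice chains, closer to temporary issues. A different similarity metric or a wider window would change which links are drawn.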
Publisher
CICLing'11
Issue Date
2011-02-20
Language
English
Citation

12th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2011, pp. 163-176

ISSN
0302-9743
URI
http://hdl.handle.net/10203/166575
Appears in Collection
CS-Conference Papers(학술회의논문)