Nonnegative matrix factorization for interactive topic modeling and document clustering

Cited 0 time in webofscience Cited 76 time in scopus
  • Hit : 180
  • Download : 0
Nonnegative matrix factorization (NMF) approximates a nonnegative matrix by the product of two low–rank nonnegative matrices. Since it gives semantically meaningful result that is easily interpretable in clustering applications, NMF has been widely used as a clustering method especially for document data, and as a topic modeling method. We describe several fundamental facts of NMF and introduce its optimization framework called block coordinate descent. In the context of clustering, our framework provides a flexible way to extend NMF such as the sparse NMF and the weakly–supervised NMF. The former provides succinct representations for better interpretations while the latter flexibly incorporate extra information and user feedback in NMF, which effectively works as the basis for the visual analytic topic modeling system that we present. Using real–world text data sets, we present quantitative experimental results showing the superiority of our framework from the following aspects: fast convergence, high clustering accuracy, sparse representation, consistent output, and user interactivity. In addition, we present a visual analytic system called UTOPIAN (User–driven Topic modeling based on Interactive NMF) and show several usage scenarios. Overall, our book chapter cover the broad spectrum of NMF in the context of clustering and topic modeling, from fundamental algorithmic behaviors to practical visual analytics systems.
Publisher
Springer International Publishing
Issue Date
2015-01
Language
English
Article Type
Book Chapter
Citation

Partitional Clustering Algorithms, v.1, pp.215 - 243

DOI
10.1007/978-3-319-09259-1_7
URI
http://hdl.handle.net/10203/273480
Appears in Collection
AI-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0