Nonnegative matrix factorization for interactive topic modeling and document clustering

Cited 0 time in webofscience Cited 76 time in scopus
  • Hit : 181
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorda Kuang, D.ko
dc.contributor.authorChoo, Jaegulko
dc.contributor.authorPark, Haesunko
dc.date.accessioned2020-03-24T08:20:05Z-
dc.date.available2020-03-24T08:20:05Z-
dc.date.created2020-03-24-
dc.date.created2020-03-24-
dc.date.created2020-03-24-
dc.date.created2020-03-24-
dc.date.issued2015-01-
dc.identifier.citationPartitional Clustering Algorithms, v.1, pp.215 - 243-
dc.identifier.urihttp://hdl.handle.net/10203/273480-
dc.description.abstractNonnegative matrix factorization (NMF) approximates a nonnegative matrix by the product of two low–rank nonnegative matrices. Since it gives semantically meaningful result that is easily interpretable in clustering applications, NMF has been widely used as a clustering method especially for document data, and as a topic modeling method. We describe several fundamental facts of NMF and introduce its optimization framework called block coordinate descent. In the context of clustering, our framework provides a flexible way to extend NMF such as the sparse NMF and the weakly–supervised NMF. The former provides succinct representations for better interpretations while the latter flexibly incorporate extra information and user feedback in NMF, which effectively works as the basis for the visual analytic topic modeling system that we present. Using real–world text data sets, we present quantitative experimental results showing the superiority of our framework from the following aspects: fast convergence, high clustering accuracy, sparse representation, consistent output, and user interactivity. In addition, we present a visual analytic system called UTOPIAN (User–driven Topic modeling based on Interactive NMF) and show several usage scenarios. Overall, our book chapter cover the broad spectrum of NMF in the context of clustering and topic modeling, from fundamental algorithmic behaviors to practical visual analytics systems.-
dc.languageEnglish-
dc.publisherSpringer International Publishing-
dc.titleNonnegative matrix factorization for interactive topic modeling and document clustering-
dc.typeArticle-
dc.identifier.scopusid2-s2.0-84944595142-
dc.type.rimsART-
dc.citation.volume1-
dc.citation.beginningpage215-
dc.citation.endingpage243-
dc.citation.publicationnamePartitional Clustering Algorithms-
dc.identifier.doi10.1007/978-3-319-09259-1_7-
dc.contributor.localauthorChoo, Jaegul-
dc.contributor.nonIdAuthorda Kuang, D.-
dc.contributor.nonIdAuthorPark, Haesun-
dc.description.isOpenAccessN-
dc.type.journalArticleBook Chapter-
Appears in Collection
AI-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0