Non-negative matrix factorization based text mining: Feature extraction and classification

Cited 7 time in webofscience Cited 0 time in scopus
  • Hit : 584
  • Download : 2
DC FieldValueLanguage
dc.contributor.authorBarman, P. C.ko
dc.contributor.authorIqbal, Nadeemko
dc.contributor.authorLee, Soo-Youngko
dc.date.accessioned2009-07-23T02:12:16Z-
dc.date.available2009-07-23T02:12:16Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2006-
dc.identifier.citationNEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS BOOK SERIES: LECTURE NOTES IN COMPUTER SCIENCE, v.4233, pp.703 - 712-
dc.identifier.issn0302-9743-
dc.identifier.urihttp://hdl.handle.net/10203/10203-
dc.description.abstractThe unlabeled document or text collections are becoming larger and larger which is common and obvious; mining such data sets are a challenging task. Using the simple word-document frequency matrix as feature space the mining process is becoming more complex. The text documents are often represented as high dimensional about few thousand sparse vectors with sparsity about 95 to 99% which significantly affects the efficiency and the results of the mining process. In this paper, we propose the two-stage Non-negative Matrix Factorization (NMF): in the first stage we tried to extract the uncorrelated basis probabilistic document feature vectors by significantly reducing the dimension of the feature vectors of the word-document frequency from few thousand to few hundred, and in the second stage for clustering or classification. In our propose approach it has been observed that the clustering or classification performance with more than 98.5% accuracy. The dimension reduction and classification performance has observed for the Classic3 dataset.-
dc.description.sponsorshipThis research was supported as the Brain Neuroinformatic Research Program by Korean Ministry of Commerce, Industry, and Energy.en
dc.languageEnglish-
dc.language.isoen_USen
dc.publisherSPRINGER-VERLAG BERLIN-
dc.titleNon-negative matrix factorization based text mining: Feature extraction and classification-
dc.typeArticle-
dc.identifier.wosid000241753100078-
dc.identifier.scopusid2-s2.0-33750701776-
dc.type.rimsART-
dc.citation.volume4233-
dc.citation.beginningpage703-
dc.citation.endingpage712-
dc.citation.publicationnameNEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS BOOK SERIES: LECTURE NOTES IN COMPUTER SCIENCE-
dc.embargo.liftdate9999-12-31-
dc.embargo.terms9999-12-31-
dc.contributor.localauthorLee, Soo-Young-
dc.contributor.nonIdAuthorBarman, P. C.-
dc.type.journalArticleArticle; Proceedings Paper-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 7 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0