DSpace at KOASAS: Non-negative matrix factorization based text mining: Feature extraction and classification

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Journal Papers(저널논문)

Non-negative matrix factorization based text mining: Feature extraction and classification

Cited 7 time in

Cited 0 time in

Hit : 584
Download : 2

Export

DC Field	Value	Language
dc.contributor.author	Barman, P. C.	ko
dc.contributor.author	Iqbal, Nadeem	ko
dc.contributor.author	Lee, Soo-Young	ko
dc.date.accessioned	2009-07-23T02:12:16Z	-
dc.date.available	2009-07-23T02:12:16Z	-
dc.date.created	2012-02-06	-
dc.date.created	2012-02-06	-
dc.date.issued	2006	-
dc.identifier.citation	NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS BOOK SERIES: LECTURE NOTES IN COMPUTER SCIENCE, v.4233, pp.703 - 712	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	http://hdl.handle.net/10203/10203	-
dc.description.abstract	The unlabeled document or text collections are becoming larger and larger which is common and obvious; mining such data sets are a challenging task. Using the simple word-document frequency matrix as feature space the mining process is becoming more complex. The text documents are often represented as high dimensional about few thousand sparse vectors with sparsity about 95 to 99% which significantly affects the efficiency and the results of the mining process. In this paper, we propose the two-stage Non-negative Matrix Factorization (NMF): in the first stage we tried to extract the uncorrelated basis probabilistic document feature vectors by significantly reducing the dimension of the feature vectors of the word-document frequency from few thousand to few hundred, and in the second stage for clustering or classification. In our propose approach it has been observed that the clustering or classification performance with more than 98.5% accuracy. The dimension reduction and classification performance has observed for the Classic3 dataset.	-
dc.description.sponsorship	This research was supported as the Brain Neuroinformatic Research Program by Korean Ministry of Commerce, Industry, and Energy.	en
dc.language	English	-
dc.language.iso	en_US	en
dc.publisher	SPRINGER-VERLAG BERLIN	-
dc.title	Non-negative matrix factorization based text mining: Feature extraction and classification	-
dc.type	Article	-
dc.identifier.wosid	000241753100078	-
dc.identifier.scopusid	2-s2.0-33750701776	-
dc.type.rims	ART	-
dc.citation.volume	4233	-
dc.citation.beginningpage	703	-
dc.citation.endingpage	712	-
dc.citation.publicationname	NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS BOOK SERIES: LECTURE NOTES IN COMPUTER SCIENCE	-
dc.embargo.liftdate	9999-12-31	-
dc.embargo.terms	9999-12-31	-
dc.contributor.localauthor	Lee, Soo-Young	-
dc.contributor.nonIdAuthor	Barman, P. C.	-
dc.type.journalArticle	Article; Proceedings Paper	-

Appears in Collection: EE-Journal Papers(저널논문)

Files in This Item

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 7 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Non-negative matrix factorization based text mining: Feature extraction and classification

This item is cited by other documents in WoS

KOASAS

Communities & Collections