(A) new similarity measure for categorical attribute-based clustering범주형 속성 기반 군집화를 위한 새로운 유사 측도

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 598
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Myoung-Ho-
dc.contributor.advisor김명호-
dc.contributor.authorKim, Min-
dc.contributor.author김민-
dc.date.accessioned2011-12-13T06:08:27Z-
dc.date.available2011-12-13T06:08:27Z-
dc.date.issued2009-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=327352&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/34887-
dc.description학위논문(석사) - 한국과학기술원 : 전산학전공, 2009. 8., [ v, 22 p. ]-
dc.description.abstractThe problem of finding clusters is widely used in numerous applications, such as pattern recognition, image analysis, market analysis. The important factors that decide cluster quality are the similarity measure and the number of attributes. Similarity measures should be defined with respect to the data types. Existing similarity measures are well applicable to numerical attribute values. However, those measures do not work well when the data is described by categorical attributes, that is, when no inherent similarity measure between values. In high dimensional spaces, conventional clustering algorithms tend to break down because of sparsity of data points. To overcome this difficulty, a subspace clustering approach has been proposed. It is based on the observation that different clusters may exist in different subspaces. In this paper, we propose a new similarity measure for clustering of high dimensional categorical data. The measure is defined based on the fact that a good clustering is one where each cluster should have certain information that can distinguish it with other clusters. We also try to capture on the attribute dependencies. Experimental results on real datasets show clusters obtained by our proposed similarity measure are good enough with respect to clustering accuracy.eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectclustering-
dc.subjectsimilarity measure-
dc.subjectk-means clustering-
dc.subject군집화-
dc.subject유사 측도-
dc.subjectk-평균 군집화-
dc.subjectclustering-
dc.subjectsimilarity measure-
dc.subjectk-means clustering-
dc.subject군집화-
dc.subject유사 측도-
dc.subjectk-평균 군집화-
dc.title(A) new similarity measure for categorical attribute-based clustering-
dc.title.alternative범주형 속성 기반 군집화를 위한 새로운 유사 측도-
dc.typeThesis(Master)-
dc.identifier.CNRN327352/325007-
dc.description.department한국과학기술원 : 전산학전공,-
dc.identifier.uid020063056-
dc.contributor.localauthorKim, Myoung-Ho-
dc.contributor.localauthor김명호-
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0