DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kim, Myoung-Ho | - |
dc.contributor.advisor | 김명호 | - |
dc.contributor.author | Kim, Min | - |
dc.contributor.author | 김민 | - |
dc.date.accessioned | 2011-12-13T06:08:27Z | - |
dc.date.available | 2011-12-13T06:08:27Z | - |
dc.date.issued | 2009 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=327352&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/34887 | - |
dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : Computer Science Major, 2009. 8., [ v, 22 p. ] | - |
dc.description.abstract | Clustering is widely used in numerous applications, such as pattern recognition, image analysis, and market analysis. The important factors that determine cluster quality are the similarity measure and the number of attributes. Similarity measures should be defined with respect to the data types. Existing similarity measures work well for numerical attribute values. However, those measures do not work well when the data is described by categorical attributes, that is, when there is no inherent similarity measure between values. In high-dimensional spaces, conventional clustering algorithms tend to break down because of the sparsity of data points. To overcome this difficulty, a subspace clustering approach has been proposed. It is based on the observation that different clusters may exist in different subspaces. In this paper, we propose a new similarity measure for clustering high-dimensional categorical data. The measure is based on the idea that, in a good clustering, each cluster should carry information that distinguishes it from other clusters. We also try to capture the attribute dependencies. Experimental results on real datasets show that the clusters obtained with our proposed similarity measure are of good quality with respect to clustering accuracy. | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | clustering | - |
dc.subject | similarity measure | - |
dc.subject | k-means clustering | - |
dc.subject | 군집화 | - |
dc.subject | 유사 측도 | - |
dc.subject | k-평균 군집화 | - |
dc.title | (A) new similarity measure for categorical attribute-based clustering | - |
dc.title.alternative | 범주형 속성 기반 군집화를 위한 새로운 유사 측도 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 327352/325007 | - |
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST) : Computer Science Major | - |
dc.identifier.uid | 020063056 | - |
dc.contributor.localauthor | Kim, Myoung-Ho | - |
dc.contributor.localauthor | 김명호 | - |