Variable grouping by CART and combination of marginal models for large scale modellingCART를 활용한 변수 군집화와 주변 모형 결합에 의한 거대모형 개발

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 518
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Sung-Ho-
dc.contributor.advisor김성호-
dc.contributor.authorKim, Yoon-Jung-
dc.contributor.author김윤정-
dc.date.accessioned2011-12-14T04:55:18Z-
dc.date.available2011-12-14T04:55:18Z-
dc.date.issued2005-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=243535&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/42120-
dc.description학위논문(석사) - 한국과학기술원 : 응용수학전공, 2005.2, [ vi, 50 p. ]-
dc.description.abstractThese days we are exposed to huge data, some of which has relations each other. But it is different to find them. Data mining is a series of procedure which extracts information by exploring and modelling the relationships within such data and CART is one of the most popular tools for Data Mining. It develops for us a classification tree for categorical response variables and a regression tree for continuous response variables. The trees are developed in such a way that predictor variables are selected one after another in the order of the information amount that a predictor variable has for the response variable, where the information amount is computed conditional on the outcome of the predictor variables that are already selected in the tree construction process. Our goal in this thesis is finding a model structure for a large set of random variables, some of which are continuous and the rest are categorical. While CART is useful for a supervised learning, log-linear modelling is an unsupervised learning. We use CART at an initial stage of large scale modelling for the purpose of selecting subgroups of the random variables that are involved in the whole data set. Since CART is available to a data set of many random variables of mixed type, easy to apply, and easy to interpret the result of analysis, we can easily group the variables so that the variables in a group are associated highly with each other. Once groups of random variables are obtained, we then apply log-linear modelling to individual groups and obtain graphical log-linear models whose model structures are rep-resentable via graphs of vertices and edges. From each graphical model, we find particular types of graph separators called "prime separators", which are each defined as a graph separator which separates cliques or irreducible cycles. The prime separators have a nice property that they remain as prime separators both in a graphical model and its marginal model. This property is used in comb...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectContinuous and categrical variable-
dc.subjectLarge scale modelling-
dc.subjectVariable grouping-
dc.subjectCART-
dc.subjectseparator-
dc.subject분리자-
dc.subject연속과 이산 변수-
dc.subject거대 모델링-
dc.subject변수 군집화-
dc.subject카트-
dc.titleVariable grouping by CART and combination of marginal models for large scale modelling-
dc.title.alternativeCART를 활용한 변수 군집화와 주변 모형 결합에 의한 거대모형 개발-
dc.typeThesis(Master)-
dc.identifier.CNRN243535/325007 -
dc.description.department한국과학기술원 : 응용수학전공, -
dc.identifier.uid020033133-
dc.contributor.localauthorKim, Sung-Ho-
dc.contributor.localauthor김성호-
Appears in Collection
MA-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0