Machine learning approach for anonymizing electronic medical records전자의무기록의 기계학습 기반 익명화 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 652
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorLee, Do-Heon-
dc.contributor.advisor이도헌-
dc.contributor.authorShin, Moon-Shik-
dc.contributor.author신문식-
dc.date.accessioned2015-04-23T07:08:25Z-
dc.date.available2015-04-23T07:08:25Z-
dc.date.issued2012-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=568077&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/197197-
dc.description학위논문(석사) - 한국과학기술원 : 로봇공학학제전공, 2012.2, [ vi, 38 p. ]-
dc.description.abstractElectronic Medical Records (EMRs) enable the sharing of patient medical data whenever it is needed and also are used as a tool for building new medical technology and patient recommendation systems. Since EMRs include patients’ private data there exist restriction to researchers for access. Thus an anonymizing technique is necessary that keeps patients’ private data safe while undamaging useful medical information. Conventional research has been focusing on de-identification which can lead to unexpected privacy exposure issue. To prevent unexpected privacy exposure issues anonymization techniques based on k-anonymity has been previously introduced. k-member clustering anonymization is a technique that approaches the k-anonymization as a clustering issue. The objective of the k-member clustering problem is to gather (i.e. cluster) records that will minimize the data distortion during data generalization process. Most of the clustering techniques include random seed selection and iteration process to gather record that gives minimum information distortion. However, dealing with massive medical patient dataset, randomly selecting a cluster seed will provide inconsistent performance. This paper proposes a seed selection method based on closeness centrality which not only provides consistent information loss but at the same time reduces the information loss and execution time. We experimentally compare our algorithm with two previous studies. The experiments show that our algorithm provides better performance with respect to information loss.eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectK-anonymity-
dc.subject정보 손실량-
dc.subject근접 중심성 분석-
dc.subjectseed 선정 알고리즘-
dc.subjectk-요소 군집화 재식별 방지-
dc.subjectk-재식별 방지-
dc.subjectK-member clustering anonymization-
dc.subjectSeed selection algorithm-
dc.subjectCloseness Centrality-
dc.subjectInformation Loss-
dc.titleMachine learning approach for anonymizing electronic medical records-
dc.title.alternative전자의무기록의 기계학습 기반 익명화 기법-
dc.typeThesis(Master)-
dc.identifier.CNRN568077/325007 -
dc.description.department한국과학기술원 : 로봇공학학제전공, -
dc.identifier.uid020103336-
dc.contributor.localauthorLee, Do-Heon-
dc.contributor.localauthor이도헌-
Appears in Collection
RE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0