Machine learning approach for anonymizing electronic medical records전자의무기록의 기계학습 기반 익명화 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 638
  • Download : 0
Electronic Medical Records (EMRs) enable the sharing of patient medical data whenever it is needed and also are used as a tool for building new medical technology and patient recommendation systems. Since EMRs include patients’ private data there exist restriction to researchers for access. Thus an anonymizing technique is necessary that keeps patients’ private data safe while undamaging useful medical information. Conventional research has been focusing on de-identification which can lead to unexpected privacy exposure issue. To prevent unexpected privacy exposure issues anonymization techniques based on k-anonymity has been previously introduced. k-member clustering anonymization is a technique that approaches the k-anonymization as a clustering issue. The objective of the k-member clustering problem is to gather (i.e. cluster) records that will minimize the data distortion during data generalization process. Most of the clustering techniques include random seed selection and iteration process to gather record that gives minimum information distortion. However, dealing with massive medical patient dataset, randomly selecting a cluster seed will provide inconsistent performance. This paper proposes a seed selection method based on closeness centrality which not only provides consistent information loss but at the same time reduces the information loss and execution time. We experimentally compare our algorithm with two previous studies. The experiments show that our algorithm provides better performance with respect to information loss.
Advisors
Lee, Do-Heonresearcher이도헌
Description
한국과학기술원 : 로봇공학학제전공,
Publisher
한국과학기술원
Issue Date
2012
Identifier
568077/325007  / 020103336
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 로봇공학학제전공, 2012.2, [ vi, 38 p. ]

Keywords

K-anonymity; 정보 손실량; 근접 중심성 분석; seed 선정 알고리즘; k-요소 군집화 재식별 방지; k-재식별 방지; K-member clustering anonymization; Seed selection algorithm; Closeness Centrality; Information Loss

URI
http://hdl.handle.net/10203/197197
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=568077&flag=dissertation
Appears in Collection
RE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0