A method using candidate exploration and ranking for abbreviation resolution in clinical documents후보 탐색과 랭킹을 이용한 임상 의료문서 내의 약어 처리 방법 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 870
  • Download : 0
In biomedical texts, abbreviations are frequently used due to their inclusion of many technical expressions of some length. Accordingly, appropriate recognition of abbreviations and their full form pairs is essential task in automatic text processing of biomedical documents. However, unlike biomedical literatures, clinical notes have many abbreviations without full form indicated in the text or without standard definition in dictionaries due to the nature of the documents. This causes difficulties in adapting traditional approaches for abbreviation disambiguation such as classification among fixed candidates or pattern-based definition extraction. Because of this reason, we consider the task as search problem and propose an approach with two steps: a) exploring possible full form candidates from various resources and b) choosing most acceptable one among retrieved candidates by ranking. To discover full form candidates and extract features of them, we exploited external academic resources such as MEDLINE and UMLS as well as clinical note corpus itself. To rank the candidates properly by consulting human criteria, we adopted RankBoost, one of learning to rank models developed from information retrieval and machine learning societies. Results show the suggested two-step approach has potential on this kind of task and propose another possible application of learning to rank models.
Advisors
Myaeng, Sung-Hyonresearcher맹성현
Description
한국과학기술원 : 전산학과,
Publisher
한국과학기술원
Issue Date
2012
Identifier
509479/325007  / 020104291
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학과, 2012.8, [ iv, 36 p. ]

Keywords

Abbreviation Resolution; Learning to Rank; 약어 처리; Learning to Rank; 의료문서처리; Medical Text Processing

URI
http://hdl.handle.net/10203/180471
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=509479&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0