Context-aware multi-token concept recognition for biomedical text mining바이오 텍스트마이닝을 위한 맥락정보를 고려한 다중토큰 개념어의 인식 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 132
  • Download : 0
Concept recognition is a term that corresponds to the two sequential steps of named entity recognition and named entity normalization, and plays an essential role in the field of bioinformatics. However, the conventional dictionary-based methods did not sufficiently addressed the variation of the concepts in actual use in literature, resulting in the particularly degraded performances in recognition of multi-token concepts. In this paper, we propose a concept recognition method of multi-token biological entities using neural models combined with literature contexts. The key aspect of our method is utilizing the contextual information from the biological knowledge-bases for concept normalization, which is followed by named entity recognition procedure. The model showed improved performances over conventional methods, particularly for multi-token concepts with higher variations. We expect that our model can be utilized for effective concept recognition and variety of natural language processing tasks on bioinformatics.
Advisors
Lee, Doheonresearcher이도헌researcher
Description
한국과학기술원 :바이오및뇌공학과,
Country
한국과학기술원
Issue Date
2021
Identifier
325007
Language
eng
Article Type
Thesis(Ph.D)
URI
http://hdl.handle.net/10203/294582
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=962582&flag=dissertation
Appears in Collection
BiS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0