Context-aware multi-token concept recognition for biomedical text mining바이오 텍스트마이닝을 위한 맥락정보를 고려한 다중토큰 개념어의 인식 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 133
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorLee, Doheon-
dc.contributor.advisor이도헌-
dc.contributor.authorKim, Kwangmin-
dc.date.accessioned2022-04-15T01:53:55Z-
dc.date.available2022-04-15T01:53:55Z-
dc.date.issued2021-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=962582&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/294582-
dc.description.abstractConcept recognition is a term that corresponds to the two sequential steps of named entity recognition and named entity normalization, and plays an essential role in the field of bioinformatics. However, the conventional dictionary-based methods did not sufficiently addressed the variation of the concepts in actual use in literature, resulting in the particularly degraded performances in recognition of multi-token concepts. In this paper, we propose a concept recognition method of multi-token biological entities using neural models combined with literature contexts. The key aspect of our method is utilizing the contextual information from the biological knowledge-bases for concept normalization, which is followed by named entity recognition procedure. The model showed improved performances over conventional methods, particularly for multi-token concepts with higher variations. We expect that our model can be utilized for effective concept recognition and variety of natural language processing tasks on bioinformatics.-
dc.languageeng-
dc.titleContext-aware multi-token concept recognition for biomedical text mining-
dc.title.alternative바이오 텍스트마이닝을 위한 맥락정보를 고려한 다중토큰 개념어의 인식 기법-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :바이오및뇌공학과,-
dc.description.isOpenAccess학위논문(박사) - 한국과학기술원 : 바이오및뇌공학과, 2021.8,[iv, 55 p. :]-
dc.publisher.country한국과학기술원-
dc.type.journalArticleThesis(Ph.D)-
dc.contributor.alternativeauthor김광민-
dc.subject.keywordAuthorConcept Recognition▼aEntity Normalization▼aGene Ontology▼aNamed Entity Recognition▼aLanguage Model-
dc.subject.keywordAuthor개념어 인식▼a개체명 정규화▼a유전자 온톨로지▼a개체명 인식▼a언어 모델-
Appears in Collection
BiS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0