Context-aware multi-token concept recognition of biological entities

Cited 3 times in Web of Science; cited 0 times in Scopus
Background: Concept recognition comprises two sequential steps, named entity recognition and named entity normalization, and plays an essential role in bioinformatics. However, conventional dictionary-based methods have not sufficiently addressed the variation of concepts as they are actually used in the literature, resulting in degraded performance, particularly in the recognition of multi-token concepts.
Results: In this paper, we propose a concept recognition method for multi-token biological entities using neural models combined with literature contexts. The key aspect of our method is the use of contextual information from biological knowledge bases for concept normalization, which is followed by a named entity recognition procedure. The model showed improved performance over conventional methods, particularly for multi-token concepts with higher variation.
Conclusions: We expect that our model can be utilized for effective concept recognition and a variety of natural language processing tasks in bioinformatics.
Publisher
BMC
Issue Date
2021-10
Language
English
Article Type
Article
Citation

BMC BIOINFORMATICS, v.22, no.SUPPL 11

ISSN
1471-2105
DOI
10.1186/s12859-021-04248-8
URI
http://hdl.handle.net/10203/288971
Appears in Collection
BiS-Journal Papers
Files in This Item
122262.pdf (1.4 MB)