Building Text-mining Framework for Gene-Phenotype Relation Extraction using Deep Leaning

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 258
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorJang, Dongjinko
dc.contributor.authorLee, Jaehyunko
dc.contributor.authorKim, Kwangminko
dc.contributor.authorLee, Doheonko
dc.date.accessioned2020-03-19T05:22:25Z-
dc.date.available2020-03-19T05:22:25Z-
dc.date.created2020-02-20-
dc.date.issued2015-10-
dc.identifier.citationthe ACM Ninth International Workshop on Data and Text Mining in Biomedical Informatics in conjunction with CIKM, pp.17-
dc.identifier.urihttp://hdl.handle.net/10203/273128-
dc.description.abstractThe scientific literature is a rich resource for information retrieval on the biological knowledge. Nevertheless, the unstructured textual data in the research articles makes it difficult to access the information with computer-aided systems. Text-mining is one of the solution that can transform unstructured information in the text into database content, and most of the approaches are based on the machine learning models. Since these approaches require high-dimensional features, the performance of the model is heavily dependent on the selection of features. However, it is usually difficult and labor-intensive to choose good features, because feature extraction requires prior knowledge and ingenuity of human experts. Here, we suggest a novel framework to extract biological relations from the texts by using hierarchical text features that enhance the effectiveness of relation extraction model. The proposed framework is composed of two parts, node and edge detection, using deep belief networks. Each part is based on the hierarchical text features learned by Gaussian-Bernoulli restricted Boltzmann machine (GBRBM). In this work, we performed gene-cancer relation extraction task as a pilot study. The classification model was trained based on both GE09 corpus from BioNLP'09 Shared Task and CoMAGC corpus. The results show that our model achieved better performance than other handcrafted feature-based approaches. The evaluation results suggest that deep belief networks offers the optimized and generalized hierarchical text features for the large-scale text mining.-
dc.languageEnglish-
dc.publisherACM Press-
dc.titleBuilding Text-mining Framework for Gene-Phenotype Relation Extraction using Deep Leaning-
dc.typeConference-
dc.type.rimsCONF-
dc.citation.beginningpage17-
dc.citation.publicationnamethe ACM Ninth International Workshop on Data and Text Mining in Biomedical Informatics in conjunction with CIKM-
dc.identifier.conferencecountryAT-
dc.identifier.conferencelocationMelbourne, Australia-
dc.identifier.doi10.1145/2811163.2811165-
dc.contributor.localauthorLee, Doheon-
dc.contributor.nonIdAuthorJang, Dongjin-
dc.contributor.nonIdAuthorLee, Jaehyun-
dc.contributor.nonIdAuthorKim, Kwangmin-
Appears in Collection
BiS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0