Construction of phenotypic similarity network of rare diseases and candidate disease gene prediction using RWRHN = 희귀질환의 증상유사네트워크 구축과 RWRHN 방법을 사용한 질병유전자 예측

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 235
  • Download : 0
Recent studies such as text-mining against huge biomedical databases, devising ontology similarity measure, and predicting candidate disease genes using various methods has been applied on the rare disease field. In this study, we aim to devise and compare models of prioritizing candidate rare disease genes. Disease phenotype, which is defined as an every possible observation from patient to diagnose primarily, plays an important role in inferring novel disease related genes recently. Therefore we compute pairwise phenotypic similarity of rare diseases using 3 similarity measures which are Wang, Resnik, and Cosine. The rare disease ontology was extracted from OMIM and Orphanet databases. Next we construct rare disease similarity network where nodes are rare diseases and edges are connected when the similarity value of two disease nodes are higher than cutoff. To group phenotypically similar diseases, we use two strategy which is collecting neighboring disease of query disease and clustering on the rare disease similarity network. Phenotypically similar diseases of query disease is further used as an input to predict potential disease genes. We construct heterogeneous network which consists of physical protein-protein interaction network (PPI) from BioGrid, phenotypic rare disease similarity network, and bipartite disease gene association network obtained from HPO database. In the gene prediction step, we iteratively simulate the random walker on the heterogeneous network using RWRHN algorithm, and the resulting probability of potential candidate rare disease genes to be related to the query diseases are prioritized. We evaluate 6 rare disease gene prediction model which is from a combination of 3 ontology similarity measure and 2 disease grouping methods by employing 10-fold cross validation and by calculating ROAUC. Finally we analyze factors that influence the performance of 6 models.
Kim, Dong Supresearcher김동섭researcher
한국과학기술원 :바이오및뇌공학과,
Issue Date

학위논문(석사) - 한국과학기술원 : 바이오및뇌공학과, 2017.2,[iv, 30 p. :]


Rare disease; phenotypic similarity disease network; heterogeneous network integration; random walk with restart algorithm; disease gene prediction; 희귀질환; 증상 유자 질환 네트워크; 혼성네트워크 통합; 재시작 랜덤워크 알고리즘; 질병 유전자 예측

Appears in Collection
Files in This Item
There are no files associated with this item.


  • mendeley


rss_1.0 rss_2.0 atom_1.0