PLPD: reliable protein localization prediction from imbalanced and overlapped datasets

Cited 42 time in webofscience Cited 0 time in scopus
  • Hit : 569
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLee, KiYoungko
dc.contributor.authorKim, Dae-Wonko
dc.contributor.authorNa, DoKyunko
dc.contributor.authorLee, Kwang-Hyungko
dc.contributor.authorLee, Doheonko
dc.date.accessioned2013-03-08T02:02:21Z-
dc.date.available2013-03-08T02:02:21Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2006-10-
dc.identifier.citationNUCLEIC ACIDS RESEARCH, v.34, no.17, pp.4655 - 4666-
dc.identifier.issn0305-1048-
dc.identifier.urihttp://hdl.handle.net/10203/91787-
dc.description.abstractSubcellular localization is one of the key functional characteristics of proteins. An automatic and efficient prediction method for the protein subcellular localization is highly required owing to the need for large-scale genome analysis. From a machine learning point of view, a dataset of protein localization has several characteristics: the dataset has too many classes (there are more than 10 localizations in a cell), it is a multi-label dataset (a protein may occur in several different subcellular locations), and it is too imbalanced (the number of proteins in each localization is remarkably different). Even though many previous works have been done for the prediction of protein subcellular localization, none of them tackles effectively these characteristics at the same time. Thus, a new computational method for protein localization is eventually needed for more reliable outcomes. To address the issue, we present a protein localization predictor based on D-SVDD (PLPD) for the prediction of protein localization, which can find the likelihood of a specific localization of a protein more easily and more correctly. Moreover, we introduce three measurements for the more precise evaluation of a protein localization predictor. As the results of various datasets which are made from the experiments of Huh et al. (2003), the proposed PLPD method represents a different approach that might play a complimentary role to the existing methods, such as Nearest Neighbor method and discriminate covariant method. Finally, after finding a good boundary for each localization using the 5184 classified proteins as training data, we predicted 138 proteins whose subcellular localizations could not be clearly observed by the experiments of Huh et al. (2003).-
dc.languageEnglish-
dc.publisherOXFORD UNIV PRESS-
dc.subjectAMINO-ACID-COMPOSITION-
dc.subjectSUPPORT VECTOR MACHINES-
dc.subjectSUBCELLULAR LOCATION PREDICTION-
dc.subjectFUNCTIONAL DOMAIN COMPOSITION-
dc.subjectBUDDING YEAST-
dc.subjectSACCHAROMYCES-CEREVISIAE-
dc.subjectSTRUCTURAL CLASSES-
dc.subjectSEQUENCE-
dc.subjectDATABASE-
dc.subjectGENOME-
dc.titlePLPD: reliable protein localization prediction from imbalanced and overlapped datasets-
dc.typeArticle-
dc.identifier.wosid000241427300007-
dc.identifier.scopusid2-s2.0-33750245540-
dc.type.rimsART-
dc.citation.volume34-
dc.citation.issue17-
dc.citation.beginningpage4655-
dc.citation.endingpage4666-
dc.citation.publicationnameNUCLEIC ACIDS RESEARCH-
dc.identifier.doi10.1093/nar/gkl638-
dc.contributor.localauthorLee, Kwang-Hyung-
dc.contributor.localauthorLee, Doheon-
dc.contributor.nonIdAuthorLee, KiYoung-
dc.contributor.nonIdAuthorKim, Dae-Won-
dc.contributor.nonIdAuthorNa, DoKyun-
dc.type.journalArticleArticle-
dc.subject.keywordPlusAMINO-ACID-COMPOSITION-
dc.subject.keywordPlusSUPPORT VECTOR MACHINES-
dc.subject.keywordPlusSUBCELLULAR LOCATION PREDICTION-
dc.subject.keywordPlusFUNCTIONAL DOMAIN COMPOSITION-
dc.subject.keywordPlusBUDDING YEAST-
dc.subject.keywordPlusSACCHAROMYCES-CEREVISIAE-
dc.subject.keywordPlusSTRUCTURAL CLASSES-
dc.subject.keywordPlusSEQUENCE-
dc.subject.keywordPlusDATABASE-
dc.subject.keywordPlusGENOME-
Appears in Collection
BiS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 42 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0