DSpace at KOASAS: Prediction of protein secondary structure content using amino acid composition an evolutionary information

DSpace at KOASAS

College of Engineering(공과대학)Dept. of Bio and Brain Engineering(바이오및뇌공학과)BiS-Journal Papers(저널논문)

Prediction of protein secondary structure content using amino acid composition an evolutionary information

Cited 45 time in

Cited 0 time in

Hit : 363
Download : 0

Export

Lee, S / Lee, BC / Kim, Dongsup researcher

Knowing protein structure and inferring its function from the structure are one of the main issues of computational structural biology, and often the first step is studying protein secondary structure. There have been many attempts to predict protein secondary structure contents. Previous attempts assumed that the content of protein secondary structure can be predicted successfully using the information on the amino acid composition of a protein. Recent methods achieved remarkable prediction accuracy by using the expanded composition information. The overall average error of the most successful method is 3.4%. Here, we demonstrate that even if we only use the simple amino acid composition information alone, it is possible to improve the prediction accuracy significantly if the evolutionary information is included. The idea is motivated by the observation that evolutionarily related proteins share the similar structure. After calculating the homolog-averaged amino acid composition of a protein, which can be easily obtained from the multiple sequence alignment by running PSI-BLAST, those 20 numbers are learned by a multiple linear regression, an artificial neural network and a support vector regression. The overall average error of method by a support vector regression is 3.3%. It is remarkable that we obtain the comparable accuracy without utilizing the expanded composition information such as pair-coupled amino acid composition. This work again demonstrates that the amino acid composition is a fundamental characteristic of a protein. It is anticipated that our novel idea can be applied to many areas of protein bioinformatics where the amino acid composition information is utilized, such as subcellular localization prediction, enzyme subclass prediction, domain boundary prediction, signal sequence prediction, and prediction of unfolded segment in a protein sequence, to name a few.

Publisher: WILEY-LISS

Issue Date: 2006-03

Language: English

Article Type: Article

Keywords: CIRCULAR-DICHROISM SPECTRA; NEURAL-NETWORK; CLASSIFICATION; DATABASE; SEQUENCES; ACCURACY; PARADOX

Citation: PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, v.62, no.4, pp.1107 - 1114

ISSN: 0887-3585

URI: http://hdl.handle.net/10203/91244

Appears in Collection: BiS-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 45 items in WoS	Click to see citing articles in

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Prediction of protein secondary structure content using amino acid composition an evolutionary information

This item is cited by other documents in WoS

KOASAS

Communities & Collections