Linear predictive coding representation of correlated mutation for protein sequence alignment

Background: Both conservation and correlated mutation (CM) carry important information, reflecting different kinds of context in a multiple sequence alignment, yet most alignment methods use sequence profiles that represent only conservation. There is not yet a general way to represent correlated mutation and incorporate it into sequence alignment. Methods: We develop a novel method, the CM profile, which represents correlated mutation as a spectral feature derived by linear predictive coding, so that correlated mutations among different positions are represented by a fixed number of values. We combine the CM profile with a conventional sequence profile to improve alignment quality. Results: For distantly related protein pairs, using the CM profile improves profile-profile alignment both with and without predicted secondary structure. In particular, at the superfamily level, combining the CM profile with the sequence profile improves profile-profile alignment by 9.5%, whereas adding predicted secondary structure improves it by 6.0%. More significantly, using both improves profile-profile alignment by 13.9%. We also illustrate the effectiveness of the CM profile by showing that the resulting alignment preserves shared coevolution and contacts. Conclusions: In this work, we introduce a novel method, the CM profile, which represents correlated mutation information in a parallel form, and apply it to the protein sequence alignment problem. When combined with a conventional sequence profile, the CM profile improves alignment quality significantly more than predicted secondary structure information does, which should benefit target-template alignment in protein structure prediction. Because the CM profile is general, it can be used in other bioinformatics applications in the same way as a sequence profile.
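To make the Methods statement concrete, the following is a minimal sketch of how linear predictive coding can compress a variable-length signal into a fixed number of values. It is not the authors' implementation: the function names (`autocorrelation`, `lpc_coefficients`, `cm_profile`), the use of one row of a correlated-mutation score matrix as the signal for each position, and the use of raw LPC coefficients in place of whatever spectral feature the paper derives are all assumptions made for illustration.

```python
def autocorrelation(signal, max_lag):
    """Return the (biased) autocorrelation values r[0..max_lag] of a sequence."""
    n = len(signal)
    return [
        sum(signal[t] * signal[t + lag] for t in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def lpc_coefficients(signal, order):
    """Estimate LPC coefficients with the Levinson-Durbin recursion.

    Returns (a, err) such that signal[t] is approximated by
    sum(a[k] * signal[t - 1 - k] for k in range(order)),
    where err is the final prediction error energy.
    """
    r = autocorrelation(signal, order)
    if r[0] == 0:
        # An all-zero signal carries no information to predict.
        return [0.0] * order, 0.0
    a = [0.0] * order
    err = r[0]
    for i in range(order):
        # Reflection coefficient for model order i + 1.
        k = (r[i + 1] - sum(a[j] * r[i - j] for j in range(i))) / err
        updated = a[:]
        updated[i] = k
        for j in range(i):
            updated[j] = a[j] - k * a[i - 1 - j]
        a = updated
        err *= 1.0 - k * k
        if err <= 0:
            # The signal is already predicted exactly; higher orders stay zero.
            break
    return a, err


def cm_profile(cm_matrix, order):
    """Hypothetical CM profile: one fixed-length LPC vector per position.

    cm_matrix[i][j] is assumed to hold a correlated-mutation score
    (for example, mutual information) between alignment columns i and j.
    """
    return [lpc_coefficients(row, order)[0] for row in cm_matrix]


if __name__ == "__main__":
    # Toy symmetric score matrix for a four-column alignment (made-up values).
    scores = [
        [0.0, 0.2, 0.1, 0.4],
        [0.2, 0.0, 0.3, 0.1],
        [0.1, 0.3, 0.0, 0.2],
        [0.4, 0.1, 0.2, 0.0],
    ]
    for position, vector in enumerate(cm_profile(scores, order=2)):
        print(position, vector)
```

Whatever the alignment length, each position ends up with exactly `order` values, which is what allows such a representation to sit alongside a sequence profile column by column. How the two profiles are weighted during alignment scoring is not stated in the abstract and is left out of the sketch.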
Publisher
BIOMED CENTRAL LTD
Issue Date
2010-04
Language
English
Article Type
Article; Proceedings Paper
Keywords

MUTUAL INFORMATION; SECONDARY STRUCTURE; CONTACT PREDICTION; QUALITY; DATABASE; SIMILARITY; PHYLOGENY; RESIDUES; FAMILIES; IMPROVES

Citation

BMC BIOINFORMATICS, v.11, no.sup.2

ISSN
1471-2105
DOI
10.1186/1471-2105-11-S2-S2
URI
http://hdl.handle.net/10203/98885
Appears in Collection
BiS-Journal Papers (Journal Papers)