Tyagi, M and de Brevern, AG and Srinivasan, N and Offmann, B (2008) Protein structure mining using a structural alphabet. In: Proteins: Structure, Function, and Bioinformatics, 71 (2). pp. 920-937.
dsd.pdf - Published Version
Restricted to Registered users only
Download (729Kb) | Request a copy
We present a comprehensive evaluation of a new structure mining method called PB-ALIGN. It is based on the encoding of protein structure as 1D sequence of a combination of 16 short structural motifs or protein blocks (PBs). PBs are short motifs capable of representing most of the local structural features of a protein backbone. Using derived PB substitution matrix and simple dynamic programming algorithm, PB sequences are aligned the same way amino acid sequences to yield structure alignment. PBs are short motifs capable of representing most of the local structural features of a protein backbone. Alignment of these local features as sequence of symbols enables fast detection of structural similarities between two proteins. Ability of the method to characterize and align regions beyond regular secondary structures, for example, N and C caps of helix and loops connecting regular structures, puts it a step ahead of existing methods, which strongly rely on secondary structure elements. PB-ALIGN achieved efficiency of 85% in extracting true fold from a large database of 7259 SCOP domains and was successful in 82% cases to identify true super-family members. On comparison to 13 existing structure comparison/mining methods, PB-ALIGN emerged as the best on general ability test dataset and was at par with methods like YAKUSA and CE on nontrivial test dataset. Furthermore, the proposed method performed well when compared to flexible structure alignment method like FATCAT and outperforms in processing speed (less than 45 s per database scan). This work also establishes a reliable cut-off value for the demarcation of similar folds. It finally shows that global alignment scores of unrelated structures using PBs follow an extreme value distribution. PB-ALIGN is freely available on web server called Protein Block Expert (PBE) at http://bioinformatics.univ-reunion.fr/PBE/.
|Item Type:||Journal Article|
|Additional Information:||Copyright of this article belongs to John Wiley & Sons.|
|Keywords:||Substitution matrix;protein blocks;local protein structure;structure mining;local alignment;global alignment;structure comparison.|
|Department/Centre:||Division of Biological Sciences > Molecular Biophysics Unit|
|Date Deposited:||17 Sep 2008 04:48|
|Last Modified:||19 Sep 2010 04:49|
Actions (login required)