Characterization of Proteins Structural Classes

Understanding the recognition of protein structural classes by amino acid composition(.pdf)

Ivet Bahar 1 2, Ali Rana Atilgan 2, Robert L. Jernigan 1 *, Burak Erman 2
1Molecular Structure Section, Laboratory of Experimental and Computational Biology, Division of Basic Sciences, National Cancer Institute, National Institutes of Health, MSC 5677, Bethesda, Maryland
2Polymer Research Center, Bogazici University, and TUBITAK Advanced Polymeric Materials Research Center, Bebek 80815, Istanbul, Turkey

 

Keywords:

 non-bonded contacts; coordination of amino acids; Kirchhoff matrices; lattice models; singular value decomposition; secondary structure content prediction; contact patterns

Abstract:

Knowledge of amino acid composition, alone, is verified here to be sufficient for recognizing the structural class, a, b, a+b, or a/b of a given protein with an accuracy of 81%. This is supported by results from exhaustive enumerations of all conformations for all sequences of simple, compact lattice models consisting of two types (hydrophobic and polar) of residues. Different compositions exhibit strong affinities for certain folds. Within the limits of validity of the lattice models, two factors appear to determine the choice of particular folds: 1) the coordination numbers of individual sites and 2) the size and geometry of non-bonded clusters. These two properties, collectively termed the distribution of non-bonded contacts, are quantitatively assessed by an eigenvalue analysis of the so-called Kirchhoff or adjacency matrices obtained by considering the non-bonded interactions on a lattice. The analysis permits the identification of conformations that possess the same distribution of non-bonded contacts. Furthermore, some distributions of non-bonded contacts are favored entropically, due to their high degeneracies. Thus, a competition between enthalpic and entropic effects is effective in determining the choice of a distribution for a given composition. Based on these findings, an analysis of non-bonded contacts in protein structures was made. The analysis shows that proteins belonging to the four distinct folding classes exhibit significant differences in their distributions of non-bonded contacts, which more directly explains the success in predicting structural class from amino acid composition. Proteins 29:172-185, 1997. Published 1997 Wiley-Liss, Inc.

Received: 10 February 1997; Accepted: 6 June 1997

Funding Agency: NATO Collaborative Research Grant Project; Grant Number: CRG951240
Funding Agency: Bogazici University Research Funds Project; Grant Number: 96A0430


Projection of proteins onto the framed spanned by the two principle axes u1and u2.  Sixty test structures are displayed of which 30 are - proteins (o) and 30 are  - proteins ().


Loci of amino acids in the space spannedby the three principle axes u1, u2, u3 determined from SVD analysis.
 

 

 

University of Pittsburgh ---------- School of Medicine
3064 Biomedical Science Tower  3501 Fifth Ave., Pittsburgh, PA 15213.     Phone : (412) 648-3333,  Fax: (412) 648-3163

Tel : (412) 648-6671,  Fax: (412) 648-6676