Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\27810958
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 Proc+Natl+Acad+Sci+U+S+A
2016 ; 113
(47
): E7438-E7447
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Tertiary alphabet for the observable protein structural universe
#MMPMID27810958
Mackenzie CO
; Zhou J
; Grigoryan G
Proc Natl Acad Sci U S A
2016[Nov]; 113
(47
): E7438-E7447
PMID27810958
show ga
Here, we systematically decompose the known protein structural universe into its
basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a
compact backbone fragment that captures the secondary, tertiary, and quaternary
environments around a given residue, comprising one or more disjoint segments
(three on average). We seek the set of universal TERMs that capture all structure
in the Protein Data Bank (PDB), finding remarkable degeneracy. Only ?600 TERMs
are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However,
more rare geometries also exist, and the overall structural coverage grows
logarithmically with the number of TERMs. We go on to show that universal TERMs
provide an effective mapping between sequence and structure. We demonstrate that
TERM-based statistics alone are sufficient to recapitulate close-to-native
sequences given either NMR or X-ray backbones. Furthermore, sequence variability
predicted from TERM data agrees closely with evolutionary variation. Finally,
locations of TERMs in protein chains can be predicted from sequence alone based
on sequence signatures emergent from TERM instances in the PDB. For multisegment
motifs, this method identifies spatially adjacent fragments that are not
contiguous in sequence-a major bottleneck in structure prediction. Although all
TERMs recur in diverse proteins, some appear specialized for certain functions,
such as interface formation, metal coordination, or even water binding.
Structural biology has benefited greatly from previously observed degeneracies in
structure. The decomposition of the known structural universe into a finite set
of compact TERMs offers exciting opportunities toward better understanding,
design, and prediction of protein structure.