Proteome-wide discovery of evolutionary conserved sequences in disordered regions

Alex N. Nguyen Ba, Brian J. Yeh, Dewald Van Dyk, Alan R. Davidson, Brenda J. Andrews, Eric L. Weiss, Alan M. Moses*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

103 Scopus citations

Abstract

At least 30% of human proteins are thought to contain intrinsically disordered regions, which lack stable structural conformation. Despite lacking enzymatic functions and having few protein domains, disordered regions are functionally important for protein regulation and contain short linear motifs (short peptide sequences involved in protein-protein interactions), but in most disordered regions, the functional amino acid residues remain unknown. We searched for evolutionarily conserved sequences within disordered regions according to the hypothesis that conservation would indicate functional residues. Using a phylogenetic hidden Markov model (phylo-HMM), we made accurate, specific predictions of functional elements in disordered regions even when these elements are only two or three amino acids long. Among the conserved sequences that we identified were previously known and newly identified short linear motifs, and we experimentally verified key examples, including a motif that may mediate interaction between protein kinase Cbk1 and its substrates.We also observed that hub proteins, which interact with many partners in a protein interaction network, are highly enriched in these conserved sequences. Our analysis enabled the systematic identification of the functional residues in disordered regions and suggested that at least 5% of amino acids in disordered regions are important for function.

Original languageEnglish (US)
Article numberrs1
JournalScience Signaling
Volume5
Issue number215
DOIs
StatePublished - Mar 13 2012

ASJC Scopus subject areas

  • Molecular Biology
  • Biochemistry
  • Cell Biology

Fingerprint

Dive into the research topics of 'Proteome-wide discovery of evolutionary conserved sequences in disordered regions'. Together they form a unique fingerprint.

Cite this