Quantitative Biology > Quantitative Methods
[Submitted on 12 Mar 2015 (this version), latest version 19 Jan 2016 (v2)]
Title:Identifying relevant positions in proteins by Critical Variable Selection
View PDFAbstract:Evolution, in its course, has found many different solutions to the same problem. Focusing on proteins, the amino acid sequence of protein domains performing the same function in different organisms differs markedly. Since the structure and function of proteins are ultimately encoded in the amino acid sequence, multiple sequence alignments (MSA) of homologous protein domains can be used to provide information about this encoding. Regarding each sequence in an MSA as a solution of an optimisation problem, we exploit the MSA to infer the relevance of different positions, thereby identifying a hierarchy of relevant sites. Our method exploits information on coevolution going beyond pairwise correlations. This method, called Critical Variable Selection (CVS), affords predictions that are significantly different from those of methods based on pairwise correlations, and it recovers biologically relevant sites, including highly conserved ones. As compared to other methods based on pairwise correlations, we find, in the analysed cases, that CVS is more efficient in identifying the core of relevant sites, as well as most of the tightest contacts in the protein tertiary structure.
Submission history
From: Silvia Grigolon [view email][v1] Thu, 12 Mar 2015 17:07:19 UTC (1,765 KB)
[v2] Tue, 19 Jan 2016 21:26:33 UTC (3,363 KB)
Current browse context:
q-bio.QM
Change to browse by:
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.