Cargando…

CoeViz 2: Protein Graphs Derived From Amino Acid Covariance

Proteins by and large carry out their molecular functions in a folded state when residues, distant in sequence, assemble together in 3D space to bind a ligand, catalyze a reaction, form a channel, or exert another concerted macromolecular interaction. It has been long recognized that covariance of a...

Descripción completa

Detalles Bibliográficos
Autores principales: Corcoran, Daniel, Maltbie, Nicholas, Sudalairaj, Shivchander, Baker, Frazier N., Hirschfeld, Joseph, Porollo, Aleksey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9187035/
https://www.ncbi.nlm.nih.gov/pubmed/35694032
http://dx.doi.org/10.3389/fbinf.2021.653681
_version_ 1784725079876173824
author Corcoran, Daniel
Maltbie, Nicholas
Sudalairaj, Shivchander
Baker, Frazier N.
Hirschfeld, Joseph
Porollo, Aleksey
author_facet Corcoran, Daniel
Maltbie, Nicholas
Sudalairaj, Shivchander
Baker, Frazier N.
Hirschfeld, Joseph
Porollo, Aleksey
author_sort Corcoran, Daniel
collection PubMed
description Proteins by and large carry out their molecular functions in a folded state when residues, distant in sequence, assemble together in 3D space to bind a ligand, catalyze a reaction, form a channel, or exert another concerted macromolecular interaction. It has been long recognized that covariance of amino acids between distant positions within a protein sequence allows for the inference of long range contacts to facilitate 3D structure modeling. In this work, we investigated whether covariance analysis may reveal residues involved in the same molecular function. Building upon our previous work, CoeViz, we have conducted a large scale covariance analysis among 7,595 non-redundant proteins with resolved 3D structures to assess 1) whether the residues with the same function coevolve, 2) which covariance metric captures such couplings better, and 3) how different molecular functions compare in this context. We found that the chi-squared metric is the most informative for the identification of coevolving functional sites, followed by the Pearson correlation-based, whereas mutual information is the least informative. Of the seven categories of the most common natural ligands, including coenzyme A, dinucleotide, DNA/RNA, heme, metal, nucleoside, and sugar, the trace metal binding residues display the most prominent coupling, followed by the sugar binding sites. We also developed a web-based tool, CoeViz 2, that enables the interactive visualization of covarying residues as cliques from a larger protein graph. CoeViz 2 is publicly available at https://research.cchmc.org/CoevLab/.
format Online
Article
Text
id pubmed-9187035
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-91870352022-06-10 CoeViz 2: Protein Graphs Derived From Amino Acid Covariance Corcoran, Daniel Maltbie, Nicholas Sudalairaj, Shivchander Baker, Frazier N. Hirschfeld, Joseph Porollo, Aleksey Front Bioinform Bioinformatics Proteins by and large carry out their molecular functions in a folded state when residues, distant in sequence, assemble together in 3D space to bind a ligand, catalyze a reaction, form a channel, or exert another concerted macromolecular interaction. It has been long recognized that covariance of amino acids between distant positions within a protein sequence allows for the inference of long range contacts to facilitate 3D structure modeling. In this work, we investigated whether covariance analysis may reveal residues involved in the same molecular function. Building upon our previous work, CoeViz, we have conducted a large scale covariance analysis among 7,595 non-redundant proteins with resolved 3D structures to assess 1) whether the residues with the same function coevolve, 2) which covariance metric captures such couplings better, and 3) how different molecular functions compare in this context. We found that the chi-squared metric is the most informative for the identification of coevolving functional sites, followed by the Pearson correlation-based, whereas mutual information is the least informative. Of the seven categories of the most common natural ligands, including coenzyme A, dinucleotide, DNA/RNA, heme, metal, nucleoside, and sugar, the trace metal binding residues display the most prominent coupling, followed by the sugar binding sites. We also developed a web-based tool, CoeViz 2, that enables the interactive visualization of covarying residues as cliques from a larger protein graph. CoeViz 2 is publicly available at https://research.cchmc.org/CoevLab/. Frontiers Media S.A. 2021-06-24 /pmc/articles/PMC9187035/ /pubmed/35694032 http://dx.doi.org/10.3389/fbinf.2021.653681 Text en Copyright © 2021 Corcoran, Maltbie, Sudalairaj, Baker, Hirschfeld and Porollo. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Corcoran, Daniel
Maltbie, Nicholas
Sudalairaj, Shivchander
Baker, Frazier N.
Hirschfeld, Joseph
Porollo, Aleksey
CoeViz 2: Protein Graphs Derived From Amino Acid Covariance
title CoeViz 2: Protein Graphs Derived From Amino Acid Covariance
title_full CoeViz 2: Protein Graphs Derived From Amino Acid Covariance
title_fullStr CoeViz 2: Protein Graphs Derived From Amino Acid Covariance
title_full_unstemmed CoeViz 2: Protein Graphs Derived From Amino Acid Covariance
title_short CoeViz 2: Protein Graphs Derived From Amino Acid Covariance
title_sort coeviz 2: protein graphs derived from amino acid covariance
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9187035/
https://www.ncbi.nlm.nih.gov/pubmed/35694032
http://dx.doi.org/10.3389/fbinf.2021.653681
work_keys_str_mv AT corcorandaniel coeviz2proteingraphsderivedfromaminoacidcovariance
AT maltbienicholas coeviz2proteingraphsderivedfromaminoacidcovariance
AT sudalairajshivchander coeviz2proteingraphsderivedfromaminoacidcovariance
AT bakerfraziern coeviz2proteingraphsderivedfromaminoacidcovariance
AT hirschfeldjoseph coeviz2proteingraphsderivedfromaminoacidcovariance
AT porolloaleksey coeviz2proteingraphsderivedfromaminoacidcovariance