Cargando…
Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data
The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functi...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3079720/ https://www.ncbi.nlm.nih.gov/pubmed/21526183 http://dx.doi.org/10.1371/journal.pone.0018607 |
_version_ | 1782202053777424384 |
---|---|
author | Mazandu, Gaston K. Mulder, Nicola J. |
author_facet | Mazandu, Gaston K. Mulder, Nicola J. |
author_sort | Mazandu, Gaston K. |
collection | PubMed |
description | The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functional connections between protein pairs, thus enabling functional inferences for uncharacterized proteins. These biological data are primarily in the form of sequences, which determine functions, although functional properties of a protein can often be predicted from just the domains it contains. Thus, protein sequences and domains can be used to predict protein pair-wise functional relationships, and thus contribute to the function prediction process of uncharacterized proteins in order to ensure that knowledge is gained from sequencing efforts. In this work, we introduce information-theoretic based approaches to score protein-protein functional interaction pairs predicted from protein sequence similarity and conserved protein signature matches. The proposed schemes are effective for data-driven scoring of connections between protein pairs. We applied these schemes to the Mycobacterium tuberculosis proteome to produce a homology-based functional network of the organism with a high confidence and coverage. We use the network for predicting functions of uncharacterised proteins. AVAILABILITY: Protein pair-wise functional relationship scores for Mycobacterium tuberculosis strain CDC1551 sequence data and python scripts to compute these scores are available at http://web.cbio.uct.ac.za/~gmazandu/scoringschemes. |
format | Text |
id | pubmed-3079720 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-30797202011-04-27 Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data Mazandu, Gaston K. Mulder, Nicola J. PLoS One Research Article The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functional connections between protein pairs, thus enabling functional inferences for uncharacterized proteins. These biological data are primarily in the form of sequences, which determine functions, although functional properties of a protein can often be predicted from just the domains it contains. Thus, protein sequences and domains can be used to predict protein pair-wise functional relationships, and thus contribute to the function prediction process of uncharacterized proteins in order to ensure that knowledge is gained from sequencing efforts. In this work, we introduce information-theoretic based approaches to score protein-protein functional interaction pairs predicted from protein sequence similarity and conserved protein signature matches. The proposed schemes are effective for data-driven scoring of connections between protein pairs. We applied these schemes to the Mycobacterium tuberculosis proteome to produce a homology-based functional network of the organism with a high confidence and coverage. We use the network for predicting functions of uncharacterised proteins. AVAILABILITY: Protein pair-wise functional relationship scores for Mycobacterium tuberculosis strain CDC1551 sequence data and python scripts to compute these scores are available at http://web.cbio.uct.ac.za/~gmazandu/scoringschemes. Public Library of Science 2011-04-19 /pmc/articles/PMC3079720/ /pubmed/21526183 http://dx.doi.org/10.1371/journal.pone.0018607 Text en Mazandu, Mulder. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Mazandu, Gaston K. Mulder, Nicola J. Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data |
title | Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data |
title_full | Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data |
title_fullStr | Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data |
title_full_unstemmed | Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data |
title_short | Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data |
title_sort | scoring protein relationships in functional interaction networks predicted from sequence data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3079720/ https://www.ncbi.nlm.nih.gov/pubmed/21526183 http://dx.doi.org/10.1371/journal.pone.0018607 |
work_keys_str_mv | AT mazandugastonk scoringproteinrelationshipsinfunctionalinteractionnetworkspredictedfromsequencedata AT muldernicolaj scoringproteinrelationshipsinfunctionalinteractionnetworkspredictedfromsequencedata |