Cargando…

Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data

The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functi...

Descripción completa

Detalles Bibliográficos
Autores principales: Mazandu, Gaston K., Mulder, Nicola J.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3079720/
https://www.ncbi.nlm.nih.gov/pubmed/21526183
http://dx.doi.org/10.1371/journal.pone.0018607
_version_ 1782202053777424384
author Mazandu, Gaston K.
Mulder, Nicola J.
author_facet Mazandu, Gaston K.
Mulder, Nicola J.
author_sort Mazandu, Gaston K.
collection PubMed
description The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functional connections between protein pairs, thus enabling functional inferences for uncharacterized proteins. These biological data are primarily in the form of sequences, which determine functions, although functional properties of a protein can often be predicted from just the domains it contains. Thus, protein sequences and domains can be used to predict protein pair-wise functional relationships, and thus contribute to the function prediction process of uncharacterized proteins in order to ensure that knowledge is gained from sequencing efforts. In this work, we introduce information-theoretic based approaches to score protein-protein functional interaction pairs predicted from protein sequence similarity and conserved protein signature matches. The proposed schemes are effective for data-driven scoring of connections between protein pairs. We applied these schemes to the Mycobacterium tuberculosis proteome to produce a homology-based functional network of the organism with a high confidence and coverage. We use the network for predicting functions of uncharacterised proteins. AVAILABILITY: Protein pair-wise functional relationship scores for Mycobacterium tuberculosis strain CDC1551 sequence data and python scripts to compute these scores are available at http://web.cbio.uct.ac.za/~gmazandu/scoringschemes.
format Text
id pubmed-3079720
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30797202011-04-27 Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data Mazandu, Gaston K. Mulder, Nicola J. PLoS One Research Article The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functional connections between protein pairs, thus enabling functional inferences for uncharacterized proteins. These biological data are primarily in the form of sequences, which determine functions, although functional properties of a protein can often be predicted from just the domains it contains. Thus, protein sequences and domains can be used to predict protein pair-wise functional relationships, and thus contribute to the function prediction process of uncharacterized proteins in order to ensure that knowledge is gained from sequencing efforts. In this work, we introduce information-theoretic based approaches to score protein-protein functional interaction pairs predicted from protein sequence similarity and conserved protein signature matches. The proposed schemes are effective for data-driven scoring of connections between protein pairs. We applied these schemes to the Mycobacterium tuberculosis proteome to produce a homology-based functional network of the organism with a high confidence and coverage. We use the network for predicting functions of uncharacterised proteins. AVAILABILITY: Protein pair-wise functional relationship scores for Mycobacterium tuberculosis strain CDC1551 sequence data and python scripts to compute these scores are available at http://web.cbio.uct.ac.za/~gmazandu/scoringschemes. Public Library of Science 2011-04-19 /pmc/articles/PMC3079720/ /pubmed/21526183 http://dx.doi.org/10.1371/journal.pone.0018607 Text en Mazandu, Mulder. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mazandu, Gaston K.
Mulder, Nicola J.
Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data
title Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data
title_full Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data
title_fullStr Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data
title_full_unstemmed Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data
title_short Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data
title_sort scoring protein relationships in functional interaction networks predicted from sequence data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3079720/
https://www.ncbi.nlm.nih.gov/pubmed/21526183
http://dx.doi.org/10.1371/journal.pone.0018607
work_keys_str_mv AT mazandugastonk scoringproteinrelationshipsinfunctionalinteractionnetworkspredictedfromsequencedata
AT muldernicolaj scoringproteinrelationshipsinfunctionalinteractionnetworkspredictedfromsequencedata