Cargando…

Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes

An important element of the developing field of proteomics is to understand protein-protein interactions and other functional links amongst genes. Across-species correlation methods for detecting functional links work on the premise that functionally linked proteins will tend to show a common patter...

Descripción completa

Detalles Bibliográficos
Autores principales: Barker, Daniel, Pagel, Mark
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1183509/
https://www.ncbi.nlm.nih.gov/pubmed/16103904
http://dx.doi.org/10.1371/journal.pcbi.0010003
_version_ 1782124701782376448
author Barker, Daniel
Pagel, Mark
author_facet Barker, Daniel
Pagel, Mark
author_sort Barker, Daniel
collection PubMed
description An important element of the developing field of proteomics is to understand protein-protein interactions and other functional links amongst genes. Across-species correlation methods for detecting functional links work on the premise that functionally linked proteins will tend to show a common pattern of presence and absence across a range of genomes. We describe a maximum likelihood statistical model for predicting functional gene linkages. The method detects independent instances of the correlated gain or loss of pairs of proteins on phylogenetic trees, reducing the high rates of false positives observed in conventional across-species methods that do not explicitly incorporate a phylogeny. We show, in a dataset of 10,551 protein pairs, that the phylogenetic method improves by up to 35% on across-species analyses at identifying known functionally linked proteins. The method shows that protein pairs with at least two to three correlated events of gain or loss are almost certainly functionally linked. Contingent evolution, in which one gene's presence or absence depends upon the presence of another, can also be detected phylogenetically, and may identify genes whose functional significance depends upon its interaction with other genes. Incorporating phylogenetic information improves the prediction of functional linkages. The improvement derives from having a lower rate of false positives and from detecting trends that across-species analyses miss. Phylogenetic methods can easily be incorporated into the screening of large-scale bioinformatics datasets to identify sets of protein links and to characterise gene networks.
format Text
id pubmed-1183509
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-11835092005-08-12 Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes Barker, Daniel Pagel, Mark PLoS Comput Biol Research Article An important element of the developing field of proteomics is to understand protein-protein interactions and other functional links amongst genes. Across-species correlation methods for detecting functional links work on the premise that functionally linked proteins will tend to show a common pattern of presence and absence across a range of genomes. We describe a maximum likelihood statistical model for predicting functional gene linkages. The method detects independent instances of the correlated gain or loss of pairs of proteins on phylogenetic trees, reducing the high rates of false positives observed in conventional across-species methods that do not explicitly incorporate a phylogeny. We show, in a dataset of 10,551 protein pairs, that the phylogenetic method improves by up to 35% on across-species analyses at identifying known functionally linked proteins. The method shows that protein pairs with at least two to three correlated events of gain or loss are almost certainly functionally linked. Contingent evolution, in which one gene's presence or absence depends upon the presence of another, can also be detected phylogenetically, and may identify genes whose functional significance depends upon its interaction with other genes. Incorporating phylogenetic information improves the prediction of functional linkages. The improvement derives from having a lower rate of false positives and from detecting trends that across-species analyses miss. Phylogenetic methods can easily be incorporated into the screening of large-scale bioinformatics datasets to identify sets of protein links and to characterise gene networks. Public Library of Science 2005-06 2005-06-24 /pmc/articles/PMC1183509/ /pubmed/16103904 http://dx.doi.org/10.1371/journal.pcbi.0010003 Text en Copyright: © 2005 Barker and Pagel. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Barker, Daniel
Pagel, Mark
Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes
title Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes
title_full Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes
title_fullStr Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes
title_full_unstemmed Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes
title_short Predicting Functional Gene Links from Phylogenetic-Statistical Analyses of Whole Genomes
title_sort predicting functional gene links from phylogenetic-statistical analyses of whole genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1183509/
https://www.ncbi.nlm.nih.gov/pubmed/16103904
http://dx.doi.org/10.1371/journal.pcbi.0010003
work_keys_str_mv AT barkerdaniel predictingfunctionalgenelinksfromphylogeneticstatisticalanalysesofwholegenomes
AT pagelmark predictingfunctionalgenelinksfromphylogeneticstatisticalanalysesofwholegenomes