Cargando…

Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling

Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pat...

Descripción completa

Detalles Bibliográficos
Autores principales: Shin, Junha, Lee, Insuk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4578931/
https://www.ncbi.nlm.nih.gov/pubmed/26394049
http://dx.doi.org/10.1371/journal.pone.0139006
_version_ 1782391189402550272
author Shin, Junha
Lee, Insuk
author_facet Shin, Junha
Lee, Insuk
author_sort Shin, Junha
collection PubMed
description Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life—Archaea, Bacteria, and Eukaryota—suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co-inheritance analysis within the domains of life will greatly potentiate the use of the expected onslaught of sequenced genomes in the study of molecular pathways in higher eukaryotes.
format Online
Article
Text
id pubmed-4578931
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45789312015-10-01 Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling Shin, Junha Lee, Insuk PLoS One Research Article Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life—Archaea, Bacteria, and Eukaryota—suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co-inheritance analysis within the domains of life will greatly potentiate the use of the expected onslaught of sequenced genomes in the study of molecular pathways in higher eukaryotes. Public Library of Science 2015-09-22 /pmc/articles/PMC4578931/ /pubmed/26394049 http://dx.doi.org/10.1371/journal.pone.0139006 Text en © 2015 Shin, Lee http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Shin, Junha
Lee, Insuk
Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling
title Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling
title_full Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling
title_fullStr Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling
title_full_unstemmed Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling
title_short Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling
title_sort co-inheritance analysis within the domains of life substantially improves network inference by phylogenetic profiling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4578931/
https://www.ncbi.nlm.nih.gov/pubmed/26394049
http://dx.doi.org/10.1371/journal.pone.0139006
work_keys_str_mv AT shinjunha coinheritanceanalysiswithinthedomainsoflifesubstantiallyimprovesnetworkinferencebyphylogeneticprofiling
AT leeinsuk coinheritanceanalysiswithinthedomainsoflifesubstantiallyimprovesnetworkinferencebyphylogeneticprofiling