Cargando…

MIPhy: identify and quantify rapidly evolving members of large gene families

After transitioning to a new environment, species often exhibit rapid phenotypic innovation. One of the fastest mechanisms for this is duplication followed by specialization of existing genes. When this happens to a member of a gene family, it tends to leave a detectable phylogenetic signature of li...

Descripción completa

Detalles Bibliográficos
Autores principales: Curran, David M., Gilleard, John S., Wasmuth, James D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5983006/
https://www.ncbi.nlm.nih.gov/pubmed/29868279
http://dx.doi.org/10.7717/peerj.4873
_version_ 1783328356602740736
author Curran, David M.
Gilleard, John S.
Wasmuth, James D.
author_facet Curran, David M.
Gilleard, John S.
Wasmuth, James D.
author_sort Curran, David M.
collection PubMed
description After transitioning to a new environment, species often exhibit rapid phenotypic innovation. One of the fastest mechanisms for this is duplication followed by specialization of existing genes. When this happens to a member of a gene family, it tends to leave a detectable phylogenetic signature of lineage-specific expansions and contractions. These can be identified by analyzing the gene family across several species and identifying patterns of gene duplication and loss that do not correlate with the known relationships between those species. This signature, termed phylogenetic instability, has been previously linked to adaptations that change the way an organism samples and responds to its environment; conversely, low phylogenetic instability has been previously linked to proteins with endogenous functions. With the increase in genome-level data, there is a need to identify and quantify phylogenetic instability. Here, we present Minimizing Instability in Phylogenetics (MIPhy), a tool that solves this problem by quantifying the incongruence of a gene’s evolutionary history. The motivation behind MIPhy was to produce a tool to aid in interpreting phylogenetic trees. It can predict which members of a gene family are under adaptive evolution, working only from a gene tree and the relationship between the species under consideration. While it does not conduct any estimation of positive selection—which is the typical indication of adaptive evolution—the results tend to agree. We demonstrate the usefulness of MIPhy by accurately predicting which members of the mammalian cytochrome P450 gene superfamily metabolize xenobiotics and which metabolize endogenous compounds. Our predictions correlate very well with known substrate specificities of the human enzymes. We also analyze the Caenorhabditis collagen gene family and use MIPhy to predict genes that produce an observable phenotype when knocked down in C. elegans, and show that our predictions correlate well with existing knowledge. The software can be downloaded and installed from https://github.com/dave-the-scientist/miphy and is also available as an online web tool at http://www.miphy.wasmuthlab.org.
format Online
Article
Text
id pubmed-5983006
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-59830062018-06-04 MIPhy: identify and quantify rapidly evolving members of large gene families Curran, David M. Gilleard, John S. Wasmuth, James D. PeerJ Bioinformatics After transitioning to a new environment, species often exhibit rapid phenotypic innovation. One of the fastest mechanisms for this is duplication followed by specialization of existing genes. When this happens to a member of a gene family, it tends to leave a detectable phylogenetic signature of lineage-specific expansions and contractions. These can be identified by analyzing the gene family across several species and identifying patterns of gene duplication and loss that do not correlate with the known relationships between those species. This signature, termed phylogenetic instability, has been previously linked to adaptations that change the way an organism samples and responds to its environment; conversely, low phylogenetic instability has been previously linked to proteins with endogenous functions. With the increase in genome-level data, there is a need to identify and quantify phylogenetic instability. Here, we present Minimizing Instability in Phylogenetics (MIPhy), a tool that solves this problem by quantifying the incongruence of a gene’s evolutionary history. The motivation behind MIPhy was to produce a tool to aid in interpreting phylogenetic trees. It can predict which members of a gene family are under adaptive evolution, working only from a gene tree and the relationship between the species under consideration. While it does not conduct any estimation of positive selection—which is the typical indication of adaptive evolution—the results tend to agree. We demonstrate the usefulness of MIPhy by accurately predicting which members of the mammalian cytochrome P450 gene superfamily metabolize xenobiotics and which metabolize endogenous compounds. Our predictions correlate very well with known substrate specificities of the human enzymes. We also analyze the Caenorhabditis collagen gene family and use MIPhy to predict genes that produce an observable phenotype when knocked down in C. elegans, and show that our predictions correlate well with existing knowledge. The software can be downloaded and installed from https://github.com/dave-the-scientist/miphy and is also available as an online web tool at http://www.miphy.wasmuthlab.org. PeerJ Inc. 2018-05-29 /pmc/articles/PMC5983006/ /pubmed/29868279 http://dx.doi.org/10.7717/peerj.4873 Text en © 2018 Curran et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Curran, David M.
Gilleard, John S.
Wasmuth, James D.
MIPhy: identify and quantify rapidly evolving members of large gene families
title MIPhy: identify and quantify rapidly evolving members of large gene families
title_full MIPhy: identify and quantify rapidly evolving members of large gene families
title_fullStr MIPhy: identify and quantify rapidly evolving members of large gene families
title_full_unstemmed MIPhy: identify and quantify rapidly evolving members of large gene families
title_short MIPhy: identify and quantify rapidly evolving members of large gene families
title_sort miphy: identify and quantify rapidly evolving members of large gene families
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5983006/
https://www.ncbi.nlm.nih.gov/pubmed/29868279
http://dx.doi.org/10.7717/peerj.4873
work_keys_str_mv AT currandavidm miphyidentifyandquantifyrapidlyevolvingmembersoflargegenefamilies
AT gilleardjohns miphyidentifyandquantifyrapidlyevolvingmembersoflargegenefamilies
AT wasmuthjamesd miphyidentifyandquantifyrapidlyevolvingmembersoflargegenefamilies