Cargando…

Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs

The data available for reconstructing molecular phylogenies have become wildly disparate. Phylogenomic studies can generate data for thousands of genetic markers for dozens of species, but for hundreds of other taxa, data may be available from only a few genes. Can these two types of data be integra...

Descripción completa

Detalles Bibliográficos
Autores principales: Portik, Daniel M, Streicher, Jeffrey W, Blackburn, David C, Moen, Daniel S, Hutter, Carl R, Wiens, John J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10202597/
https://www.ncbi.nlm.nih.gov/pubmed/37140129
http://dx.doi.org/10.1093/molbev/msad109
_version_ 1785045458035408896
author Portik, Daniel M
Streicher, Jeffrey W
Blackburn, David C
Moen, Daniel S
Hutter, Carl R
Wiens, John J
author_facet Portik, Daniel M
Streicher, Jeffrey W
Blackburn, David C
Moen, Daniel S
Hutter, Carl R
Wiens, John J
author_sort Portik, Daniel M
collection PubMed
description The data available for reconstructing molecular phylogenies have become wildly disparate. Phylogenomic studies can generate data for thousands of genetic markers for dozens of species, but for hundreds of other taxa, data may be available from only a few genes. Can these two types of data be integrated to combine the advantages of both, addressing the relationships of hundreds of species with thousands of genes? Here, we show that this is possible, using data from frogs. We generated a phylogenomic data set for 138 ingroup species and 3,784 nuclear markers (ultraconserved elements [UCEs]), including new UCE data from 70 species. We also assembled a supermatrix data set, including data from 97% of frog genera (441 total), with 1–307 genes per taxon. We then produced a combined phylogenomic–supermatrix data set (a “gigamatrix”) containing 441 ingroup taxa and 4,091 markers but with 86% missing data overall. Likelihood analysis of the gigamatrix yielded a generally well-supported tree among families, largely consistent with trees from the phylogenomic data alone. All terminal taxa were placed in the expected families, even though 42.5% of these taxa each had >99.5% missing data and 70.2% had >90% missing data. Our results show that missing data need not be an impediment to successfully combining very large phylogenomic and supermatrix data sets, and they open the door to new studies that simultaneously maximize sampling of genes and taxa.
format Online
Article
Text
id pubmed-10202597
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102025972023-05-23 Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs Portik, Daniel M Streicher, Jeffrey W Blackburn, David C Moen, Daniel S Hutter, Carl R Wiens, John J Mol Biol Evol Discoveries The data available for reconstructing molecular phylogenies have become wildly disparate. Phylogenomic studies can generate data for thousands of genetic markers for dozens of species, but for hundreds of other taxa, data may be available from only a few genes. Can these two types of data be integrated to combine the advantages of both, addressing the relationships of hundreds of species with thousands of genes? Here, we show that this is possible, using data from frogs. We generated a phylogenomic data set for 138 ingroup species and 3,784 nuclear markers (ultraconserved elements [UCEs]), including new UCE data from 70 species. We also assembled a supermatrix data set, including data from 97% of frog genera (441 total), with 1–307 genes per taxon. We then produced a combined phylogenomic–supermatrix data set (a “gigamatrix”) containing 441 ingroup taxa and 4,091 markers but with 86% missing data overall. Likelihood analysis of the gigamatrix yielded a generally well-supported tree among families, largely consistent with trees from the phylogenomic data alone. All terminal taxa were placed in the expected families, even though 42.5% of these taxa each had >99.5% missing data and 70.2% had >90% missing data. Our results show that missing data need not be an impediment to successfully combining very large phylogenomic and supermatrix data sets, and they open the door to new studies that simultaneously maximize sampling of genes and taxa. Oxford University Press 2023-05-04 /pmc/articles/PMC10202597/ /pubmed/37140129 http://dx.doi.org/10.1093/molbev/msad109 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Discoveries
Portik, Daniel M
Streicher, Jeffrey W
Blackburn, David C
Moen, Daniel S
Hutter, Carl R
Wiens, John J
Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs
title Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs
title_full Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs
title_fullStr Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs
title_full_unstemmed Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs
title_short Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs
title_sort redefining possible: combining phylogenomic and supersparse data in frogs
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10202597/
https://www.ncbi.nlm.nih.gov/pubmed/37140129
http://dx.doi.org/10.1093/molbev/msad109
work_keys_str_mv AT portikdanielm redefiningpossiblecombiningphylogenomicandsupersparsedatainfrogs
AT streicherjeffreyw redefiningpossiblecombiningphylogenomicandsupersparsedatainfrogs
AT blackburndavidc redefiningpossiblecombiningphylogenomicandsupersparsedatainfrogs
AT moendaniels redefiningpossiblecombiningphylogenomicandsupersparsedatainfrogs
AT huttercarlr redefiningpossiblecombiningphylogenomicandsupersparsedatainfrogs
AT wiensjohnj redefiningpossiblecombiningphylogenomicandsupersparsedatainfrogs