Cargando…

High‐throughput methods for efficiently building massive phylogenies from natural history collections

PREMISE: Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling‐to‐sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high‐throughput collections‐based approach that empowers researche...

Descripción completa

Detalles Bibliográficos
Autores principales: Folk, Ryan A., Kates, Heather R., LaFrance, Raphael, Soltis, Douglas E., Soltis, Pamela S., Guralnick, Robert P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7910806/
https://www.ncbi.nlm.nih.gov/pubmed/33680581
http://dx.doi.org/10.1002/aps3.11410
_version_ 1783656198430523392
author Folk, Ryan A.
Kates, Heather R.
LaFrance, Raphael
Soltis, Douglas E.
Soltis, Pamela S.
Guralnick, Robert P.
author_facet Folk, Ryan A.
Kates, Heather R.
LaFrance, Raphael
Soltis, Douglas E.
Soltis, Pamela S.
Guralnick, Robert P.
author_sort Folk, Ryan A.
collection PubMed
description PREMISE: Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling‐to‐sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high‐throughput collections‐based approach that empowers researchers to explore more branches of the tree of life with numerous loci. METHODS: We developed an integrated Specimen‐to‐Laboratory Information Management System (SLIMS), connecting sampling and wet lab efforts with progress tracking at each stage. Using unique identifiers encoded in QR codes and a taxonomic database, a research team can sample herbarium specimens, efficiently record the sampling event, and capture specimen images. After sampling in herbaria, images are uploaded to a citizen science platform for metadata generation, and tissue samples are moved through a simple, high‐throughput, plate‐based herbarium DNA extraction and sequencing protocol. RESULTS: We applied this sampling‐to‐sequencing workflow to ~15,000 species, producing for the first time a data set with ~50% taxonomic representation of the “nitrogen‐fixing clade” of angiosperms. DISCUSSION: The approach we present is appropriate at any taxonomic scale and is extensible to other collection types. The widespread use of large‐scale sampling strategies repositions herbaria as accessible but largely untapped resources for broad taxonomic sampling with thousands of species.
format Online
Article
Text
id pubmed-7910806
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-79108062021-03-05 High‐throughput methods for efficiently building massive phylogenies from natural history collections Folk, Ryan A. Kates, Heather R. LaFrance, Raphael Soltis, Douglas E. Soltis, Pamela S. Guralnick, Robert P. Appl Plant Sci Application Article PREMISE: Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling‐to‐sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high‐throughput collections‐based approach that empowers researchers to explore more branches of the tree of life with numerous loci. METHODS: We developed an integrated Specimen‐to‐Laboratory Information Management System (SLIMS), connecting sampling and wet lab efforts with progress tracking at each stage. Using unique identifiers encoded in QR codes and a taxonomic database, a research team can sample herbarium specimens, efficiently record the sampling event, and capture specimen images. After sampling in herbaria, images are uploaded to a citizen science platform for metadata generation, and tissue samples are moved through a simple, high‐throughput, plate‐based herbarium DNA extraction and sequencing protocol. RESULTS: We applied this sampling‐to‐sequencing workflow to ~15,000 species, producing for the first time a data set with ~50% taxonomic representation of the “nitrogen‐fixing clade” of angiosperms. DISCUSSION: The approach we present is appropriate at any taxonomic scale and is extensible to other collection types. The widespread use of large‐scale sampling strategies repositions herbaria as accessible but largely untapped resources for broad taxonomic sampling with thousands of species. John Wiley and Sons Inc. 2021-02-27 /pmc/articles/PMC7910806/ /pubmed/33680581 http://dx.doi.org/10.1002/aps3.11410 Text en © 2021 Folk et al. Applications in Plant Sciences is published by Wiley Periodicals LLC on behalf of the Botanical Society of America This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Application Article
Folk, Ryan A.
Kates, Heather R.
LaFrance, Raphael
Soltis, Douglas E.
Soltis, Pamela S.
Guralnick, Robert P.
High‐throughput methods for efficiently building massive phylogenies from natural history collections
title High‐throughput methods for efficiently building massive phylogenies from natural history collections
title_full High‐throughput methods for efficiently building massive phylogenies from natural history collections
title_fullStr High‐throughput methods for efficiently building massive phylogenies from natural history collections
title_full_unstemmed High‐throughput methods for efficiently building massive phylogenies from natural history collections
title_short High‐throughput methods for efficiently building massive phylogenies from natural history collections
title_sort high‐throughput methods for efficiently building massive phylogenies from natural history collections
topic Application Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7910806/
https://www.ncbi.nlm.nih.gov/pubmed/33680581
http://dx.doi.org/10.1002/aps3.11410
work_keys_str_mv AT folkryana highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections
AT katesheatherr highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections
AT lafranceraphael highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections
AT soltisdouglase highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections
AT soltispamelas highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections
AT guralnickrobertp highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections