Cargando…
High‐throughput methods for efficiently building massive phylogenies from natural history collections
PREMISE: Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling‐to‐sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high‐throughput collections‐based approach that empowers researche...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7910806/ https://www.ncbi.nlm.nih.gov/pubmed/33680581 http://dx.doi.org/10.1002/aps3.11410 |
_version_ | 1783656198430523392 |
---|---|
author | Folk, Ryan A. Kates, Heather R. LaFrance, Raphael Soltis, Douglas E. Soltis, Pamela S. Guralnick, Robert P. |
author_facet | Folk, Ryan A. Kates, Heather R. LaFrance, Raphael Soltis, Douglas E. Soltis, Pamela S. Guralnick, Robert P. |
author_sort | Folk, Ryan A. |
collection | PubMed |
description | PREMISE: Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling‐to‐sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high‐throughput collections‐based approach that empowers researchers to explore more branches of the tree of life with numerous loci. METHODS: We developed an integrated Specimen‐to‐Laboratory Information Management System (SLIMS), connecting sampling and wet lab efforts with progress tracking at each stage. Using unique identifiers encoded in QR codes and a taxonomic database, a research team can sample herbarium specimens, efficiently record the sampling event, and capture specimen images. After sampling in herbaria, images are uploaded to a citizen science platform for metadata generation, and tissue samples are moved through a simple, high‐throughput, plate‐based herbarium DNA extraction and sequencing protocol. RESULTS: We applied this sampling‐to‐sequencing workflow to ~15,000 species, producing for the first time a data set with ~50% taxonomic representation of the “nitrogen‐fixing clade” of angiosperms. DISCUSSION: The approach we present is appropriate at any taxonomic scale and is extensible to other collection types. The widespread use of large‐scale sampling strategies repositions herbaria as accessible but largely untapped resources for broad taxonomic sampling with thousands of species. |
format | Online Article Text |
id | pubmed-7910806 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79108062021-03-05 High‐throughput methods for efficiently building massive phylogenies from natural history collections Folk, Ryan A. Kates, Heather R. LaFrance, Raphael Soltis, Douglas E. Soltis, Pamela S. Guralnick, Robert P. Appl Plant Sci Application Article PREMISE: Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling‐to‐sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high‐throughput collections‐based approach that empowers researchers to explore more branches of the tree of life with numerous loci. METHODS: We developed an integrated Specimen‐to‐Laboratory Information Management System (SLIMS), connecting sampling and wet lab efforts with progress tracking at each stage. Using unique identifiers encoded in QR codes and a taxonomic database, a research team can sample herbarium specimens, efficiently record the sampling event, and capture specimen images. After sampling in herbaria, images are uploaded to a citizen science platform for metadata generation, and tissue samples are moved through a simple, high‐throughput, plate‐based herbarium DNA extraction and sequencing protocol. RESULTS: We applied this sampling‐to‐sequencing workflow to ~15,000 species, producing for the first time a data set with ~50% taxonomic representation of the “nitrogen‐fixing clade” of angiosperms. DISCUSSION: The approach we present is appropriate at any taxonomic scale and is extensible to other collection types. The widespread use of large‐scale sampling strategies repositions herbaria as accessible but largely untapped resources for broad taxonomic sampling with thousands of species. John Wiley and Sons Inc. 2021-02-27 /pmc/articles/PMC7910806/ /pubmed/33680581 http://dx.doi.org/10.1002/aps3.11410 Text en © 2021 Folk et al. Applications in Plant Sciences is published by Wiley Periodicals LLC on behalf of the Botanical Society of America This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Application Article Folk, Ryan A. Kates, Heather R. LaFrance, Raphael Soltis, Douglas E. Soltis, Pamela S. Guralnick, Robert P. High‐throughput methods for efficiently building massive phylogenies from natural history collections |
title | High‐throughput methods for efficiently building massive phylogenies from natural history collections |
title_full | High‐throughput methods for efficiently building massive phylogenies from natural history collections |
title_fullStr | High‐throughput methods for efficiently building massive phylogenies from natural history collections |
title_full_unstemmed | High‐throughput methods for efficiently building massive phylogenies from natural history collections |
title_short | High‐throughput methods for efficiently building massive phylogenies from natural history collections |
title_sort | high‐throughput methods for efficiently building massive phylogenies from natural history collections |
topic | Application Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7910806/ https://www.ncbi.nlm.nih.gov/pubmed/33680581 http://dx.doi.org/10.1002/aps3.11410 |
work_keys_str_mv | AT folkryana highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections AT katesheatherr highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections AT lafranceraphael highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections AT soltisdouglase highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections AT soltispamelas highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections AT guralnickrobertp highthroughputmethodsforefficientlybuildingmassivephylogeniesfromnaturalhistorycollections |