Cargando…

Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures

Molecular identification of mixed‐species pollen samples has a range of applications in various fields of research. To date, such molecular identification has primarily been carried out via amplicon sequencing, but whole‐genome shotgun (WGS) sequencing of pollen DNA has potential advantages, includi...

Descripción completa

Detalles Bibliográficos
Autores principales: Bell, Karen L., Petit, Robert A., Cutler, Anya, Dobbs, Emily K., Macpherson, J. Michael, Read, Timothy D., Burgess, Kevin S., Brosi, Berry J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8601920/
https://www.ncbi.nlm.nih.gov/pubmed/34824813
http://dx.doi.org/10.1002/ece3.8281
_version_ 1784601459784941568
author Bell, Karen L.
Petit, Robert A.
Cutler, Anya
Dobbs, Emily K.
Macpherson, J. Michael
Read, Timothy D.
Burgess, Kevin S.
Brosi, Berry J.
author_facet Bell, Karen L.
Petit, Robert A.
Cutler, Anya
Dobbs, Emily K.
Macpherson, J. Michael
Read, Timothy D.
Burgess, Kevin S.
Brosi, Berry J.
author_sort Bell, Karen L.
collection PubMed
description Molecular identification of mixed‐species pollen samples has a range of applications in various fields of research. To date, such molecular identification has primarily been carried out via amplicon sequencing, but whole‐genome shotgun (WGS) sequencing of pollen DNA has potential advantages, including (1) more genetic information per sample and (2) the potential for better quantitative matching. In this study, we tested the performance of WGS sequencing methodology and publicly available reference sequences in identifying species and quantifying their relative abundance in pollen mock communities. Using mock communities previously analyzed with DNA metabarcoding, we sequenced approximately 200Mbp for each sample using Illumina HiSeq and MiSeq. Taxonomic identifications were based on the Kraken k‐mer identification method with reference libraries constructed from full‐genome and short read archive data from the NCBI database. We found WGS to be a reliable method for taxonomic identification of pollen with near 100% identification of species in mixtures but generating higher rates of false positives (reads not identified to the correct taxon at the required taxonomic level) relative to rbcL and ITS2 amplicon sequencing. For quantification of relative species abundance, WGS data provided a stronger correlation between pollen grain proportion and sequence read proportion, but diverged more from a 1:1 relationship, likely due to the higher rate of false positives. Currently, a limitation of WGS‐based pollen identification is the lack of representation of plant diversity in publicly available genome databases. As databases improve and costs drop, we expect that eventually genomics methods will become the methods of choice for species identification and quantification of mixed‐species pollen samples.
format Online
Article
Text
id pubmed-8601920
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-86019202021-11-24 Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures Bell, Karen L. Petit, Robert A. Cutler, Anya Dobbs, Emily K. Macpherson, J. Michael Read, Timothy D. Burgess, Kevin S. Brosi, Berry J. Ecol Evol Research Articles Molecular identification of mixed‐species pollen samples has a range of applications in various fields of research. To date, such molecular identification has primarily been carried out via amplicon sequencing, but whole‐genome shotgun (WGS) sequencing of pollen DNA has potential advantages, including (1) more genetic information per sample and (2) the potential for better quantitative matching. In this study, we tested the performance of WGS sequencing methodology and publicly available reference sequences in identifying species and quantifying their relative abundance in pollen mock communities. Using mock communities previously analyzed with DNA metabarcoding, we sequenced approximately 200Mbp for each sample using Illumina HiSeq and MiSeq. Taxonomic identifications were based on the Kraken k‐mer identification method with reference libraries constructed from full‐genome and short read archive data from the NCBI database. We found WGS to be a reliable method for taxonomic identification of pollen with near 100% identification of species in mixtures but generating higher rates of false positives (reads not identified to the correct taxon at the required taxonomic level) relative to rbcL and ITS2 amplicon sequencing. For quantification of relative species abundance, WGS data provided a stronger correlation between pollen grain proportion and sequence read proportion, but diverged more from a 1:1 relationship, likely due to the higher rate of false positives. Currently, a limitation of WGS‐based pollen identification is the lack of representation of plant diversity in publicly available genome databases. As databases improve and costs drop, we expect that eventually genomics methods will become the methods of choice for species identification and quantification of mixed‐species pollen samples. John Wiley and Sons Inc. 2021-11-04 /pmc/articles/PMC8601920/ /pubmed/34824813 http://dx.doi.org/10.1002/ece3.8281 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Bell, Karen L.
Petit, Robert A.
Cutler, Anya
Dobbs, Emily K.
Macpherson, J. Michael
Read, Timothy D.
Burgess, Kevin S.
Brosi, Berry J.
Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures
title Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures
title_full Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures
title_fullStr Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures
title_full_unstemmed Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures
title_short Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures
title_sort comparing whole‐genome shotgun sequencing and dna metabarcoding approaches for species identification and quantification of pollen species mixtures
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8601920/
https://www.ncbi.nlm.nih.gov/pubmed/34824813
http://dx.doi.org/10.1002/ece3.8281
work_keys_str_mv AT bellkarenl comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures
AT petitroberta comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures
AT cutleranya comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures
AT dobbsemilyk comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures
AT macphersonjmichael comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures
AT readtimothyd comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures
AT burgesskevins comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures
AT brosiberryj comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures