Cargando…
Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures
Molecular identification of mixed‐species pollen samples has a range of applications in various fields of research. To date, such molecular identification has primarily been carried out via amplicon sequencing, but whole‐genome shotgun (WGS) sequencing of pollen DNA has potential advantages, includi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8601920/ https://www.ncbi.nlm.nih.gov/pubmed/34824813 http://dx.doi.org/10.1002/ece3.8281 |
_version_ | 1784601459784941568 |
---|---|
author | Bell, Karen L. Petit, Robert A. Cutler, Anya Dobbs, Emily K. Macpherson, J. Michael Read, Timothy D. Burgess, Kevin S. Brosi, Berry J. |
author_facet | Bell, Karen L. Petit, Robert A. Cutler, Anya Dobbs, Emily K. Macpherson, J. Michael Read, Timothy D. Burgess, Kevin S. Brosi, Berry J. |
author_sort | Bell, Karen L. |
collection | PubMed |
description | Molecular identification of mixed‐species pollen samples has a range of applications in various fields of research. To date, such molecular identification has primarily been carried out via amplicon sequencing, but whole‐genome shotgun (WGS) sequencing of pollen DNA has potential advantages, including (1) more genetic information per sample and (2) the potential for better quantitative matching. In this study, we tested the performance of WGS sequencing methodology and publicly available reference sequences in identifying species and quantifying their relative abundance in pollen mock communities. Using mock communities previously analyzed with DNA metabarcoding, we sequenced approximately 200Mbp for each sample using Illumina HiSeq and MiSeq. Taxonomic identifications were based on the Kraken k‐mer identification method with reference libraries constructed from full‐genome and short read archive data from the NCBI database. We found WGS to be a reliable method for taxonomic identification of pollen with near 100% identification of species in mixtures but generating higher rates of false positives (reads not identified to the correct taxon at the required taxonomic level) relative to rbcL and ITS2 amplicon sequencing. For quantification of relative species abundance, WGS data provided a stronger correlation between pollen grain proportion and sequence read proportion, but diverged more from a 1:1 relationship, likely due to the higher rate of false positives. Currently, a limitation of WGS‐based pollen identification is the lack of representation of plant diversity in publicly available genome databases. As databases improve and costs drop, we expect that eventually genomics methods will become the methods of choice for species identification and quantification of mixed‐species pollen samples. |
format | Online Article Text |
id | pubmed-8601920 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-86019202021-11-24 Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures Bell, Karen L. Petit, Robert A. Cutler, Anya Dobbs, Emily K. Macpherson, J. Michael Read, Timothy D. Burgess, Kevin S. Brosi, Berry J. Ecol Evol Research Articles Molecular identification of mixed‐species pollen samples has a range of applications in various fields of research. To date, such molecular identification has primarily been carried out via amplicon sequencing, but whole‐genome shotgun (WGS) sequencing of pollen DNA has potential advantages, including (1) more genetic information per sample and (2) the potential for better quantitative matching. In this study, we tested the performance of WGS sequencing methodology and publicly available reference sequences in identifying species and quantifying their relative abundance in pollen mock communities. Using mock communities previously analyzed with DNA metabarcoding, we sequenced approximately 200Mbp for each sample using Illumina HiSeq and MiSeq. Taxonomic identifications were based on the Kraken k‐mer identification method with reference libraries constructed from full‐genome and short read archive data from the NCBI database. We found WGS to be a reliable method for taxonomic identification of pollen with near 100% identification of species in mixtures but generating higher rates of false positives (reads not identified to the correct taxon at the required taxonomic level) relative to rbcL and ITS2 amplicon sequencing. For quantification of relative species abundance, WGS data provided a stronger correlation between pollen grain proportion and sequence read proportion, but diverged more from a 1:1 relationship, likely due to the higher rate of false positives. Currently, a limitation of WGS‐based pollen identification is the lack of representation of plant diversity in publicly available genome databases. As databases improve and costs drop, we expect that eventually genomics methods will become the methods of choice for species identification and quantification of mixed‐species pollen samples. John Wiley and Sons Inc. 2021-11-04 /pmc/articles/PMC8601920/ /pubmed/34824813 http://dx.doi.org/10.1002/ece3.8281 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Bell, Karen L. Petit, Robert A. Cutler, Anya Dobbs, Emily K. Macpherson, J. Michael Read, Timothy D. Burgess, Kevin S. Brosi, Berry J. Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures |
title | Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures |
title_full | Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures |
title_fullStr | Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures |
title_full_unstemmed | Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures |
title_short | Comparing whole‐genome shotgun sequencing and DNA metabarcoding approaches for species identification and quantification of pollen species mixtures |
title_sort | comparing whole‐genome shotgun sequencing and dna metabarcoding approaches for species identification and quantification of pollen species mixtures |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8601920/ https://www.ncbi.nlm.nih.gov/pubmed/34824813 http://dx.doi.org/10.1002/ece3.8281 |
work_keys_str_mv | AT bellkarenl comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures AT petitroberta comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures AT cutleranya comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures AT dobbsemilyk comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures AT macphersonjmichael comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures AT readtimothyd comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures AT burgesskevins comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures AT brosiberryj comparingwholegenomeshotgunsequencinganddnametabarcodingapproachesforspeciesidentificationandquantificationofpollenspeciesmixtures |