
Handling of targeted amplicon sequencing data focusing on index hopping and demultiplexing using a nested metabarcoding approach in ecology

High-throughput sequencing platforms are increasingly being used for targeted amplicon sequencing because they enable cost-effective sequencing of large sample sets. For meaningful interpretation of targeted amplicon sequencing data and comparison between studies, it is critical that bioinformatic a...


Bibliographic Details
Main authors: Guenay-Greunke, Yasemin; Bohan, David A.; Traugott, Michael; Wallinger, Corinna
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484467/
https://www.ncbi.nlm.nih.gov/pubmed/34593851
http://dx.doi.org/10.1038/s41598-021-98018-4
author Guenay-Greunke, Yasemin
Bohan, David A.
Traugott, Michael
Wallinger, Corinna
collection PubMed
description High-throughput sequencing platforms are increasingly being used for targeted amplicon sequencing because they enable cost-effective sequencing of large sample sets. For meaningful interpretation of targeted amplicon sequencing data and comparison between studies, it is critical that bioinformatic analyses do not introduce artefacts and rely on detailed protocols to ensure that all methods are properly performed and documented. The analysis of large sample sets and the use of predefined indexes create challenges, such as adjusting the sequencing depth across samples and taking sequencing errors or index hopping into account. However, the potential biases these factors introduce to high-throughput amplicon sequencing data sets and how they may be overcome have rarely been addressed. On the example of a nested metabarcoding analysis of 1920 carabid beetle regurgitates to assess plant feeding, we investigated: (i) the variation in sequencing depth of individually tagged samples and the effect of library preparation on the data output; (ii) the influence of sequencing errors within index regions and its consequences for demultiplexing; and (iii) the effect of index hopping. Our results demonstrate that despite library quantification, large variation in read counts and sequencing depth occurred among samples and that the sequencing error rate in bioinformatic software is essential for accurate adapter/primer trimming and demultiplexing. Moreover, setting an index hopping threshold to avoid incorrect assignment of samples is highly recommended.
format Online
Article
Text
id pubmed-8484467
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84844672021-10-04 Handling of targeted amplicon sequencing data focusing on index hopping and demultiplexing using a nested metabarcoding approach in ecology Guenay-Greunke, Yasemin Bohan, David A. Traugott, Michael Wallinger, Corinna Sci Rep Article High-throughput sequencing platforms are increasingly being used for targeted amplicon sequencing because they enable cost-effective sequencing of large sample sets. For meaningful interpretation of targeted amplicon sequencing data and comparison between studies, it is critical that bioinformatic analyses do not introduce artefacts and rely on detailed protocols to ensure that all methods are properly performed and documented. The analysis of large sample sets and the use of predefined indexes create challenges, such as adjusting the sequencing depth across samples and taking sequencing errors or index hopping into account. However, the potential biases these factors introduce to high-throughput amplicon sequencing data sets and how they may be overcome have rarely been addressed. On the example of a nested metabarcoding analysis of 1920 carabid beetle regurgitates to assess plant feeding, we investigated: (i) the variation in sequencing depth of individually tagged samples and the effect of library preparation on the data output; (ii) the influence of sequencing errors within index regions and its consequences for demultiplexing; and (iii) the effect of index hopping. Our results demonstrate that despite library quantification, large variation in read counts and sequencing depth occurred among samples and that the sequencing error rate in bioinformatic software is essential for accurate adapter/primer trimming and demultiplexing. Moreover, setting an index hopping threshold to avoid incorrect assignment of samples is highly recommended. 
Nature Publishing Group UK 2021-09-30 /pmc/articles/PMC8484467/ /pubmed/34593851 http://dx.doi.org/10.1038/s41598-021-98018-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
title Handling of targeted amplicon sequencing data focusing on index hopping and demultiplexing using a nested metabarcoding approach in ecology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484467/
https://www.ncbi.nlm.nih.gov/pubmed/34593851
http://dx.doi.org/10.1038/s41598-021-98018-4