Cargando…

High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally

1. High‐throughput sequencing of amplicons (HTSA) has been proposed as an effective approach to evaluate taxonomic and genetic diversity at the same time. However, there are still uncertainties as to how the results produced by different bioinformatics treatments impact the conclusions drawn on biod...

Descripción completa

Detalles Bibliográficos
Autores principales: Couton, Marjorie, Baud, Aurélien, Daguin‐Thiébaut, Claire, Corre, Erwan, Comtet, Thierry, Viard, Frédérique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131761/
https://www.ncbi.nlm.nih.gov/pubmed/34026027
http://dx.doi.org/10.1002/ece3.7453
_version_ 1783694771584237568
author Couton, Marjorie
Baud, Aurélien
Daguin‐Thiébaut, Claire
Corre, Erwan
Comtet, Thierry
Viard, Frédérique
author_facet Couton, Marjorie
Baud, Aurélien
Daguin‐Thiébaut, Claire
Corre, Erwan
Comtet, Thierry
Viard, Frédérique
author_sort Couton, Marjorie
collection PubMed
description 1. High‐throughput sequencing of amplicons (HTSA) has been proposed as an effective approach to evaluate taxonomic and genetic diversity at the same time. However, there are still uncertainties as to how the results produced by different bioinformatics treatments impact the conclusions drawn on biodiversity and population genetics indices. 2. We evaluated the ability of six bioinformatics pipelines to recover taxonomic and genetic diversity from HTSA data obtained from controlled assemblages. To that end, 20 assemblages were produced using 354 colonies of Botrylloides spp., sampled in the wild in ten marinas around Brittany (France). We used DNA extracted from preservative ethanol (ebDNA) after various time of storage (3, 6, and 12 months), and from a bulk of preserved specimens (bulkDNA). DNA was amplified with primers designed for targeting this ascidian genus. Results obtained from HTSA data were compared with Sanger sequencing on individual zooids (i.e., individual barcoding). 3. Species identification and relative abundance determined with HTSA data from either ebDNA or bulkDNA were similar to those obtained with traditional individual barcoding. However, after 12 months of storage, the correlation between HTSA and individual‐based data was lower than after shorter durations. The six bioinformatics pipelines were able to depict accurately the genetic diversity using standard population genetics indices (H(S) and F(ST)), despite producing false positives and missing rare haplotypes. However, they did not perform equally and dada2 was the only pipeline able to retrieve all expected haplotypes. 4. This study showed that ebDNA is a nondestructive alternative for both species identification and haplotype recovery, providing storage does not last more than 6 months before DNA extraction. Choosing the bioinformatics pipeline is a matter of compromise, aiming to retrieve all true haplotypes while avoiding false positives. We here recommend to process HTSA data using dada2, including a chimera‐removal step. Even if the possibility to use multiplexed primer sets deserves further investigation to expand the taxonomic coverage in future similar studies, we showed that primers targeting a particular genus allowed to reliably analyze this genus within a complex community.
format Online
Article
Text
id pubmed-8131761
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-81317612021-05-21 High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally Couton, Marjorie Baud, Aurélien Daguin‐Thiébaut, Claire Corre, Erwan Comtet, Thierry Viard, Frédérique Ecol Evol Original Research 1. High‐throughput sequencing of amplicons (HTSA) has been proposed as an effective approach to evaluate taxonomic and genetic diversity at the same time. However, there are still uncertainties as to how the results produced by different bioinformatics treatments impact the conclusions drawn on biodiversity and population genetics indices. 2. We evaluated the ability of six bioinformatics pipelines to recover taxonomic and genetic diversity from HTSA data obtained from controlled assemblages. To that end, 20 assemblages were produced using 354 colonies of Botrylloides spp., sampled in the wild in ten marinas around Brittany (France). We used DNA extracted from preservative ethanol (ebDNA) after various time of storage (3, 6, and 12 months), and from a bulk of preserved specimens (bulkDNA). DNA was amplified with primers designed for targeting this ascidian genus. Results obtained from HTSA data were compared with Sanger sequencing on individual zooids (i.e., individual barcoding). 3. Species identification and relative abundance determined with HTSA data from either ebDNA or bulkDNA were similar to those obtained with traditional individual barcoding. However, after 12 months of storage, the correlation between HTSA and individual‐based data was lower than after shorter durations. The six bioinformatics pipelines were able to depict accurately the genetic diversity using standard population genetics indices (H(S) and F(ST)), despite producing false positives and missing rare haplotypes. However, they did not perform equally and dada2 was the only pipeline able to retrieve all expected haplotypes. 4. This study showed that ebDNA is a nondestructive alternative for both species identification and haplotype recovery, providing storage does not last more than 6 months before DNA extraction. Choosing the bioinformatics pipeline is a matter of compromise, aiming to retrieve all true haplotypes while avoiding false positives. We here recommend to process HTSA data using dada2, including a chimera‐removal step. Even if the possibility to use multiplexed primer sets deserves further investigation to expand the taxonomic coverage in future similar studies, we showed that primers targeting a particular genus allowed to reliably analyze this genus within a complex community. John Wiley and Sons Inc. 2021-03-23 /pmc/articles/PMC8131761/ /pubmed/34026027 http://dx.doi.org/10.1002/ece3.7453 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Couton, Marjorie
Baud, Aurélien
Daguin‐Thiébaut, Claire
Corre, Erwan
Comtet, Thierry
Viard, Frédérique
High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally
title High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally
title_full High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally
title_fullStr High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally
title_full_unstemmed High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally
title_short High‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally
title_sort high‐throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131761/
https://www.ncbi.nlm.nih.gov/pubmed/34026027
http://dx.doi.org/10.1002/ece3.7453
work_keys_str_mv AT coutonmarjorie highthroughputsequencingonpreservativeethanoliseffectiveatjointlyexamininginfraspecificandtaxonomicdiversityalthoughbioinformaticspipelinesdonotperformequally
AT baudaurelien highthroughputsequencingonpreservativeethanoliseffectiveatjointlyexamininginfraspecificandtaxonomicdiversityalthoughbioinformaticspipelinesdonotperformequally
AT daguinthiebautclaire highthroughputsequencingonpreservativeethanoliseffectiveatjointlyexamininginfraspecificandtaxonomicdiversityalthoughbioinformaticspipelinesdonotperformequally
AT correerwan highthroughputsequencingonpreservativeethanoliseffectiveatjointlyexamininginfraspecificandtaxonomicdiversityalthoughbioinformaticspipelinesdonotperformequally
AT comtetthierry highthroughputsequencingonpreservativeethanoliseffectiveatjointlyexamininginfraspecificandtaxonomicdiversityalthoughbioinformaticspipelinesdonotperformequally
AT viardfrederique highthroughputsequencingonpreservativeethanoliseffectiveatjointlyexamininginfraspecificandtaxonomicdiversityalthoughbioinformaticspipelinesdonotperformequally