Cargando…

High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy

Until now, the potential of NGS for the construction of barcode libraries or integrative taxonomy has been seldom realised. Here, we amplified (two-step PCR) and simultaneously sequenced (MiSeq) multiple markers from hundreds of fig wasp specimens. We also developed a workflow for quality control of...

Descripción completa

Detalles Bibliográficos
Autores principales: Cruaud, Perrine, Rasplus, Jean-Yves, Rodriguez, Lillian Jennifer, Cruaud, Astrid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292727/
https://www.ncbi.nlm.nih.gov/pubmed/28165046
http://dx.doi.org/10.1038/srep41948
_version_ 1782504978965856256
author Cruaud, Perrine
Rasplus, Jean-Yves
Rodriguez, Lillian Jennifer
Cruaud, Astrid
author_facet Cruaud, Perrine
Rasplus, Jean-Yves
Rodriguez, Lillian Jennifer
Cruaud, Astrid
author_sort Cruaud, Perrine
collection PubMed
description Until now, the potential of NGS for the construction of barcode libraries or integrative taxonomy has been seldom realised. Here, we amplified (two-step PCR) and simultaneously sequenced (MiSeq) multiple markers from hundreds of fig wasp specimens. We also developed a workflow for quality control of the data. Illumina and Sanger sequences accumulated in the past years were compared. Interestingly, primers and PCR conditions used for the Sanger approach did not require optimisation to construct the MiSeq library. After quality controls, 87% of the species (76% of the specimens) had a valid MiSeq sequence for each marker. Importantly, major clusters did not always correspond to the targeted loci. Nine specimens exhibited two divergent sequences (up to 10%). In 95% of the species, MiSeq and Sanger sequences obtained from the same sampling were similar. For the remaining 5%, species were paraphyletic or the sequences clustered into divergent groups on the Sanger + MiSeq trees (>7%). These problematic cases may represent coding NUMTS or heteroplasms. Our results illustrate that Illumina approaches are not artefact-free and confirm that Sanger databases can contain non-target genes. This highlights the importance of quality controls, working with taxonomists and using multiple markers for DNA-taxonomy or species diversity assessment.
format Online
Article
Text
id pubmed-5292727
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-52927272017-02-10 High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy Cruaud, Perrine Rasplus, Jean-Yves Rodriguez, Lillian Jennifer Cruaud, Astrid Sci Rep Article Until now, the potential of NGS for the construction of barcode libraries or integrative taxonomy has been seldom realised. Here, we amplified (two-step PCR) and simultaneously sequenced (MiSeq) multiple markers from hundreds of fig wasp specimens. We also developed a workflow for quality control of the data. Illumina and Sanger sequences accumulated in the past years were compared. Interestingly, primers and PCR conditions used for the Sanger approach did not require optimisation to construct the MiSeq library. After quality controls, 87% of the species (76% of the specimens) had a valid MiSeq sequence for each marker. Importantly, major clusters did not always correspond to the targeted loci. Nine specimens exhibited two divergent sequences (up to 10%). In 95% of the species, MiSeq and Sanger sequences obtained from the same sampling were similar. For the remaining 5%, species were paraphyletic or the sequences clustered into divergent groups on the Sanger + MiSeq trees (>7%). These problematic cases may represent coding NUMTS or heteroplasms. Our results illustrate that Illumina approaches are not artefact-free and confirm that Sanger databases can contain non-target genes. This highlights the importance of quality controls, working with taxonomists and using multiple markers for DNA-taxonomy or species diversity assessment. Nature Publishing Group 2017-02-06 /pmc/articles/PMC5292727/ /pubmed/28165046 http://dx.doi.org/10.1038/srep41948 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Cruaud, Perrine
Rasplus, Jean-Yves
Rodriguez, Lillian Jennifer
Cruaud, Astrid
High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
title High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
title_full High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
title_fullStr High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
title_full_unstemmed High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
title_short High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
title_sort high-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292727/
https://www.ncbi.nlm.nih.gov/pubmed/28165046
http://dx.doi.org/10.1038/srep41948
work_keys_str_mv AT cruaudperrine highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy
AT rasplusjeanyves highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy
AT rodriguezlillianjennifer highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy
AT cruaudastrid highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy