Cargando…
High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy
Until now, the potential of NGS for the construction of barcode libraries or integrative taxonomy has been seldom realised. Here, we amplified (two-step PCR) and simultaneously sequenced (MiSeq) multiple markers from hundreds of fig wasp specimens. We also developed a workflow for quality control of...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292727/ https://www.ncbi.nlm.nih.gov/pubmed/28165046 http://dx.doi.org/10.1038/srep41948 |
_version_ | 1782504978965856256 |
---|---|
author | Cruaud, Perrine Rasplus, Jean-Yves Rodriguez, Lillian Jennifer Cruaud, Astrid |
author_facet | Cruaud, Perrine Rasplus, Jean-Yves Rodriguez, Lillian Jennifer Cruaud, Astrid |
author_sort | Cruaud, Perrine |
collection | PubMed |
description | Until now, the potential of NGS for the construction of barcode libraries or integrative taxonomy has been seldom realised. Here, we amplified (two-step PCR) and simultaneously sequenced (MiSeq) multiple markers from hundreds of fig wasp specimens. We also developed a workflow for quality control of the data. Illumina and Sanger sequences accumulated in the past years were compared. Interestingly, primers and PCR conditions used for the Sanger approach did not require optimisation to construct the MiSeq library. After quality controls, 87% of the species (76% of the specimens) had a valid MiSeq sequence for each marker. Importantly, major clusters did not always correspond to the targeted loci. Nine specimens exhibited two divergent sequences (up to 10%). In 95% of the species, MiSeq and Sanger sequences obtained from the same sampling were similar. For the remaining 5%, species were paraphyletic or the sequences clustered into divergent groups on the Sanger + MiSeq trees (>7%). These problematic cases may represent coding NUMTS or heteroplasms. Our results illustrate that Illumina approaches are not artefact-free and confirm that Sanger databases can contain non-target genes. This highlights the importance of quality controls, working with taxonomists and using multiple markers for DNA-taxonomy or species diversity assessment. |
format | Online Article Text |
id | pubmed-5292727 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-52927272017-02-10 High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy Cruaud, Perrine Rasplus, Jean-Yves Rodriguez, Lillian Jennifer Cruaud, Astrid Sci Rep Article Until now, the potential of NGS for the construction of barcode libraries or integrative taxonomy has been seldom realised. Here, we amplified (two-step PCR) and simultaneously sequenced (MiSeq) multiple markers from hundreds of fig wasp specimens. We also developed a workflow for quality control of the data. Illumina and Sanger sequences accumulated in the past years were compared. Interestingly, primers and PCR conditions used for the Sanger approach did not require optimisation to construct the MiSeq library. After quality controls, 87% of the species (76% of the specimens) had a valid MiSeq sequence for each marker. Importantly, major clusters did not always correspond to the targeted loci. Nine specimens exhibited two divergent sequences (up to 10%). In 95% of the species, MiSeq and Sanger sequences obtained from the same sampling were similar. For the remaining 5%, species were paraphyletic or the sequences clustered into divergent groups on the Sanger + MiSeq trees (>7%). These problematic cases may represent coding NUMTS or heteroplasms. Our results illustrate that Illumina approaches are not artefact-free and confirm that Sanger databases can contain non-target genes. This highlights the importance of quality controls, working with taxonomists and using multiple markers for DNA-taxonomy or species diversity assessment. Nature Publishing Group 2017-02-06 /pmc/articles/PMC5292727/ /pubmed/28165046 http://dx.doi.org/10.1038/srep41948 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Cruaud, Perrine Rasplus, Jean-Yves Rodriguez, Lillian Jennifer Cruaud, Astrid High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy |
title | High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy |
title_full | High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy |
title_fullStr | High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy |
title_full_unstemmed | High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy |
title_short | High-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy |
title_sort | high-throughput sequencing of multiple amplicons for barcoding and integrative taxonomy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292727/ https://www.ncbi.nlm.nih.gov/pubmed/28165046 http://dx.doi.org/10.1038/srep41948 |
work_keys_str_mv | AT cruaudperrine highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy AT rasplusjeanyves highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy AT rodriguezlillianjennifer highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy AT cruaudastrid highthroughputsequencingofmultipleampliconsforbarcodingandintegrativetaxonomy |