Cargando…

Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants

We estimate the global BOLD Systems database holds core DNA barcodes (rbcL + matK) for about 15% of land plant species and that comprehensive species coverage is still many decades away. Interim performance of the resource is compromised by variable sequence overlap and modest information content wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilkinson, Mike J., Szabo, Claudia, Ford, Caroline S., Yarom, Yuval, Croxford, Adam E., Camp, Amanda, Gooding, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388885/
https://www.ncbi.nlm.nih.gov/pubmed/28401958
http://dx.doi.org/10.1038/srep46040
_version_ 1782521195141267456
author Wilkinson, Mike J.
Szabo, Claudia
Ford, Caroline S.
Yarom, Yuval
Croxford, Adam E.
Camp, Amanda
Gooding, Paul
author_facet Wilkinson, Mike J.
Szabo, Claudia
Ford, Caroline S.
Yarom, Yuval
Croxford, Adam E.
Camp, Amanda
Gooding, Paul
author_sort Wilkinson, Mike J.
collection PubMed
description We estimate the global BOLD Systems database holds core DNA barcodes (rbcL + matK) for about 15% of land plant species and that comprehensive species coverage is still many decades away. Interim performance of the resource is compromised by variable sequence overlap and modest information content within each barcode. Our model predicts that the proportion of species-unique barcodes reduces as the database grows and that ‘false’ species-unique barcodes remain >5% until the database is almost complete. We conclude the current rbcL + matK barcode is unfit for purpose. Genome skimming and supplementary barcodes could improve diagnostic power but would slow new barcode acquisition. We therefore present two novel Next Generation Sequencing protocols (with freeware) capable of accurate, massively parallel de novo assembly of high quality DNA barcodes of >1400 bp. We explore how these capabilities could enhance species diagnosis in the coming decades.
format Online
Article
Text
id pubmed-5388885
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53888852017-04-14 Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants Wilkinson, Mike J. Szabo, Claudia Ford, Caroline S. Yarom, Yuval Croxford, Adam E. Camp, Amanda Gooding, Paul Sci Rep Article We estimate the global BOLD Systems database holds core DNA barcodes (rbcL + matK) for about 15% of land plant species and that comprehensive species coverage is still many decades away. Interim performance of the resource is compromised by variable sequence overlap and modest information content within each barcode. Our model predicts that the proportion of species-unique barcodes reduces as the database grows and that ‘false’ species-unique barcodes remain >5% until the database is almost complete. We conclude the current rbcL + matK barcode is unfit for purpose. Genome skimming and supplementary barcodes could improve diagnostic power but would slow new barcode acquisition. We therefore present two novel Next Generation Sequencing protocols (with freeware) capable of accurate, massively parallel de novo assembly of high quality DNA barcodes of >1400 bp. We explore how these capabilities could enhance species diagnosis in the coming decades. Nature Publishing Group 2017-04-12 /pmc/articles/PMC5388885/ /pubmed/28401958 http://dx.doi.org/10.1038/srep46040 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Wilkinson, Mike J.
Szabo, Claudia
Ford, Caroline S.
Yarom, Yuval
Croxford, Adam E.
Camp, Amanda
Gooding, Paul
Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
title Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
title_full Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
title_fullStr Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
title_full_unstemmed Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
title_short Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
title_sort replacing sanger with next generation sequencing to improve coverage and quality of reference dna barcodes for plants
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388885/
https://www.ncbi.nlm.nih.gov/pubmed/28401958
http://dx.doi.org/10.1038/srep46040
work_keys_str_mv AT wilkinsonmikej replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants
AT szaboclaudia replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants
AT fordcarolines replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants
AT yaromyuval replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants
AT croxfordadame replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants
AT campamanda replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants
AT goodingpaul replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants