Cargando…
Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
We estimate the global BOLD Systems database holds core DNA barcodes (rbcL + matK) for about 15% of land plant species and that comprehensive species coverage is still many decades away. Interim performance of the resource is compromised by variable sequence overlap and modest information content wi...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388885/ https://www.ncbi.nlm.nih.gov/pubmed/28401958 http://dx.doi.org/10.1038/srep46040 |
_version_ | 1782521195141267456 |
---|---|
author | Wilkinson, Mike J. Szabo, Claudia Ford, Caroline S. Yarom, Yuval Croxford, Adam E. Camp, Amanda Gooding, Paul |
author_facet | Wilkinson, Mike J. Szabo, Claudia Ford, Caroline S. Yarom, Yuval Croxford, Adam E. Camp, Amanda Gooding, Paul |
author_sort | Wilkinson, Mike J. |
collection | PubMed |
description | We estimate the global BOLD Systems database holds core DNA barcodes (rbcL + matK) for about 15% of land plant species and that comprehensive species coverage is still many decades away. Interim performance of the resource is compromised by variable sequence overlap and modest information content within each barcode. Our model predicts that the proportion of species-unique barcodes reduces as the database grows and that ‘false’ species-unique barcodes remain >5% until the database is almost complete. We conclude the current rbcL + matK barcode is unfit for purpose. Genome skimming and supplementary barcodes could improve diagnostic power but would slow new barcode acquisition. We therefore present two novel Next Generation Sequencing protocols (with freeware) capable of accurate, massively parallel de novo assembly of high quality DNA barcodes of >1400 bp. We explore how these capabilities could enhance species diagnosis in the coming decades. |
format | Online Article Text |
id | pubmed-5388885 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53888852017-04-14 Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants Wilkinson, Mike J. Szabo, Claudia Ford, Caroline S. Yarom, Yuval Croxford, Adam E. Camp, Amanda Gooding, Paul Sci Rep Article We estimate the global BOLD Systems database holds core DNA barcodes (rbcL + matK) for about 15% of land plant species and that comprehensive species coverage is still many decades away. Interim performance of the resource is compromised by variable sequence overlap and modest information content within each barcode. Our model predicts that the proportion of species-unique barcodes reduces as the database grows and that ‘false’ species-unique barcodes remain >5% until the database is almost complete. We conclude the current rbcL + matK barcode is unfit for purpose. Genome skimming and supplementary barcodes could improve diagnostic power but would slow new barcode acquisition. We therefore present two novel Next Generation Sequencing protocols (with freeware) capable of accurate, massively parallel de novo assembly of high quality DNA barcodes of >1400 bp. We explore how these capabilities could enhance species diagnosis in the coming decades. Nature Publishing Group 2017-04-12 /pmc/articles/PMC5388885/ /pubmed/28401958 http://dx.doi.org/10.1038/srep46040 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Wilkinson, Mike J. Szabo, Claudia Ford, Caroline S. Yarom, Yuval Croxford, Adam E. Camp, Amanda Gooding, Paul Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants |
title | Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants |
title_full | Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants |
title_fullStr | Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants |
title_full_unstemmed | Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants |
title_short | Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants |
title_sort | replacing sanger with next generation sequencing to improve coverage and quality of reference dna barcodes for plants |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388885/ https://www.ncbi.nlm.nih.gov/pubmed/28401958 http://dx.doi.org/10.1038/srep46040 |
work_keys_str_mv | AT wilkinsonmikej replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants AT szaboclaudia replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants AT fordcarolines replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants AT yaromyuval replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants AT croxfordadame replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants AT campamanda replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants AT goodingpaul replacingsangerwithnextgenerationsequencingtoimprovecoverageandqualityofreferencednabarcodesforplants |