Cargando…
Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification
Genetic tools are increasingly used to identify and discriminate between species. One key transition in this process was the recognition of the potential of the ca 658bp fragment of the organelle cytochrome c oxidase I (COI) as a barcode region, which revolutionized animal bioidentification and lead...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7496323/ https://www.ncbi.nlm.nih.gov/pubmed/32542933 http://dx.doi.org/10.1111/mec.15507 |
_version_ | 1783583071040176128 |
---|---|
author | Bohmann, Kristine Mirarab, Siavash Bafna, Vineet Gilbert, M. Thomas P. |
author_facet | Bohmann, Kristine Mirarab, Siavash Bafna, Vineet Gilbert, M. Thomas P. |
author_sort | Bohmann, Kristine |
collection | PubMed |
description | Genetic tools are increasingly used to identify and discriminate between species. One key transition in this process was the recognition of the potential of the ca 658bp fragment of the organelle cytochrome c oxidase I (COI) as a barcode region, which revolutionized animal bioidentification and lead, among others, to the instigation of the Barcode of Life Database (BOLD), containing currently barcodes from >7.9 million specimens. Following this discovery, suggestions for other organellar regions and markers, and the primers with which to amplify them, have been continuously proposed. Most recently, the field has taken the leap from PCR‐based generation of DNA references into shotgun sequencing‐based “genome skimming” alternatives, with the ultimate goal of assembling organellar reference genomes. Unfortunately, in genome skimming approaches, much of the nuclear genome (as much as 99% of the sequence data) is discarded, which is not only wasteful, but can also limit the power of discrimination at, or below, the species level. Here, we advocate that the full shotgun sequence data can be used to assign an identity (that we term for convenience its “DNA‐mark”) for both voucher and query samples, without requiring any computationally intensive pretreatment (e.g. assembly) of reads. We argue that if reference databases are populated with such “DNA‐marks,” it will enable future DNA‐based taxonomic identification to complement, or even replace PCR of barcodes with genome skimming, and we discuss how such methodology ultimately could enable identification to population, or even individual, level. |
format | Online Article Text |
id | pubmed-7496323 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-74963232020-09-25 Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification Bohmann, Kristine Mirarab, Siavash Bafna, Vineet Gilbert, M. Thomas P. Mol Ecol News and Views Genetic tools are increasingly used to identify and discriminate between species. One key transition in this process was the recognition of the potential of the ca 658bp fragment of the organelle cytochrome c oxidase I (COI) as a barcode region, which revolutionized animal bioidentification and lead, among others, to the instigation of the Barcode of Life Database (BOLD), containing currently barcodes from >7.9 million specimens. Following this discovery, suggestions for other organellar regions and markers, and the primers with which to amplify them, have been continuously proposed. Most recently, the field has taken the leap from PCR‐based generation of DNA references into shotgun sequencing‐based “genome skimming” alternatives, with the ultimate goal of assembling organellar reference genomes. Unfortunately, in genome skimming approaches, much of the nuclear genome (as much as 99% of the sequence data) is discarded, which is not only wasteful, but can also limit the power of discrimination at, or below, the species level. Here, we advocate that the full shotgun sequence data can be used to assign an identity (that we term for convenience its “DNA‐mark”) for both voucher and query samples, without requiring any computationally intensive pretreatment (e.g. assembly) of reads. We argue that if reference databases are populated with such “DNA‐marks,” it will enable future DNA‐based taxonomic identification to complement, or even replace PCR of barcodes with genome skimming, and we discuss how such methodology ultimately could enable identification to population, or even individual, level. John Wiley and Sons Inc. 2020-06-29 2020-07 /pmc/articles/PMC7496323/ /pubmed/32542933 http://dx.doi.org/10.1111/mec.15507 Text en © 2020 The Authors. Molecular Ecology published by John Wiley & Sons Ltd This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | News and Views Bohmann, Kristine Mirarab, Siavash Bafna, Vineet Gilbert, M. Thomas P. Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification |
title | Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification |
title_full | Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification |
title_fullStr | Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification |
title_full_unstemmed | Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification |
title_short | Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification |
title_sort | beyond dna barcoding: the unrealized potential of genome skim data in sample identification |
topic | News and Views |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7496323/ https://www.ncbi.nlm.nih.gov/pubmed/32542933 http://dx.doi.org/10.1111/mec.15507 |
work_keys_str_mv | AT bohmannkristine beyonddnabarcodingtheunrealizedpotentialofgenomeskimdatainsampleidentification AT mirarabsiavash beyonddnabarcodingtheunrealizedpotentialofgenomeskimdatainsampleidentification AT bafnavineet beyonddnabarcodingtheunrealizedpotentialofgenomeskimdatainsampleidentification AT gilbertmthomasp beyonddnabarcodingtheunrealizedpotentialofgenomeskimdatainsampleidentification |