Skmer: assembly-free and alignment-free sample identification using genome skims

The ability to inexpensively describe taxonomic diversity is critical in this era of rapid climate and biodiversity changes. The recent genome-skimming approach extends current barcoding practices beyond short markers by applying low-pass sequencing and recovering whole organelle genomes computation...

Bibliographic Details
Main Authors: Sarmashghi, Shahab; Bohmann, Kristine; P. Gilbert, M. Thomas; Bafna, Vineet; Mirarab, Siavash
Format: Online Article Text
Language: English
Published: BioMed Central 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6374904/
https://www.ncbi.nlm.nih.gov/pubmed/30760303
http://dx.doi.org/10.1186/s13059-019-1632-4
_version_ 1783395260248883200
author Sarmashghi, Shahab
Bohmann, Kristine
P. Gilbert, M. Thomas
Bafna, Vineet
Mirarab, Siavash
collection PubMed
description The ability to inexpensively describe taxonomic diversity is critical in this era of rapid climate and biodiversity changes. The recent genome-skimming approach extends current barcoding practices beyond short markers by applying low-pass sequencing and recovering whole organelle genomes computationally. This approach discards the nuclear DNA, which constitutes the vast majority of the data. In contrast, we suggest using all unassembled reads. We introduce an assembly-free and alignment-free tool, Skmer, to compute genomic distances between the query and reference genome skims. Skmer shows excellent accuracy in estimating distances and identifying the closest match in reference datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-019-1632-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6374904
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63749042019-02-26 Skmer: assembly-free and alignment-free sample identification using genome skims Sarmashghi, Shahab Bohmann, Kristine P. Gilbert, M. Thomas Bafna, Vineet Mirarab, Siavash Genome Biol Method The ability to inexpensively describe taxonomic diversity is critical in this era of rapid climate and biodiversity changes. The recent genome-skimming approach extends current barcoding practices beyond short markers by applying low-pass sequencing and recovering whole organelle genomes computationally. This approach discards the nuclear DNA, which constitutes the vast majority of the data. In contrast, we suggest using all unassembled reads. We introduce an assembly-free and alignment-free tool, Skmer, to compute genomic distances between the query and reference genome skims. Skmer shows excellent accuracy in estimating distances and identifying the closest match in reference datasets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-019-1632-4) contains supplementary material, which is available to authorized users. BioMed Central 2019-02-13 /pmc/articles/PMC6374904/ /pubmed/30760303 http://dx.doi.org/10.1186/s13059-019-1632-4 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Skmer: assembly-free and alignment-free sample identification using genome skims
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6374904/
https://www.ncbi.nlm.nih.gov/pubmed/30760303
http://dx.doi.org/10.1186/s13059-019-1632-4