Cargando…

Lightning-fast genome variant detection with GROM

Current human whole genome sequencing projects produce massive amounts of data, often creating significant computational challenges. Different approaches have been developed for each type of genome variant and method of its detection, necessitating users to run multiple algorithms to find variants....

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Sean D, Kawash, Joseph K, Grigoriev, Andrey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737730/
https://www.ncbi.nlm.nih.gov/pubmed/29048532
http://dx.doi.org/10.1093/gigascience/gix091
_version_ 1783287576528945152
author Smith, Sean D
Kawash, Joseph K
Grigoriev, Andrey
author_facet Smith, Sean D
Kawash, Joseph K
Grigoriev, Andrey
author_sort Smith, Sean D
collection PubMed
description Current human whole genome sequencing projects produce massive amounts of data, often creating significant computational challenges. Different approaches have been developed for each type of genome variant and method of its detection, necessitating users to run multiple algorithms to find variants. We present Genome Rearrangement OmniMapper (GROM), a novel comprehensive variant detection algorithm accepting aligned read files as input and finding SNVs, indels, structural variants (SVs), and copy number variants (CNVs). We show that GROM outperforms state-of-the-art methods on 7 validated benchmarks using 2 whole genome sequencing (WGS) data sets. Additionally, GROM boasts lightning-fast run times, analyzing a 50× WGS human data set (NA12878) on commonly available computer hardware in 11 minutes, more than an order of magnitude (up to 72 times) faster than tools detecting a similar range of variants. Addressing the needs of big data analysis, GROM combines in 1 algorithm SNV, indel, SV, and CNV detection, providing superior speed, sensitivity, and precision. GROM is also able to detect CNVs, SNVs, and indels in non-paired-read WGS libraries, as well as SNVs and indels in whole exome or RNA sequencing data sets.
format Online
Article
Text
id pubmed-5737730
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57377302018-01-04 Lightning-fast genome variant detection with GROM Smith, Sean D Kawash, Joseph K Grigoriev, Andrey Gigascience Technical Note Current human whole genome sequencing projects produce massive amounts of data, often creating significant computational challenges. Different approaches have been developed for each type of genome variant and method of its detection, necessitating users to run multiple algorithms to find variants. We present Genome Rearrangement OmniMapper (GROM), a novel comprehensive variant detection algorithm accepting aligned read files as input and finding SNVs, indels, structural variants (SVs), and copy number variants (CNVs). We show that GROM outperforms state-of-the-art methods on 7 validated benchmarks using 2 whole genome sequencing (WGS) data sets. Additionally, GROM boasts lightning-fast run times, analyzing a 50× WGS human data set (NA12878) on commonly available computer hardware in 11 minutes, more than an order of magnitude (up to 72 times) faster than tools detecting a similar range of variants. Addressing the needs of big data analysis, GROM combines in 1 algorithm SNV, indel, SV, and CNV detection, providing superior speed, sensitivity, and precision. GROM is also able to detect CNVs, SNVs, and indels in non-paired-read WGS libraries, as well as SNVs and indels in whole exome or RNA sequencing data sets. Oxford University Press 2017-09-18 /pmc/articles/PMC5737730/ /pubmed/29048532 http://dx.doi.org/10.1093/gigascience/gix091 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Smith, Sean D
Kawash, Joseph K
Grigoriev, Andrey
Lightning-fast genome variant detection with GROM
title Lightning-fast genome variant detection with GROM
title_full Lightning-fast genome variant detection with GROM
title_fullStr Lightning-fast genome variant detection with GROM
title_full_unstemmed Lightning-fast genome variant detection with GROM
title_short Lightning-fast genome variant detection with GROM
title_sort lightning-fast genome variant detection with grom
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737730/
https://www.ncbi.nlm.nih.gov/pubmed/29048532
http://dx.doi.org/10.1093/gigascience/gix091
work_keys_str_mv AT smithseand lightningfastgenomevariantdetectionwithgrom
AT kawashjosephk lightningfastgenomevariantdetectionwithgrom
AT grigorievandrey lightningfastgenomevariantdetectionwithgrom