Cargando…

svtools: population-scale analysis of structural variation

SUMMARY: Large-scale human genetics studies are now employing whole genome sequencing with the goal of conducting comprehensive trait mapping analyses of all forms of genome variation. However, methods for structural variation (SV) analysis have lagged far behind those for smaller scale variants, an...

Descripción completa

Detalles Bibliográficos
Autores principales: Larson, David E, Abel, Haley J, Chiang, Colby, Badve, Abhijit, Das, Indraniel, Eldred, James M, Layer, Ryan M, Hall, Ira M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853660/
https://www.ncbi.nlm.nih.gov/pubmed/31218349
http://dx.doi.org/10.1093/bioinformatics/btz492
_version_ 1783470076603662336
author Larson, David E
Abel, Haley J
Chiang, Colby
Badve, Abhijit
Das, Indraniel
Eldred, James M
Layer, Ryan M
Hall, Ira M
author_facet Larson, David E
Abel, Haley J
Chiang, Colby
Badve, Abhijit
Das, Indraniel
Eldred, James M
Layer, Ryan M
Hall, Ira M
author_sort Larson, David E
collection PubMed
description SUMMARY: Large-scale human genetics studies are now employing whole genome sequencing with the goal of conducting comprehensive trait mapping analyses of all forms of genome variation. However, methods for structural variation (SV) analysis have lagged far behind those for smaller scale variants, and there is an urgent need to develop more efficient tools that scale to the size of human populations. Here, we present a fast and highly scalable software toolkit (svtools) and cloud-based pipeline for assembling high quality SV maps—including deletions, duplications, mobile element insertions, inversions and other rearrangements—in many thousands of human genomes. We show that this pipeline achieves similar variant detection performance to established per-sample methods (e.g. LUMPY), while providing fast and affordable joint analysis at the scale of ≥100 000 genomes. These tools will help enable the next generation of human genetics studies. AVAILABILITY AND IMPLEMENTATION: svtools is implemented in Python and freely available (MIT) from https://github.com/hall-lab/svtools. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6853660
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68536602019-11-19 svtools: population-scale analysis of structural variation Larson, David E Abel, Haley J Chiang, Colby Badve, Abhijit Das, Indraniel Eldred, James M Layer, Ryan M Hall, Ira M Bioinformatics Applications Notes SUMMARY: Large-scale human genetics studies are now employing whole genome sequencing with the goal of conducting comprehensive trait mapping analyses of all forms of genome variation. However, methods for structural variation (SV) analysis have lagged far behind those for smaller scale variants, and there is an urgent need to develop more efficient tools that scale to the size of human populations. Here, we present a fast and highly scalable software toolkit (svtools) and cloud-based pipeline for assembling high quality SV maps—including deletions, duplications, mobile element insertions, inversions and other rearrangements—in many thousands of human genomes. We show that this pipeline achieves similar variant detection performance to established per-sample methods (e.g. LUMPY), while providing fast and affordable joint analysis at the scale of ≥100 000 genomes. These tools will help enable the next generation of human genetics studies. AVAILABILITY AND IMPLEMENTATION: svtools is implemented in Python and freely available (MIT) from https://github.com/hall-lab/svtools. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-11-15 2019-06-20 /pmc/articles/PMC6853660/ /pubmed/31218349 http://dx.doi.org/10.1093/bioinformatics/btz492 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Larson, David E
Abel, Haley J
Chiang, Colby
Badve, Abhijit
Das, Indraniel
Eldred, James M
Layer, Ryan M
Hall, Ira M
svtools: population-scale analysis of structural variation
title svtools: population-scale analysis of structural variation
title_full svtools: population-scale analysis of structural variation
title_fullStr svtools: population-scale analysis of structural variation
title_full_unstemmed svtools: population-scale analysis of structural variation
title_short svtools: population-scale analysis of structural variation
title_sort svtools: population-scale analysis of structural variation
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853660/
https://www.ncbi.nlm.nih.gov/pubmed/31218349
http://dx.doi.org/10.1093/bioinformatics/btz492
work_keys_str_mv AT larsondavide svtoolspopulationscaleanalysisofstructuralvariation
AT abelhaleyj svtoolspopulationscaleanalysisofstructuralvariation
AT chiangcolby svtoolspopulationscaleanalysisofstructuralvariation
AT badveabhijit svtoolspopulationscaleanalysisofstructuralvariation
AT dasindraniel svtoolspopulationscaleanalysisofstructuralvariation
AT eldredjamesm svtoolspopulationscaleanalysisofstructuralvariation
AT layerryanm svtoolspopulationscaleanalysisofstructuralvariation
AT halliram svtoolspopulationscaleanalysisofstructuralvariation