Cargando…
ANGSD: Analysis of Next Generation Sequencing Data
BACKGROUND: High-throughput DNA sequencing technologies are generating vast amounts of data. Fast, flexible and memory efficient implementations are needed in order to facilitate analyses of thousands of samples simultaneously. RESULTS: We present a multithreaded program suite called ANGSD. This pro...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4248462/ https://www.ncbi.nlm.nih.gov/pubmed/25420514 http://dx.doi.org/10.1186/s12859-014-0356-4 |
_version_ | 1782346805682372608 |
---|---|
author | Korneliussen, Thorfinn Sand Albrechtsen, Anders Nielsen, Rasmus |
author_facet | Korneliussen, Thorfinn Sand Albrechtsen, Anders Nielsen, Rasmus |
author_sort | Korneliussen, Thorfinn Sand |
collection | PubMed |
description | BACKGROUND: High-throughput DNA sequencing technologies are generating vast amounts of data. Fast, flexible and memory efficient implementations are needed in order to facilitate analyses of thousands of samples simultaneously. RESULTS: We present a multithreaded program suite called ANGSD. This program can calculate various summary statistics, and perform association mapping and population genetic analyses utilizing the full information in next generation sequencing data by working directly on the raw sequencing data or by using genotype likelihoods. CONCLUSIONS: The open source c/c++ program ANGSD is available at http://www.popgen.dk/angsd. The program is tested and validated on GNU/Linux systems. The program facilitates multiple input formats including BAM and imputed beagle genotype probability files. The program allow the user to choose between combinations of existing methods and can perform analysis that is not implemented elsewhere. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0356-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4248462 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42484622014-12-02 ANGSD: Analysis of Next Generation Sequencing Data Korneliussen, Thorfinn Sand Albrechtsen, Anders Nielsen, Rasmus BMC Bioinformatics Software BACKGROUND: High-throughput DNA sequencing technologies are generating vast amounts of data. Fast, flexible and memory efficient implementations are needed in order to facilitate analyses of thousands of samples simultaneously. RESULTS: We present a multithreaded program suite called ANGSD. This program can calculate various summary statistics, and perform association mapping and population genetic analyses utilizing the full information in next generation sequencing data by working directly on the raw sequencing data or by using genotype likelihoods. CONCLUSIONS: The open source c/c++ program ANGSD is available at http://www.popgen.dk/angsd. The program is tested and validated on GNU/Linux systems. The program facilitates multiple input formats including BAM and imputed beagle genotype probability files. The program allow the user to choose between combinations of existing methods and can perform analysis that is not implemented elsewhere. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0356-4) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-25 /pmc/articles/PMC4248462/ /pubmed/25420514 http://dx.doi.org/10.1186/s12859-014-0356-4 Text en © Korneliussen et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Korneliussen, Thorfinn Sand Albrechtsen, Anders Nielsen, Rasmus ANGSD: Analysis of Next Generation Sequencing Data |
title | ANGSD: Analysis of Next Generation Sequencing Data |
title_full | ANGSD: Analysis of Next Generation Sequencing Data |
title_fullStr | ANGSD: Analysis of Next Generation Sequencing Data |
title_full_unstemmed | ANGSD: Analysis of Next Generation Sequencing Data |
title_short | ANGSD: Analysis of Next Generation Sequencing Data |
title_sort | angsd: analysis of next generation sequencing data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4248462/ https://www.ncbi.nlm.nih.gov/pubmed/25420514 http://dx.doi.org/10.1186/s12859-014-0356-4 |
work_keys_str_mv | AT korneliussenthorfinnsand angsdanalysisofnextgenerationsequencingdata AT albrechtsenanders angsdanalysisofnextgenerationsequencingdata AT nielsenrasmus angsdanalysisofnextgenerationsequencingdata |