Cargando…

POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments

BACKGROUND: While many bioinformatics tools currently exist for assembling and discovering variants from next-generation sequence data, there are very few tools available for performing evolutionary analyses from these data. Evolutionary and population genomics studies hold great promise for providi...

Descripción completa

Detalles Bibliográficos
Autor principal: Garrigan, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3767577/
https://www.ncbi.nlm.nih.gov/pubmed/24027417
http://dx.doi.org/10.4137/EBO.S12751
_version_ 1782283664571236352
author Garrigan, Daniel
author_facet Garrigan, Daniel
author_sort Garrigan, Daniel
collection PubMed
description BACKGROUND: While many bioinformatics tools currently exist for assembling and discovering variants from next-generation sequence data, there are very few tools available for performing evolutionary analyses from these data. Evolutionary and population genomics studies hold great promise for providing valuable insights into natural selection, the effect of mutations on phenotypes, and the origin of species. Thus, there is a need for an extensible and flexible computational tool that can function into a growing number of evolutionary bioinformatics pipelines. RESULTS: This paper describes the POPBAM software, which is a comprehensive set of computational tools for evolutionary analysis of whole-genome alignments consisting of multiple individuals, from multiple populations or species. POPBAM works directly from BAM-formatted assembly files, calls variant sites, and calculates a variety of commonly used evolutionary sequence statistics. POPBAM is designed primarily to perform analyses in sliding windows across chromosomes or scaffolds. POPBAM accurately measures nucleotide diversity, population divergence, linkage disequilibrium, and the frequency spectrum of mutations from two or more populations. POPBAM can also produce phylogenetic trees of all samples in a BAM file. Finally, I demonstrate that the implementation of POPBAM is both fast and memory-efficient, and also can feasibly scale to the analysis of large BAM files with many individuals and populations. Software: The POPBAM program is written in C/C++ and is available from http://dgarriga.github.io/POPBAM. The program has few dependencies and can be built on a variety of Linux platforms. The program is open-source and users are encouraged to participate in the development of this resource.
format Online
Article
Text
id pubmed-3767577
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-37675772013-09-11 POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments Garrigan, Daniel Evol Bioinform Online Original Research BACKGROUND: While many bioinformatics tools currently exist for assembling and discovering variants from next-generation sequence data, there are very few tools available for performing evolutionary analyses from these data. Evolutionary and population genomics studies hold great promise for providing valuable insights into natural selection, the effect of mutations on phenotypes, and the origin of species. Thus, there is a need for an extensible and flexible computational tool that can function into a growing number of evolutionary bioinformatics pipelines. RESULTS: This paper describes the POPBAM software, which is a comprehensive set of computational tools for evolutionary analysis of whole-genome alignments consisting of multiple individuals, from multiple populations or species. POPBAM works directly from BAM-formatted assembly files, calls variant sites, and calculates a variety of commonly used evolutionary sequence statistics. POPBAM is designed primarily to perform analyses in sliding windows across chromosomes or scaffolds. POPBAM accurately measures nucleotide diversity, population divergence, linkage disequilibrium, and the frequency spectrum of mutations from two or more populations. POPBAM can also produce phylogenetic trees of all samples in a BAM file. Finally, I demonstrate that the implementation of POPBAM is both fast and memory-efficient, and also can feasibly scale to the analysis of large BAM files with many individuals and populations. Software: The POPBAM program is written in C/C++ and is available from http://dgarriga.github.io/POPBAM. The program has few dependencies and can be built on a variety of Linux platforms. The program is open-source and users are encouraged to participate in the development of this resource. Libertas Academica 2013-09-01 /pmc/articles/PMC3767577/ /pubmed/24027417 http://dx.doi.org/10.4137/EBO.S12751 Text en © 2013 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article published under the Creative Commons CC-BY-NC 3.0 license.
spellingShingle Original Research
Garrigan, Daniel
POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_full POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_fullStr POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_full_unstemmed POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_short POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_sort popbam: tools for evolutionary analysis of short read sequence alignments
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3767577/
https://www.ncbi.nlm.nih.gov/pubmed/24027417
http://dx.doi.org/10.4137/EBO.S12751
work_keys_str_mv AT garrigandaniel popbamtoolsforevolutionaryanalysisofshortreadsequencealignments