Cargando…

admixr—R package for reproducible analyses using ADMIXTOOLS

SUMMARY: We present a new R package admixr, which provides a convenient interface for performing reproducible population genetic analyses (f(3), D, f(4), f(4)-ratio, qpWave and qpAdm), as implemented by command-line programs in the ADMIXTOOLS software suite. In a traditional ADMIXTOOLS workflow, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Petr, Martin, Vernot, Benjamin, Kelso, Janet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6736366/
https://www.ncbi.nlm.nih.gov/pubmed/30668635
http://dx.doi.org/10.1093/bioinformatics/btz030
Descripción
Sumario:SUMMARY: We present a new R package admixr, which provides a convenient interface for performing reproducible population genetic analyses (f(3), D, f(4), f(4)-ratio, qpWave and qpAdm), as implemented by command-line programs in the ADMIXTOOLS software suite. In a traditional ADMIXTOOLS workflow, the user must first generate a set of text configuration files tailored to each individual analysis, often using a combination of shell scripting and manual text editing. The non-tabular output files then need to be parsed to extract values of interest prior to further analyses. Our package simplifies this process by automating all low-level configuration and parsing steps, making analyses as simple as running a single R command. Furthermore, we provide a set of R functions for processing, filtering and manipulating datasets in the EIGENSTRAT format. By unifying all steps of the workflow under a single R framework, this package enables the automation of analytic pipelines, significantly improving the reproducibility of population genetic studies. AVAILABILITY AND IMPLEMENTATION: The source code of the R package is available under the MIT license. Installation instructions, reference manual and a tutorial can be found on the package website at https://bioinf.eva.mpg.de/admixr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.