Cargando…

VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences

BACKGROUND: Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allow novel evolutionary and anthropological insights. However, no user-friendly tool has been developed to integrate thousands of recently available and anthropological...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Duo, Jaber, Yousef, Pavlidis, Pavlos, Gokcumen, Omer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5615795/
https://www.ncbi.nlm.nih.gov/pubmed/28950836
http://dx.doi.org/10.1186/s12859-017-1844-0
_version_ 1783266668810600448
author Xu, Duo
Jaber, Yousef
Pavlidis, Pavlos
Gokcumen, Omer
author_facet Xu, Duo
Jaber, Yousef
Pavlidis, Pavlos
Gokcumen, Omer
author_sort Xu, Duo
collection PubMed
description BACKGROUND: Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allow novel evolutionary and anthropological insights. However, no user-friendly tool has been developed to integrate thousands of recently available and anthropologically relevant genome sequences to construct complete sequence alignments and phylogenies. RESULTS: Here, we provide VCFtoTree, a user friendly tool with a graphical user interface that directly accesses online databases to download, parse and analyze genome variation data for regions of interest. Our pipeline combines popular sequence datasets and tree building algorithms with custom data parsing to generate accurate alignments and phylogenies using all the individuals from the 1000 Genomes Project, Neanderthal and Denisovan genomes, as well as reference genomes of Chimpanzee and Rhesus Macaque. It can also be applied to other phased human genomes, as well as genomes from other species. The output of our pipeline includes an alignment in FASTA format and a tree file in newick format. CONCLUSION: VCFtoTree fulfills the increasing demand for constructing alignments and phylogenies for a given loci from thousands of available genomes. Our software provides a user friendly interface for a wider audience without prerequisite knowledge in programming. VCFtoTree can be accessed from https://github.com/duoduoo/VCFtoTree_3.0.0. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1844-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5615795
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56157952017-09-28 VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences Xu, Duo Jaber, Yousef Pavlidis, Pavlos Gokcumen, Omer BMC Bioinformatics Software BACKGROUND: Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allow novel evolutionary and anthropological insights. However, no user-friendly tool has been developed to integrate thousands of recently available and anthropologically relevant genome sequences to construct complete sequence alignments and phylogenies. RESULTS: Here, we provide VCFtoTree, a user friendly tool with a graphical user interface that directly accesses online databases to download, parse and analyze genome variation data for regions of interest. Our pipeline combines popular sequence datasets and tree building algorithms with custom data parsing to generate accurate alignments and phylogenies using all the individuals from the 1000 Genomes Project, Neanderthal and Denisovan genomes, as well as reference genomes of Chimpanzee and Rhesus Macaque. It can also be applied to other phased human genomes, as well as genomes from other species. The output of our pipeline includes an alignment in FASTA format and a tree file in newick format. CONCLUSION: VCFtoTree fulfills the increasing demand for constructing alignments and phylogenies for a given loci from thousands of available genomes. Our software provides a user friendly interface for a wider audience without prerequisite knowledge in programming. VCFtoTree can be accessed from https://github.com/duoduoo/VCFtoTree_3.0.0. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1844-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-09-26 /pmc/articles/PMC5615795/ /pubmed/28950836 http://dx.doi.org/10.1186/s12859-017-1844-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Xu, Duo
Jaber, Yousef
Pavlidis, Pavlos
Gokcumen, Omer
VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
title VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
title_full VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
title_fullStr VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
title_full_unstemmed VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
title_short VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
title_sort vcftotree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5615795/
https://www.ncbi.nlm.nih.gov/pubmed/28950836
http://dx.doi.org/10.1186/s12859-017-1844-0
work_keys_str_mv AT xuduo vcftotreeauserfriendlytooltoconstructlocusspecificalignmentsandphylogeniesfromthousandsofanthropologicallyrelevantgenomesequences
AT jaberyousef vcftotreeauserfriendlytooltoconstructlocusspecificalignmentsandphylogeniesfromthousandsofanthropologicallyrelevantgenomesequences
AT pavlidispavlos vcftotreeauserfriendlytooltoconstructlocusspecificalignmentsandphylogeniesfromthousandsofanthropologicallyrelevantgenomesequences
AT gokcumenomer vcftotreeauserfriendlytooltoconstructlocusspecificalignmentsandphylogeniesfromthousandsofanthropologicallyrelevantgenomesequences