VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
BACKGROUND: Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allows novel evolutionary and anthropological insights. However, no user-friendly tool has been developed to integrate thousands of recently available and anthropological...
Main authors: | Xu, Duo; Jaber, Yousef; Pavlidis, Pavlos; Gokcumen, Omer |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2017 |
Subjects: | Software |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5615795/ https://www.ncbi.nlm.nih.gov/pubmed/28950836 http://dx.doi.org/10.1186/s12859-017-1844-0 |
_version_ | 1783266668810600448 |
---|---|
author | Xu, Duo; Jaber, Yousef; Pavlidis, Pavlos; Gokcumen, Omer |
author_sort | Xu, Duo |
collection | PubMed |
description | BACKGROUND: Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allows novel evolutionary and anthropological insights. However, no user-friendly tool has been developed to integrate thousands of recently available and anthropologically relevant genome sequences to construct complete sequence alignments and phylogenies. RESULTS: Here, we provide VCFtoTree, a user-friendly tool with a graphical user interface that directly accesses online databases to download, parse, and analyze genome variation data for regions of interest. Our pipeline combines popular sequence datasets and tree-building algorithms with custom data parsing to generate accurate alignments and phylogenies using all the individuals from the 1000 Genomes Project, the Neanderthal and Denisovan genomes, and the reference genomes of chimpanzee and rhesus macaque. It can also be applied to other phased human genomes, as well as to genomes from other species. The output of our pipeline includes an alignment in FASTA format and a tree file in Newick format. CONCLUSION: VCFtoTree fulfills the increasing demand for constructing alignments and phylogenies for a given locus from thousands of available genomes. Our software provides a user-friendly interface for a wider audience without prerequisite knowledge of programming. VCFtoTree can be accessed from https://github.com/duoduoo/VCFtoTree_3.0.0. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1844-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5615795 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5615795 2017-09-28 VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences. Xu, Duo; Jaber, Yousef; Pavlidis, Pavlos; Gokcumen, Omer. BMC Bioinformatics. Software. BioMed Central 2017-09-26 /pmc/articles/PMC5615795/ /pubmed/28950836 http://dx.doi.org/10.1186/s12859-017-1844-0 Text en © The Author(s). 2017 Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
title | VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5615795/ https://www.ncbi.nlm.nih.gov/pubmed/28950836 http://dx.doi.org/10.1186/s12859-017-1844-0 |