Cargando…

VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs

In the past decades a number of software programs have been developed to infer phylogenetic relationships between populations. However, most of these programs typically use alignments of sequences from genes to build phylogeny. Recently, many standalone or web applications have been developed to han...

Descripción completa

Detalles Bibliográficos
Autores principales: Subramanian, Sankar, Ramasamy, Umayal, Chen, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6901002/
https://www.ncbi.nlm.nih.gov/pubmed/31824783
http://dx.doi.org/10.7717/peerj.8213
_version_ 1783477435497447424
author Subramanian, Sankar
Ramasamy, Umayal
Chen, David
author_facet Subramanian, Sankar
Ramasamy, Umayal
Chen, David
author_sort Subramanian, Sankar
collection PubMed
description In the past decades a number of software programs have been developed to infer phylogenetic relationships between populations. However, most of these programs typically use alignments of sequences from genes to build phylogeny. Recently, many standalone or web applications have been developed to handle large-scale whole genome data, but they are either computationally intensive, dependent on third party software or required significant time and resource of a web server. In the post-genomic era, researchers are able to obtain bioinformatically processed high-quality publication-ready whole genome data for many individuals in a population from next generation sequencing companies due to the reduction in the cost of sequencing and analysis. Such genotype data is typically presented in the Variant Call Format (VCF) and there is no simple software available that directly uses this data format to construct the phylogeny of populations in a short time. To address this limitation, we have developed a user-friendly software, VCF2PopTree that uses genome-wide SNPs to construct and display phylogenetic trees in seconds to minutes. For example, it reads a VCF file containing 4 million SNPs and draws a tree in less than 30 seconds. VCF2PopTree accepts genotype data from a local machine, constructs a tree using UPGMA and Neighbour-Joining algorithms and displays it on a web-browser. It also produces pairwise-diversity matrix in MEGA and PHYLIP file formats as well as trees in the Newick format which could be directly used by other popular phylogenetic software programs. The software including the source code, a test VCF file and a documentation are available at: https://github.com/sansubs/vcf2pop.
format Online
Article
Text
id pubmed-6901002
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-69010022019-12-10 VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs Subramanian, Sankar Ramasamy, Umayal Chen, David PeerJ Bioinformatics In the past decades a number of software programs have been developed to infer phylogenetic relationships between populations. However, most of these programs typically use alignments of sequences from genes to build phylogeny. Recently, many standalone or web applications have been developed to handle large-scale whole genome data, but they are either computationally intensive, dependent on third party software or required significant time and resource of a web server. In the post-genomic era, researchers are able to obtain bioinformatically processed high-quality publication-ready whole genome data for many individuals in a population from next generation sequencing companies due to the reduction in the cost of sequencing and analysis. Such genotype data is typically presented in the Variant Call Format (VCF) and there is no simple software available that directly uses this data format to construct the phylogeny of populations in a short time. To address this limitation, we have developed a user-friendly software, VCF2PopTree that uses genome-wide SNPs to construct and display phylogenetic trees in seconds to minutes. For example, it reads a VCF file containing 4 million SNPs and draws a tree in less than 30 seconds. VCF2PopTree accepts genotype data from a local machine, constructs a tree using UPGMA and Neighbour-Joining algorithms and displays it on a web-browser. It also produces pairwise-diversity matrix in MEGA and PHYLIP file formats as well as trees in the Newick format which could be directly used by other popular phylogenetic software programs. The software including the source code, a test VCF file and a documentation are available at: https://github.com/sansubs/vcf2pop. PeerJ Inc. 2019-12-06 /pmc/articles/PMC6901002/ /pubmed/31824783 http://dx.doi.org/10.7717/peerj.8213 Text en ©2019 Subramanian et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Subramanian, Sankar
Ramasamy, Umayal
Chen, David
VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs
title VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs
title_full VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs
title_fullStr VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs
title_full_unstemmed VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs
title_short VCF2PopTree: a client-side software to construct population phylogeny from genome-wide SNPs
title_sort vcf2poptree: a client-side software to construct population phylogeny from genome-wide snps
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6901002/
https://www.ncbi.nlm.nih.gov/pubmed/31824783
http://dx.doi.org/10.7717/peerj.8213
work_keys_str_mv AT subramaniansankar vcf2poptreeaclientsidesoftwaretoconstructpopulationphylogenyfromgenomewidesnps
AT ramasamyumayal vcf2poptreeaclientsidesoftwaretoconstructpopulationphylogenyfromgenomewidesnps
AT chendavid vcf2poptreeaclientsidesoftwaretoconstructpopulationphylogenyfromgenomewidesnps