Cargando…

SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data

BACKGROUND: Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it di...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Tae-Ho, Guo, Hui, Wang, Xiyin, Kim, Changsoo, Paterson, Andrew H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3945939/
https://www.ncbi.nlm.nih.gov/pubmed/24571581
http://dx.doi.org/10.1186/1471-2164-15-162
_version_ 1782306591044796416
author Lee, Tae-Ho
Guo, Hui
Wang, Xiyin
Kim, Changsoo
Paterson, Andrew H
author_facet Lee, Tae-Ho
Guo, Hui
Wang, Xiyin
Kim, Changsoo
Paterson, Andrew H
author_sort Lee, Tae-Ho
collection PubMed
description BACKGROUND: Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. RESULTS: We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. CONCLUSIONS: Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.
format Online
Article
Text
id pubmed-3945939
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39459392014-03-08 SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data Lee, Tae-Ho Guo, Hui Wang, Xiyin Kim, Changsoo Paterson, Andrew H BMC Genomics Software BACKGROUND: Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. RESULTS: We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. CONCLUSIONS: Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis. BioMed Central 2014-02-26 /pmc/articles/PMC3945939/ /pubmed/24571581 http://dx.doi.org/10.1186/1471-2164-15-162 Text en Copyright © 2014 Lee et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Lee, Tae-Ho
Guo, Hui
Wang, Xiyin
Kim, Changsoo
Paterson, Andrew H
SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data
title SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data
title_full SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data
title_fullStr SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data
title_full_unstemmed SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data
title_short SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data
title_sort snphylo: a pipeline to construct a phylogenetic tree from huge snp data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3945939/
https://www.ncbi.nlm.nih.gov/pubmed/24571581
http://dx.doi.org/10.1186/1471-2164-15-162
work_keys_str_mv AT leetaeho snphyloapipelinetoconstructaphylogenetictreefromhugesnpdata
AT guohui snphyloapipelinetoconstructaphylogenetictreefromhugesnpdata
AT wangxiyin snphyloapipelinetoconstructaphylogenetictreefromhugesnpdata
AT kimchangsoo snphyloapipelinetoconstructaphylogenetictreefromhugesnpdata
AT patersonandrewh snphyloapipelinetoconstructaphylogenetictreefromhugesnpdata