Cargando…

Willows: a memory efficient tree and forest construction package

BACKGROUND: Existing tree and forest methods are powerful bioinformatics tools to explore high dimensional data including high throughput genomic data. However, they cannot deal with the data generated by recent genotyping platforms for single nucleotide polymorphisms due to the massive size of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Heping, Wang, Minghui, Chen, Xiang
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2683818/
https://www.ncbi.nlm.nih.gov/pubmed/19416535
http://dx.doi.org/10.1186/1471-2105-10-130
_version_ 1782167139437772800
author Zhang, Heping
Wang, Minghui
Chen, Xiang
author_facet Zhang, Heping
Wang, Minghui
Chen, Xiang
author_sort Zhang, Heping
collection PubMed
description BACKGROUND: Existing tree and forest methods are powerful bioinformatics tools to explore high dimensional data including high throughput genomic data. However, they cannot deal with the data generated by recent genotyping platforms for single nucleotide polymorphisms due to the massive size of the data and its excessive memory demand. RESULTS: Using the recursive partitioning technique, we developed a new software package, Willows, to maximize the utility of the computer memory and make it feasible to analyze massive genotype data. This package includes three tree-based methods – classification tree, random forest, and deterministic forest, and can efficiently handle the massive amount of SNP data. In addition, this package can easily set different options (e.g., algorithms and specifications) and predict the class of test samples. CONCLUSION: We developed Willows in a user friendly interface with the goal of maximizing the use of memory, which is critical for analysis of genomic data. The Willows package is well documented and publicly available at .
format Text
id pubmed-2683818
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26838182009-05-19 Willows: a memory efficient tree and forest construction package Zhang, Heping Wang, Minghui Chen, Xiang BMC Bioinformatics Software BACKGROUND: Existing tree and forest methods are powerful bioinformatics tools to explore high dimensional data including high throughput genomic data. However, they cannot deal with the data generated by recent genotyping platforms for single nucleotide polymorphisms due to the massive size of the data and its excessive memory demand. RESULTS: Using the recursive partitioning technique, we developed a new software package, Willows, to maximize the utility of the computer memory and make it feasible to analyze massive genotype data. This package includes three tree-based methods – classification tree, random forest, and deterministic forest, and can efficiently handle the massive amount of SNP data. In addition, this package can easily set different options (e.g., algorithms and specifications) and predict the class of test samples. CONCLUSION: We developed Willows in a user friendly interface with the goal of maximizing the use of memory, which is critical for analysis of genomic data. The Willows package is well documented and publicly available at . BioMed Central 2009-05-05 /pmc/articles/PMC2683818/ /pubmed/19416535 http://dx.doi.org/10.1186/1471-2105-10-130 Text en Copyright © 2009 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Zhang, Heping
Wang, Minghui
Chen, Xiang
Willows: a memory efficient tree and forest construction package
title Willows: a memory efficient tree and forest construction package
title_full Willows: a memory efficient tree and forest construction package
title_fullStr Willows: a memory efficient tree and forest construction package
title_full_unstemmed Willows: a memory efficient tree and forest construction package
title_short Willows: a memory efficient tree and forest construction package
title_sort willows: a memory efficient tree and forest construction package
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2683818/
https://www.ncbi.nlm.nih.gov/pubmed/19416535
http://dx.doi.org/10.1186/1471-2105-10-130
work_keys_str_mv AT zhangheping willowsamemoryefficienttreeandforestconstructionpackage
AT wangminghui willowsamemoryefficienttreeandforestconstructionpackage
AT chenxiang willowsamemoryefficienttreeandforestconstructionpackage