Cargando…
Willows: a memory efficient tree and forest construction package
BACKGROUND: Existing tree and forest methods are powerful bioinformatics tools to explore high dimensional data including high throughput genomic data. However, they cannot deal with the data generated by recent genotyping platforms for single nucleotide polymorphisms due to the massive size of the...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2683818/ https://www.ncbi.nlm.nih.gov/pubmed/19416535 http://dx.doi.org/10.1186/1471-2105-10-130 |
_version_ | 1782167139437772800 |
---|---|
author | Zhang, Heping Wang, Minghui Chen, Xiang |
author_facet | Zhang, Heping Wang, Minghui Chen, Xiang |
author_sort | Zhang, Heping |
collection | PubMed |
description | BACKGROUND: Existing tree and forest methods are powerful bioinformatics tools to explore high dimensional data including high throughput genomic data. However, they cannot deal with the data generated by recent genotyping platforms for single nucleotide polymorphisms due to the massive size of the data and its excessive memory demand. RESULTS: Using the recursive partitioning technique, we developed a new software package, Willows, to maximize the utility of the computer memory and make it feasible to analyze massive genotype data. This package includes three tree-based methods – classification tree, random forest, and deterministic forest, and can efficiently handle the massive amount of SNP data. In addition, this package can easily set different options (e.g., algorithms and specifications) and predict the class of test samples. CONCLUSION: We developed Willows in a user friendly interface with the goal of maximizing the use of memory, which is critical for analysis of genomic data. The Willows package is well documented and publicly available at . |
format | Text |
id | pubmed-2683818 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26838182009-05-19 Willows: a memory efficient tree and forest construction package Zhang, Heping Wang, Minghui Chen, Xiang BMC Bioinformatics Software BACKGROUND: Existing tree and forest methods are powerful bioinformatics tools to explore high dimensional data including high throughput genomic data. However, they cannot deal with the data generated by recent genotyping platforms for single nucleotide polymorphisms due to the massive size of the data and its excessive memory demand. RESULTS: Using the recursive partitioning technique, we developed a new software package, Willows, to maximize the utility of the computer memory and make it feasible to analyze massive genotype data. This package includes three tree-based methods – classification tree, random forest, and deterministic forest, and can efficiently handle the massive amount of SNP data. In addition, this package can easily set different options (e.g., algorithms and specifications) and predict the class of test samples. CONCLUSION: We developed Willows in a user friendly interface with the goal of maximizing the use of memory, which is critical for analysis of genomic data. The Willows package is well documented and publicly available at . BioMed Central 2009-05-05 /pmc/articles/PMC2683818/ /pubmed/19416535 http://dx.doi.org/10.1186/1471-2105-10-130 Text en Copyright © 2009 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Zhang, Heping Wang, Minghui Chen, Xiang Willows: a memory efficient tree and forest construction package |
title | Willows: a memory efficient tree and forest construction package |
title_full | Willows: a memory efficient tree and forest construction package |
title_fullStr | Willows: a memory efficient tree and forest construction package |
title_full_unstemmed | Willows: a memory efficient tree and forest construction package |
title_short | Willows: a memory efficient tree and forest construction package |
title_sort | willows: a memory efficient tree and forest construction package |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2683818/ https://www.ncbi.nlm.nih.gov/pubmed/19416535 http://dx.doi.org/10.1186/1471-2105-10-130 |
work_keys_str_mv | AT zhangheping willowsamemoryefficienttreeandforestconstructionpackage AT wangminghui willowsamemoryefficienttreeandforestconstructionpackage AT chenxiang willowsamemoryefficienttreeandforestconstructionpackage |