Cargando…

The Sleipnir library for computational functional genomics

Motivation: Biological data generation has accelerated to the point where hundreds or thousands of whole-genome datasets of various types are available for many model organisms. This wealth of data can lead to valuable biological insights when analyzed in an integrated manner, but the computational...

Descripción completa

Detalles Bibliográficos
Autores principales: Huttenhower, Curtis, Schroeder, Mark, Chikina, Maria D, Troyanskaya, Olga G.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718674/
https://www.ncbi.nlm.nih.gov/pubmed/18499696
http://dx.doi.org/10.1093/bioinformatics/btn237
_version_ 1782170013624434688
author Huttenhower, Curtis
Schroeder, Mark
Chikina, Maria D
Troyanskaya, Olga G.
author_facet Huttenhower, Curtis
Schroeder, Mark
Chikina, Maria D
Troyanskaya, Olga G.
author_sort Huttenhower, Curtis
collection PubMed
description Motivation: Biological data generation has accelerated to the point where hundreds or thousands of whole-genome datasets of various types are available for many model organisms. This wealth of data can lead to valuable biological insights when analyzed in an integrated manner, but the computational challenge of managing such large data collections is substantial. In order to mine these data efficiently, it is necessary to develop methods that use storage, memory and processing resources carefully. Results: The Sleipnir C++ library implements a variety of machine learning and data manipulation algorithms with a focus on heterogeneous data integration and efficiency for very large biological data collections. Sleipnir allows microarray processing, functional ontology mining, clustering, Bayesian learning and inference and support vector machine tasks to be performed for heterogeneous data on scales not previously practical. In addition to the library, which can easily be integrated into new computational systems, prebuilt tools are provided to perform a variety of common tasks. Many tools are multithreaded for parallelization in desktop or high-throughput computing environments, and most tasks can be performed in minutes for hundreds of datasets using a standard personal computer. Availability: Source code (C++) and documentation are available at http://function.princeton.edu/sleipnir and compiled binaries are available from the authors on request. Contact: ogt@princeton.edu
format Text
id pubmed-2718674
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27186742009-07-31 The Sleipnir library for computational functional genomics Huttenhower, Curtis Schroeder, Mark Chikina, Maria D Troyanskaya, Olga G. Bioinformatics Applications Note Motivation: Biological data generation has accelerated to the point where hundreds or thousands of whole-genome datasets of various types are available for many model organisms. This wealth of data can lead to valuable biological insights when analyzed in an integrated manner, but the computational challenge of managing such large data collections is substantial. In order to mine these data efficiently, it is necessary to develop methods that use storage, memory and processing resources carefully. Results: The Sleipnir C++ library implements a variety of machine learning and data manipulation algorithms with a focus on heterogeneous data integration and efficiency for very large biological data collections. Sleipnir allows microarray processing, functional ontology mining, clustering, Bayesian learning and inference and support vector machine tasks to be performed for heterogeneous data on scales not previously practical. In addition to the library, which can easily be integrated into new computational systems, prebuilt tools are provided to perform a variety of common tasks. Many tools are multithreaded for parallelization in desktop or high-throughput computing environments, and most tasks can be performed in minutes for hundreds of datasets using a standard personal computer. Availability: Source code (C++) and documentation are available at http://function.princeton.edu/sleipnir and compiled binaries are available from the authors on request. Contact: ogt@princeton.edu Oxford University Press 2008-07-01 2008-05-21 /pmc/articles/PMC2718674/ /pubmed/18499696 http://dx.doi.org/10.1093/bioinformatics/btn237 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Huttenhower, Curtis
Schroeder, Mark
Chikina, Maria D
Troyanskaya, Olga G.
The Sleipnir library for computational functional genomics
title The Sleipnir library for computational functional genomics
title_full The Sleipnir library for computational functional genomics
title_fullStr The Sleipnir library for computational functional genomics
title_full_unstemmed The Sleipnir library for computational functional genomics
title_short The Sleipnir library for computational functional genomics
title_sort sleipnir library for computational functional genomics
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718674/
https://www.ncbi.nlm.nih.gov/pubmed/18499696
http://dx.doi.org/10.1093/bioinformatics/btn237
work_keys_str_mv AT huttenhowercurtis thesleipnirlibraryforcomputationalfunctionalgenomics
AT schroedermark thesleipnirlibraryforcomputationalfunctionalgenomics
AT chikinamariad thesleipnirlibraryforcomputationalfunctionalgenomics
AT troyanskayaolgag thesleipnirlibraryforcomputationalfunctionalgenomics
AT huttenhowercurtis sleipnirlibraryforcomputationalfunctionalgenomics
AT schroedermark sleipnirlibraryforcomputationalfunctionalgenomics
AT chikinamariad sleipnirlibraryforcomputationalfunctionalgenomics
AT troyanskayaolgag sleipnirlibraryforcomputationalfunctionalgenomics