Cargando…

The non-negative matrix factorization toolbox for biological data mining

BACKGROUND: Non-negative matrix factorization (NMF) has been introduced as an important method for mining biological data. Though there currently exists packages implemented in R and other programming languages, they either provide only a few optimization algorithms or focus on a specific applicatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yifeng, Ngom, Alioune
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3736608/
https://www.ncbi.nlm.nih.gov/pubmed/23591137
http://dx.doi.org/10.1186/1751-0473-8-10
_version_ 1782279777628979200
author Li, Yifeng
Ngom, Alioune
author_facet Li, Yifeng
Ngom, Alioune
author_sort Li, Yifeng
collection PubMed
description BACKGROUND: Non-negative matrix factorization (NMF) has been introduced as an important method for mining biological data. Though there currently exists packages implemented in R and other programming languages, they either provide only a few optimization algorithms or focus on a specific application field. There does not exist a complete NMF package for the bioinformatics community, and in order to perform various data mining tasks on biological data. RESULTS: We provide a convenient MATLAB toolbox containing both the implementations of various NMF techniques and a variety of NMF-based data mining approaches for analyzing biological data. Data mining approaches implemented within the toolbox include data clustering and bi-clustering, feature extraction and selection, sample classification, missing values imputation, data visualization, and statistical comparison. CONCLUSIONS: A series of analysis such as molecular pattern discovery, biological process identification, dimension reduction, disease prediction, visualization, and statistical comparison can be performed using this toolbox.
format Online
Article
Text
id pubmed-3736608
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37366082013-08-07 The non-negative matrix factorization toolbox for biological data mining Li, Yifeng Ngom, Alioune Source Code Biol Med Methodology BACKGROUND: Non-negative matrix factorization (NMF) has been introduced as an important method for mining biological data. Though there currently exists packages implemented in R and other programming languages, they either provide only a few optimization algorithms or focus on a specific application field. There does not exist a complete NMF package for the bioinformatics community, and in order to perform various data mining tasks on biological data. RESULTS: We provide a convenient MATLAB toolbox containing both the implementations of various NMF techniques and a variety of NMF-based data mining approaches for analyzing biological data. Data mining approaches implemented within the toolbox include data clustering and bi-clustering, feature extraction and selection, sample classification, missing values imputation, data visualization, and statistical comparison. CONCLUSIONS: A series of analysis such as molecular pattern discovery, biological process identification, dimension reduction, disease prediction, visualization, and statistical comparison can be performed using this toolbox. BioMed Central 2013-04-16 /pmc/articles/PMC3736608/ /pubmed/23591137 http://dx.doi.org/10.1186/1751-0473-8-10 Text en Copyright © 2013 Li and Ngom; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Li, Yifeng
Ngom, Alioune
The non-negative matrix factorization toolbox for biological data mining
title The non-negative matrix factorization toolbox for biological data mining
title_full The non-negative matrix factorization toolbox for biological data mining
title_fullStr The non-negative matrix factorization toolbox for biological data mining
title_full_unstemmed The non-negative matrix factorization toolbox for biological data mining
title_short The non-negative matrix factorization toolbox for biological data mining
title_sort non-negative matrix factorization toolbox for biological data mining
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3736608/
https://www.ncbi.nlm.nih.gov/pubmed/23591137
http://dx.doi.org/10.1186/1751-0473-8-10
work_keys_str_mv AT liyifeng thenonnegativematrixfactorizationtoolboxforbiologicaldatamining
AT ngomalioune thenonnegativematrixfactorizationtoolboxforbiologicaldatamining
AT liyifeng nonnegativematrixfactorizationtoolboxforbiologicaldatamining
AT ngomalioune nonnegativematrixfactorizationtoolboxforbiologicaldatamining