Cargando…

A flexible R package for nonnegative matrix factorization

BACKGROUND: Nonnegative Matrix Factorization (NMF) is an unsupervised learning technique that has been applied successfully in several fields, including signal processing, face recognition and text mining. Recent applications of NMF in bioinformatics have demonstrated its ability to extract meaningf...

Descripción completa

Detalles Bibliográficos
Autores principales: Gaujoux, Renaud, Seoighe, Cathal
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2912887/
https://www.ncbi.nlm.nih.gov/pubmed/20598126
http://dx.doi.org/10.1186/1471-2105-11-367
_version_ 1782184643859054592
author Gaujoux, Renaud
Seoighe, Cathal
author_facet Gaujoux, Renaud
Seoighe, Cathal
author_sort Gaujoux, Renaud
collection PubMed
description BACKGROUND: Nonnegative Matrix Factorization (NMF) is an unsupervised learning technique that has been applied successfully in several fields, including signal processing, face recognition and text mining. Recent applications of NMF in bioinformatics have demonstrated its ability to extract meaningful information from high-dimensional data such as gene expression microarrays. Developments in NMF theory and applications have resulted in a variety of algorithms and methods. However, most NMF implementations have been on commercial platforms, while those that are freely available typically require programming skills. This limits their use by the wider research community. RESULTS: Our objective is to provide the bioinformatics community with an open-source, easy-to-use and unified interface to standard NMF algorithms, as well as with a simple framework to help implement and test new NMF methods. For that purpose, we have developed a package for the R/BioConductor platform. The package ports public code to R, and is structured to enable users to easily modify and/or add algorithms. It includes a number of published NMF algorithms and initialization methods and facilitates the combination of these to produce new NMF strategies. Commonly used benchmark data and visualization methods are provided to help in the comparison and interpretation of the results. CONCLUSIONS: The NMF package helps realize the potential of Nonnegative Matrix Factorization, especially in bioinformatics, providing easy access to methods that have already yielded new insights in many applications. Documentation, source code and sample data are available from CRAN.
format Text
id pubmed-2912887
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29128872010-07-31 A flexible R package for nonnegative matrix factorization Gaujoux, Renaud Seoighe, Cathal BMC Bioinformatics Software BACKGROUND: Nonnegative Matrix Factorization (NMF) is an unsupervised learning technique that has been applied successfully in several fields, including signal processing, face recognition and text mining. Recent applications of NMF in bioinformatics have demonstrated its ability to extract meaningful information from high-dimensional data such as gene expression microarrays. Developments in NMF theory and applications have resulted in a variety of algorithms and methods. However, most NMF implementations have been on commercial platforms, while those that are freely available typically require programming skills. This limits their use by the wider research community. RESULTS: Our objective is to provide the bioinformatics community with an open-source, easy-to-use and unified interface to standard NMF algorithms, as well as with a simple framework to help implement and test new NMF methods. For that purpose, we have developed a package for the R/BioConductor platform. The package ports public code to R, and is structured to enable users to easily modify and/or add algorithms. It includes a number of published NMF algorithms and initialization methods and facilitates the combination of these to produce new NMF strategies. Commonly used benchmark data and visualization methods are provided to help in the comparison and interpretation of the results. CONCLUSIONS: The NMF package helps realize the potential of Nonnegative Matrix Factorization, especially in bioinformatics, providing easy access to methods that have already yielded new insights in many applications. Documentation, source code and sample data are available from CRAN. BioMed Central 2010-07-02 /pmc/articles/PMC2912887/ /pubmed/20598126 http://dx.doi.org/10.1186/1471-2105-11-367 Text en Copyright ©2010 Gaujoux and Seoighe; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Gaujoux, Renaud
Seoighe, Cathal
A flexible R package for nonnegative matrix factorization
title A flexible R package for nonnegative matrix factorization
title_full A flexible R package for nonnegative matrix factorization
title_fullStr A flexible R package for nonnegative matrix factorization
title_full_unstemmed A flexible R package for nonnegative matrix factorization
title_short A flexible R package for nonnegative matrix factorization
title_sort flexible r package for nonnegative matrix factorization
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2912887/
https://www.ncbi.nlm.nih.gov/pubmed/20598126
http://dx.doi.org/10.1186/1471-2105-11-367
work_keys_str_mv AT gaujouxrenaud aflexiblerpackagefornonnegativematrixfactorization
AT seoighecathal aflexiblerpackagefornonnegativematrixfactorization
AT gaujouxrenaud flexiblerpackagefornonnegativematrixfactorization
AT seoighecathal flexiblerpackagefornonnegativematrixfactorization