CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as t...
Main authors: | Liao, Ruiqi; Zhang, Yifan; Guan, Jihong; Zhou, Shuigeng |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2014 |
Subjects: | Application Note |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4411332/ https://www.ncbi.nlm.nih.gov/pubmed/23933456 http://dx.doi.org/10.1016/j.gpb.2013.06.001 |
_version_ | 1782368455237828608 |
---|---|
author | Liao, Ruiqi Zhang, Yifan Guan, Jihong Zhou, Shuigeng |
author_sort | Liao, Ruiqi |
collection | PubMed |
description | In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them, and has been applied to various fields of biological research. In this paper, we present CloudNMF, a distributed open-source implementation of NMF on a MapReduce framework. Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data, which may enable various kinds of high-throughput biological data analysis in the cloud. CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html. |
format | Online Article Text |
id | pubmed-4411332 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-4411332 2015-05-06 CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets Liao, Ruiqi Zhang, Yifan Guan, Jihong Zhou, Shuigeng Genomics Proteomics Bioinformatics Application Note In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them, and has been applied to various fields of biological research. In this paper, we present CloudNMF, a distributed open-source implementation of NMF on a MapReduce framework. Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data, which may enable various kinds of high-throughput biological data analysis in the cloud. CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html. Elsevier 2014-02 2013-08-08 /pmc/articles/PMC4411332/ /pubmed/23933456 http://dx.doi.org/10.1016/j.gpb.2013.06.001 Text en © 2013 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Production and hosting by Elsevier B.V. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/). |
title | CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets |
topic | Application Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4411332/ https://www.ncbi.nlm.nih.gov/pubmed/23933456 http://dx.doi.org/10.1016/j.gpb.2013.06.001 |