Cargando…

CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets

In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as t...

Descripción completa

Detalles Bibliográficos
Autores principales: Liao, Ruiqi, Zhang, Yifan, Guan, Jihong, Zhou, Shuigeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4411332/
https://www.ncbi.nlm.nih.gov/pubmed/23933456
http://dx.doi.org/10.1016/j.gpb.2013.06.001
_version_ 1782368455237828608
author Liao, Ruiqi
Zhang, Yifan
Guan, Jihong
Zhou, Shuigeng
author_facet Liao, Ruiqi
Zhang, Yifan
Guan, Jihong
Zhou, Shuigeng
author_sort Liao, Ruiqi
collection PubMed
description In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them, and has been applied to various fields of biological research. In this paper, we present CloudNMF, a distributed open-source implementation of NMF on a MapReduce framework. Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data, which may enable various kinds of a high-throughput biological data analysis in the cloud. CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html.
format Online
Article
Text
id pubmed-4411332
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-44113322015-05-06 CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets Liao, Ruiqi Zhang, Yifan Guan, Jihong Zhou, Shuigeng Genomics Proteomics Bioinformatics Application Note In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them, and has been applied to various fields of biological research. In this paper, we present CloudNMF, a distributed open-source implementation of NMF on a MapReduce framework. Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data, which may enable various kinds of a high-throughput biological data analysis in the cloud. CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html. Elsevier 2014-02 2013-08-08 /pmc/articles/PMC4411332/ /pubmed/23933456 http://dx.doi.org/10.1016/j.gpb.2013.06.001 Text en © 2013 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Production and hosting by Elsevier B.V. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle Application Note
Liao, Ruiqi
Zhang, Yifan
Guan, Jihong
Zhou, Shuigeng
CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
title CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
title_full CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
title_fullStr CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
title_full_unstemmed CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
title_short CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets
title_sort cloudnmf: a mapreduce implementation of nonnegative matrix factorization for large-scale biological datasets
topic Application Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4411332/
https://www.ncbi.nlm.nih.gov/pubmed/23933456
http://dx.doi.org/10.1016/j.gpb.2013.06.001
work_keys_str_mv AT liaoruiqi cloudnmfamapreduceimplementationofnonnegativematrixfactorizationforlargescalebiologicaldatasets
AT zhangyifan cloudnmfamapreduceimplementationofnonnegativematrixfactorizationforlargescalebiologicaldatasets
AT guanjihong cloudnmfamapreduceimplementationofnonnegativematrixfactorizationforlargescalebiologicaldatasets
AT zhoushuigeng cloudnmfamapreduceimplementationofnonnegativematrixfactorizationforlargescalebiologicaldatasets