Cargando…

Bi-clustering of metabolic data using matrix factorization tools

Metabolic phenotyping technologies based on Nuclear Magnetic Spectroscopy (NMR) and Mass Spectrometry (MS) generate vast amounts of unrefined data from biological samples. Clustering strategies are frequently employed to provide insight into patterns of relationships between samples and metabolites....

Descripción completa

Detalles Bibliográficos
Autores principales: Gu, Quan, Veselkov, Kirill
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Academic Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6297113/
https://www.ncbi.nlm.nih.gov/pubmed/29438828
http://dx.doi.org/10.1016/j.ymeth.2018.02.004
_version_ 1783381128744271872
author Gu, Quan
Veselkov, Kirill
author_facet Gu, Quan
Veselkov, Kirill
author_sort Gu, Quan
collection PubMed
description Metabolic phenotyping technologies based on Nuclear Magnetic Spectroscopy (NMR) and Mass Spectrometry (MS) generate vast amounts of unrefined data from biological samples. Clustering strategies are frequently employed to provide insight into patterns of relationships between samples and metabolites. Here, we propose the use of a non-negative matrix factorization driven bi-clustering strategy for metabolic phenotyping data in order to discover subsets of interrelated metabolites that exhibit similar behaviour across subsets of samples. The proposed strategy incorporates bi-cross validation and statistical segmentation techniques to automatically determine the number and structure of bi-clusters. This alternative approach is in contrast to the widely used conventional clustering approaches that incorporate all molecular peaks for clustering in metabolic studies and require a priori specification of the number of clusters. We perform the comparative analysis of the proposed strategy with other bi-clustering approaches, which were developed in the context of genomics and transcriptomics research. We demonstrate the superior performance of the proposed bi-clustering strategy on both simulated (NMR) and real (MS) bacterial metabolic data.
format Online
Article
Text
id pubmed-6297113
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Academic Press
record_format MEDLINE/PubMed
spelling pubmed-62971132018-12-21 Bi-clustering of metabolic data using matrix factorization tools Gu, Quan Veselkov, Kirill Methods Article Metabolic phenotyping technologies based on Nuclear Magnetic Spectroscopy (NMR) and Mass Spectrometry (MS) generate vast amounts of unrefined data from biological samples. Clustering strategies are frequently employed to provide insight into patterns of relationships between samples and metabolites. Here, we propose the use of a non-negative matrix factorization driven bi-clustering strategy for metabolic phenotyping data in order to discover subsets of interrelated metabolites that exhibit similar behaviour across subsets of samples. The proposed strategy incorporates bi-cross validation and statistical segmentation techniques to automatically determine the number and structure of bi-clusters. This alternative approach is in contrast to the widely used conventional clustering approaches that incorporate all molecular peaks for clustering in metabolic studies and require a priori specification of the number of clusters. We perform the comparative analysis of the proposed strategy with other bi-clustering approaches, which were developed in the context of genomics and transcriptomics research. We demonstrate the superior performance of the proposed bi-clustering strategy on both simulated (NMR) and real (MS) bacterial metabolic data. Academic Press 2018-12-01 /pmc/articles/PMC6297113/ /pubmed/29438828 http://dx.doi.org/10.1016/j.ymeth.2018.02.004 Text en © 2018 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gu, Quan
Veselkov, Kirill
Bi-clustering of metabolic data using matrix factorization tools
title Bi-clustering of metabolic data using matrix factorization tools
title_full Bi-clustering of metabolic data using matrix factorization tools
title_fullStr Bi-clustering of metabolic data using matrix factorization tools
title_full_unstemmed Bi-clustering of metabolic data using matrix factorization tools
title_short Bi-clustering of metabolic data using matrix factorization tools
title_sort bi-clustering of metabolic data using matrix factorization tools
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6297113/
https://www.ncbi.nlm.nih.gov/pubmed/29438828
http://dx.doi.org/10.1016/j.ymeth.2018.02.004
work_keys_str_mv AT guquan biclusteringofmetabolicdatausingmatrixfactorizationtools
AT veselkovkirill biclusteringofmetabolicdatausingmatrixfactorizationtools