Cargando…

Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data

BACKGROUND: Data generated from metabolomics experiments are different from other types of “-omics” data. For example, a common phenomenon in mass spectrometry (MS)-based metabolomics data is that the data matrix frequently contains missing values, which complicates some quantitative analyses. One w...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhan, Xiang, Patterson, Andrew D, Ghosh, Debashis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4359587/
https://www.ncbi.nlm.nih.gov/pubmed/25887233
http://dx.doi.org/10.1186/s12859-015-0506-3
_version_ 1782361438253219840
author Zhan, Xiang
Patterson, Andrew D
Ghosh, Debashis
author_facet Zhan, Xiang
Patterson, Andrew D
Ghosh, Debashis
author_sort Zhan, Xiang
collection PubMed
description BACKGROUND: Data generated from metabolomics experiments are different from other types of “-omics” data. For example, a common phenomenon in mass spectrometry (MS)-based metabolomics data is that the data matrix frequently contains missing values, which complicates some quantitative analyses. One way to tackle this problem is to treat them as absent. Hence there are two types of information that are available in metabolomics data: presence/absence of a metabolite and a quantitative value of the abundance level of a metabolite if it is present. Combining these two layers of information poses challenges to the application of traditional statistical approaches in differential expression analysis. RESULTS: In this article, we propose a novel kernel-based score test for the metabolomics differential expression analysis. In order to simultaneously capture both the continuous pattern and discrete pattern in metabolomics data, two new kinds of kernels are designed. One is the distance-based kernel and the other is the stratified kernel. While we initially describe the procedures in the case of single-metabolite analysis, we extend the methods to handle metabolite sets as well. CONCLUSIONS: Evaluation based on both simulated data and real data from a liver cancer metabolomics study indicates that our kernel method has a better performance than some existing alternatives. An implementation of the proposed kernel method in the R statistical computing environment is available at http://works.bepress.com/debashis_ghosh/60/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0506-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4359587
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43595872015-03-15 Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data Zhan, Xiang Patterson, Andrew D Ghosh, Debashis BMC Bioinformatics Research Article BACKGROUND: Data generated from metabolomics experiments are different from other types of “-omics” data. For example, a common phenomenon in mass spectrometry (MS)-based metabolomics data is that the data matrix frequently contains missing values, which complicates some quantitative analyses. One way to tackle this problem is to treat them as absent. Hence there are two types of information that are available in metabolomics data: presence/absence of a metabolite and a quantitative value of the abundance level of a metabolite if it is present. Combining these two layers of information poses challenges to the application of traditional statistical approaches in differential expression analysis. RESULTS: In this article, we propose a novel kernel-based score test for the metabolomics differential expression analysis. In order to simultaneously capture both the continuous pattern and discrete pattern in metabolomics data, two new kinds of kernels are designed. One is the distance-based kernel and the other is the stratified kernel. While we initially describe the procedures in the case of single-metabolite analysis, we extend the methods to handle metabolite sets as well. CONCLUSIONS: Evaluation based on both simulated data and real data from a liver cancer metabolomics study indicates that our kernel method has a better performance than some existing alternatives. An implementation of the proposed kernel method in the R statistical computing environment is available at http://works.bepress.com/debashis_ghosh/60/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0506-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-11 /pmc/articles/PMC4359587/ /pubmed/25887233 http://dx.doi.org/10.1186/s12859-015-0506-3 Text en © Zhan et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Zhan, Xiang
Patterson, Andrew D
Ghosh, Debashis
Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data
title Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data
title_full Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data
title_fullStr Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data
title_full_unstemmed Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data
title_short Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data
title_sort kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4359587/
https://www.ncbi.nlm.nih.gov/pubmed/25887233
http://dx.doi.org/10.1186/s12859-015-0506-3
work_keys_str_mv AT zhanxiang kernelapproachesfordifferentialexpressionanalysisofmassspectrometrybasedmetabolomicsdata
AT pattersonandrewd kernelapproachesfordifferentialexpressionanalysisofmassspectrometrybasedmetabolomicsdata
AT ghoshdebashis kernelapproachesfordifferentialexpressionanalysisofmassspectrometrybasedmetabolomicsdata