Cargando…

IntLIM: integration using linear models of metabolomics and gene expression data

BACKGROUND: Integration of transcriptomic and metabolomic data improves functional interpretation of disease-related metabolomic phenotypes, and facilitates discovery of putative metabolite biomarkers and gene targets. For this reason, these data are increasingly collected in large (> 100 partici...

Descripción completa

Detalles Bibliográficos
Autores principales:	Siddiqui, Jalal K., Baskin, Elizabeth, Liu, Mingrui, Cantemir-Stone, Carmen Z., Zhang, Bofei, Bonneville, Russell, McElroy, Joseph P., Coombes, Kevin R., Mathé, Ewy A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5838881/ https://www.ncbi.nlm.nih.gov/pubmed/29506475 http://dx.doi.org/10.1186/s12859-018-2085-6

_version_	1783304322089484288
author	Siddiqui, Jalal K. Baskin, Elizabeth Liu, Mingrui Cantemir-Stone, Carmen Z. Zhang, Bofei Bonneville, Russell McElroy, Joseph P. Coombes, Kevin R. Mathé, Ewy A.
author_facet	Siddiqui, Jalal K. Baskin, Elizabeth Liu, Mingrui Cantemir-Stone, Carmen Z. Zhang, Bofei Bonneville, Russell McElroy, Joseph P. Coombes, Kevin R. Mathé, Ewy A.
author_sort	Siddiqui, Jalal K.
collection	PubMed
description	BACKGROUND: Integration of transcriptomic and metabolomic data improves functional interpretation of disease-related metabolomic phenotypes, and facilitates discovery of putative metabolite biomarkers and gene targets. For this reason, these data are increasingly collected in large (> 100 participants) cohorts, thereby driving a need for the development of user-friendly and open-source methods/tools for their integration. Of note, clinical/translational studies typically provide snapshot (e.g. one time point) gene and metabolite profiles and, oftentimes, most metabolites measured are not identified. Thus, in these types of studies, pathway/network approaches that take into account the complexity of transcript-metabolite relationships may neither be applicable nor readily uncover novel relationships. With this in mind, we propose a simple linear modeling approach to capture disease-(or other phenotype) specific gene-metabolite associations, with the assumption that co-regulation patterns reflect functionally related genes and metabolites. RESULTS: The proposed linear model, metabolite ~ gene + phenotype + gene:phenotype, specifically evaluates whether gene-metabolite relationships differ by phenotype, by testing whether the relationship in one phenotype is significantly different from the relationship in another phenotype (via a statistical interaction gene:phenotype p-value). Statistical interaction p-values for all possible gene-metabolite pairs are computed and significant pairs are then clustered by the directionality of associations (e.g. strong positive association in one phenotype, strong negative association in another phenotype). We implemented our approach as an R package, IntLIM, which includes a user-friendly R Shiny web interface, thereby making the integrative analyses accessible to non-computational experts. We applied IntLIM to two previously published datasets, collected in the NCI-60 cancer cell lines and in human breast tumor and non-tumor tissue, for which transcriptomic and metabolomic data are available. We demonstrate that IntLIM captures relevant tumor-specific gene-metabolite associations involved in known cancer-related pathways, including glutamine metabolism. Using IntLIM, we also uncover biologically relevant novel relationships that could be further tested experimentally. CONCLUSIONS: IntLIM provides a user-friendly, reproducible framework to integrate transcriptomic and metabolomic data and help interpret metabolomic data and uncover novel gene-metabolite relationships. The IntLIM R package is publicly available in GitHub (https://github.com/mathelab/IntLIM) and includes a user-friendly web application, vignettes, sample data and data/code to reproduce results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2085-6) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5838881
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-58388812018-03-09 IntLIM: integration using linear models of metabolomics and gene expression data Siddiqui, Jalal K. Baskin, Elizabeth Liu, Mingrui Cantemir-Stone, Carmen Z. Zhang, Bofei Bonneville, Russell McElroy, Joseph P. Coombes, Kevin R. Mathé, Ewy A. BMC Bioinformatics Research Article BACKGROUND: Integration of transcriptomic and metabolomic data improves functional interpretation of disease-related metabolomic phenotypes, and facilitates discovery of putative metabolite biomarkers and gene targets. For this reason, these data are increasingly collected in large (> 100 participants) cohorts, thereby driving a need for the development of user-friendly and open-source methods/tools for their integration. Of note, clinical/translational studies typically provide snapshot (e.g. one time point) gene and metabolite profiles and, oftentimes, most metabolites measured are not identified. Thus, in these types of studies, pathway/network approaches that take into account the complexity of transcript-metabolite relationships may neither be applicable nor readily uncover novel relationships. With this in mind, we propose a simple linear modeling approach to capture disease-(or other phenotype) specific gene-metabolite associations, with the assumption that co-regulation patterns reflect functionally related genes and metabolites. RESULTS: The proposed linear model, metabolite ~ gene + phenotype + gene:phenotype, specifically evaluates whether gene-metabolite relationships differ by phenotype, by testing whether the relationship in one phenotype is significantly different from the relationship in another phenotype (via a statistical interaction gene:phenotype p-value). Statistical interaction p-values for all possible gene-metabolite pairs are computed and significant pairs are then clustered by the directionality of associations (e.g. strong positive association in one phenotype, strong negative association in another phenotype). We implemented our approach as an R package, IntLIM, which includes a user-friendly R Shiny web interface, thereby making the integrative analyses accessible to non-computational experts. We applied IntLIM to two previously published datasets, collected in the NCI-60 cancer cell lines and in human breast tumor and non-tumor tissue, for which transcriptomic and metabolomic data are available. We demonstrate that IntLIM captures relevant tumor-specific gene-metabolite associations involved in known cancer-related pathways, including glutamine metabolism. Using IntLIM, we also uncover biologically relevant novel relationships that could be further tested experimentally. CONCLUSIONS: IntLIM provides a user-friendly, reproducible framework to integrate transcriptomic and metabolomic data and help interpret metabolomic data and uncover novel gene-metabolite relationships. The IntLIM R package is publicly available in GitHub (https://github.com/mathelab/IntLIM) and includes a user-friendly web application, vignettes, sample data and data/code to reproduce results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2085-6) contains supplementary material, which is available to authorized users. BioMed Central 2018-03-05 /pmc/articles/PMC5838881/ /pubmed/29506475 http://dx.doi.org/10.1186/s12859-018-2085-6 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Siddiqui, Jalal K. Baskin, Elizabeth Liu, Mingrui Cantemir-Stone, Carmen Z. Zhang, Bofei Bonneville, Russell McElroy, Joseph P. Coombes, Kevin R. Mathé, Ewy A. IntLIM: integration using linear models of metabolomics and gene expression data
title	IntLIM: integration using linear models of metabolomics and gene expression data
title_full	IntLIM: integration using linear models of metabolomics and gene expression data
title_fullStr	IntLIM: integration using linear models of metabolomics and gene expression data
title_full_unstemmed	IntLIM: integration using linear models of metabolomics and gene expression data
title_short	IntLIM: integration using linear models of metabolomics and gene expression data
title_sort	intlim: integration using linear models of metabolomics and gene expression data
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5838881/ https://www.ncbi.nlm.nih.gov/pubmed/29506475 http://dx.doi.org/10.1186/s12859-018-2085-6
work_keys_str_mv	AT siddiquijalalk intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT baskinelizabeth intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT liumingrui intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT cantemirstonecarmenz intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT zhangbofei intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT bonnevillerussell intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT mcelroyjosephp intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT coombeskevinr intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata AT matheewya intlimintegrationusinglinearmodelsofmetabolomicsandgeneexpressiondata

IntLIM: integration using linear models of metabolomics and gene expression data

Ejemplares similares