Cargando…

Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis

BACKGROUND: Personalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Revilla, Lluís, Mayorgas, Aida, Corraliza, Ana M., Masamunt, Maria C., Metwaly, Amira, Haller, Dirk, Tristán, Eva, Carrasco, Anna, Esteve, Maria, Panés, Julian, Ricart, Elena, Lozano, Juan J., Salas, Azucena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7870068/
https://www.ncbi.nlm.nih.gov/pubmed/33556098
http://dx.doi.org/10.1371/journal.pone.0246367
_version_ 1783648738092253184
author Revilla, Lluís
Mayorgas, Aida
Corraliza, Ana M.
Masamunt, Maria C.
Metwaly, Amira
Haller, Dirk
Tristán, Eva
Carrasco, Anna
Esteve, Maria
Panés, Julian
Ricart, Elena
Lozano, Juan J.
Salas, Azucena
author_facet Revilla, Lluís
Mayorgas, Aida
Corraliza, Ana M.
Masamunt, Maria C.
Metwaly, Amira
Haller, Dirk
Tristán, Eva
Carrasco, Anna
Esteve, Maria
Panés, Julian
Ricart, Elena
Lozano, Juan J.
Salas, Azucena
author_sort Revilla, Lluís
collection PubMed
description BACKGROUND: Personalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a model of the expected interaction between those variables. Describing these interactions is challenging when the relationship is unknown or when there is no pre-established hypothesis. Thus, our aim was to develop a method to find the relationships between microbiome and host transcriptome data and the relevant clinical variables in a complex disease, such as Crohn’s disease. RESULTS: We present here a method to identify interactions based on canonical correlation analysis. We show that the model is the most important factor to identify relationships between blocks using a dataset of Crohn’s disease patients with longitudinal sampling. First the analysis was tested in two previously published datasets: a glioma and a Crohn’s disease and ulcerative colitis dataset where we describe how to select the optimum parameters. Using such parameters, we analyzed our Crohn’s disease data set. We selected the model with the highest inner average variance explained to identify relationships between transcriptome, gut microbiome and clinically relevant variables. Adding the clinically relevant variables improved the average variance explained by the model compared to multiple co-inertia analysis. CONCLUSIONS: The methodology described herein provides a general framework for identifying interactions between sets of omic data and clinically relevant variables. Following this method, we found genes and microorganisms that were related to each other independently of the model, while others were specific to the model used. Thus, model selection proved crucial to finding the existing relationships in multi-omics datasets.
format Online
Article
Text
id pubmed-7870068
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-78700682021-02-11 Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis Revilla, Lluís Mayorgas, Aida Corraliza, Ana M. Masamunt, Maria C. Metwaly, Amira Haller, Dirk Tristán, Eva Carrasco, Anna Esteve, Maria Panés, Julian Ricart, Elena Lozano, Juan J. Salas, Azucena PLoS One Research Article BACKGROUND: Personalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a model of the expected interaction between those variables. Describing these interactions is challenging when the relationship is unknown or when there is no pre-established hypothesis. Thus, our aim was to develop a method to find the relationships between microbiome and host transcriptome data and the relevant clinical variables in a complex disease, such as Crohn’s disease. RESULTS: We present here a method to identify interactions based on canonical correlation analysis. We show that the model is the most important factor to identify relationships between blocks using a dataset of Crohn’s disease patients with longitudinal sampling. First the analysis was tested in two previously published datasets: a glioma and a Crohn’s disease and ulcerative colitis dataset where we describe how to select the optimum parameters. Using such parameters, we analyzed our Crohn’s disease data set. We selected the model with the highest inner average variance explained to identify relationships between transcriptome, gut microbiome and clinically relevant variables. Adding the clinically relevant variables improved the average variance explained by the model compared to multiple co-inertia analysis. CONCLUSIONS: The methodology described herein provides a general framework for identifying interactions between sets of omic data and clinically relevant variables. Following this method, we found genes and microorganisms that were related to each other independently of the model, while others were specific to the model used. Thus, model selection proved crucial to finding the existing relationships in multi-omics datasets. Public Library of Science 2021-02-08 /pmc/articles/PMC7870068/ /pubmed/33556098 http://dx.doi.org/10.1371/journal.pone.0246367 Text en © 2021 Revilla et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Revilla, Lluís
Mayorgas, Aida
Corraliza, Ana M.
Masamunt, Maria C.
Metwaly, Amira
Haller, Dirk
Tristán, Eva
Carrasco, Anna
Esteve, Maria
Panés, Julian
Ricart, Elena
Lozano, Juan J.
Salas, Azucena
Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
title Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
title_full Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
title_fullStr Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
title_full_unstemmed Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
title_short Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
title_sort multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7870068/
https://www.ncbi.nlm.nih.gov/pubmed/33556098
http://dx.doi.org/10.1371/journal.pone.0246367
work_keys_str_mv AT revillalluis multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT mayorgasaida multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT corralizaanam multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT masamuntmariac multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT metwalyamira multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT hallerdirk multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT tristaneva multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT carrascoanna multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT estevemaria multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT panesjulian multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT ricartelena multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT lozanojuanj multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis
AT salasazucena multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis