Cargando…
Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
BACKGROUND: Personalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a mod...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7870068/ https://www.ncbi.nlm.nih.gov/pubmed/33556098 http://dx.doi.org/10.1371/journal.pone.0246367 |
_version_ | 1783648738092253184 |
---|---|
author | Revilla, Lluís Mayorgas, Aida Corraliza, Ana M. Masamunt, Maria C. Metwaly, Amira Haller, Dirk Tristán, Eva Carrasco, Anna Esteve, Maria Panés, Julian Ricart, Elena Lozano, Juan J. Salas, Azucena |
author_facet | Revilla, Lluís Mayorgas, Aida Corraliza, Ana M. Masamunt, Maria C. Metwaly, Amira Haller, Dirk Tristán, Eva Carrasco, Anna Esteve, Maria Panés, Julian Ricart, Elena Lozano, Juan J. Salas, Azucena |
author_sort | Revilla, Lluís |
collection | PubMed |
description | BACKGROUND: Personalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a model of the expected interaction between those variables. Describing these interactions is challenging when the relationship is unknown or when there is no pre-established hypothesis. Thus, our aim was to develop a method to find the relationships between microbiome and host transcriptome data and the relevant clinical variables in a complex disease, such as Crohn’s disease. RESULTS: We present here a method to identify interactions based on canonical correlation analysis. We show that the model is the most important factor to identify relationships between blocks using a dataset of Crohn’s disease patients with longitudinal sampling. First the analysis was tested in two previously published datasets: a glioma and a Crohn’s disease and ulcerative colitis dataset where we describe how to select the optimum parameters. Using such parameters, we analyzed our Crohn’s disease data set. We selected the model with the highest inner average variance explained to identify relationships between transcriptome, gut microbiome and clinically relevant variables. Adding the clinically relevant variables improved the average variance explained by the model compared to multiple co-inertia analysis. CONCLUSIONS: The methodology described herein provides a general framework for identifying interactions between sets of omic data and clinically relevant variables. Following this method, we found genes and microorganisms that were related to each other independently of the model, while others were specific to the model used. Thus, model selection proved crucial to finding the existing relationships in multi-omics datasets. |
format | Online Article Text |
id | pubmed-7870068 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-78700682021-02-11 Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis Revilla, Lluís Mayorgas, Aida Corraliza, Ana M. Masamunt, Maria C. Metwaly, Amira Haller, Dirk Tristán, Eva Carrasco, Anna Esteve, Maria Panés, Julian Ricart, Elena Lozano, Juan J. Salas, Azucena PLoS One Research Article BACKGROUND: Personalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a model of the expected interaction between those variables. Describing these interactions is challenging when the relationship is unknown or when there is no pre-established hypothesis. Thus, our aim was to develop a method to find the relationships between microbiome and host transcriptome data and the relevant clinical variables in a complex disease, such as Crohn’s disease. RESULTS: We present here a method to identify interactions based on canonical correlation analysis. We show that the model is the most important factor to identify relationships between blocks using a dataset of Crohn’s disease patients with longitudinal sampling. First the analysis was tested in two previously published datasets: a glioma and a Crohn’s disease and ulcerative colitis dataset where we describe how to select the optimum parameters. Using such parameters, we analyzed our Crohn’s disease data set. We selected the model with the highest inner average variance explained to identify relationships between transcriptome, gut microbiome and clinically relevant variables. Adding the clinically relevant variables improved the average variance explained by the model compared to multiple co-inertia analysis. CONCLUSIONS: The methodology described herein provides a general framework for identifying interactions between sets of omic data and clinically relevant variables. Following this method, we found genes and microorganisms that were related to each other independently of the model, while others were specific to the model used. Thus, model selection proved crucial to finding the existing relationships in multi-omics datasets. Public Library of Science 2021-02-08 /pmc/articles/PMC7870068/ /pubmed/33556098 http://dx.doi.org/10.1371/journal.pone.0246367 Text en © 2021 Revilla et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Revilla, Lluís Mayorgas, Aida Corraliza, Ana M. Masamunt, Maria C. Metwaly, Amira Haller, Dirk Tristán, Eva Carrasco, Anna Esteve, Maria Panés, Julian Ricart, Elena Lozano, Juan J. Salas, Azucena Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis |
title | Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis |
title_full | Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis |
title_fullStr | Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis |
title_full_unstemmed | Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis |
title_short | Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis |
title_sort | multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7870068/ https://www.ncbi.nlm.nih.gov/pubmed/33556098 http://dx.doi.org/10.1371/journal.pone.0246367 |
work_keys_str_mv | AT revillalluis multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT mayorgasaida multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT corralizaanam multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT masamuntmariac multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT metwalyamira multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT hallerdirk multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT tristaneva multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT carrascoanna multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT estevemaria multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT panesjulian multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT ricartelena multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT lozanojuanj multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis AT salasazucena multiomicmodellingofinflammatoryboweldiseasewithregularizedcanonicalcorrelationanalysis |