Cargando…

MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis

Multi-omics data is frequently measured to enrich the comprehension of biological mechanisms underlying certain phenotypes. However, due to the complex relations and high dimension of multi-omics data, it is difficult to associate omics features to certain biological traits of interest. For example,...

Descripción completa

Detalles Bibliográficos
Autores principales: Jung, Inuk, Kim, Minsu, Rhee, Sungmin, Lim, Sangsoo, Kim, Sun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8461247/
https://www.ncbi.nlm.nih.gov/pubmed/34567063
http://dx.doi.org/10.3389/fgene.2021.682841
_version_ 1784571935295799296
author Jung, Inuk
Kim, Minsu
Rhee, Sungmin
Lim, Sangsoo
Kim, Sun
author_facet Jung, Inuk
Kim, Minsu
Rhee, Sungmin
Lim, Sangsoo
Kim, Sun
author_sort Jung, Inuk
collection PubMed
description Multi-omics data is frequently measured to enrich the comprehension of biological mechanisms underlying certain phenotypes. However, due to the complex relations and high dimension of multi-omics data, it is difficult to associate omics features to certain biological traits of interest. For example, the clinically valuable breast cancer subtypes are well-defined at the molecular level, but are poorly classified using gene expression data. Here, we propose a multi-omics analysis method called MONTI (Multi-Omics Non-negative Tensor decomposition for Integrative analysis), which goal is to select multi-omics features that are able to represent trait specific characteristics. Here, we demonstrate the strength of multi-omics integrated analysis in terms of cancer subtyping. The multi-omics data are first integrated in a biologically meaningful manner to form a three dimensional tensor, which is then decomposed using a non-negative tensor decomposition method. From the result, MONTI selects highly informative subtype specific multi-omics features. MONTI was applied to three case studies of 597 breast cancer, 314 colon cancer, and 305 stomach cancer cohorts. For all the case studies, we found that the subtype classification accuracy significantly improved when utilizing all available multi-omics data. MONTI was able to detect subtype specific gene sets that showed to be strongly regulated by certain omics, from which correlation between omics types could be inferred. Furthermore, various clinical attributes of nine cancer types were analyzed using MONTI, which showed that some clinical attributes could be well explained using multi-omics data. We demonstrated that integrating multi-omics data in a gene centric manner improves detecting cancer subtype specific features and other clinical features, which may be used to further understand the molecular characteristics of interest. The software and data used in this study are available at: https://github.com/inukj/MONTI.
format Online
Article
Text
id pubmed-8461247
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84612472021-09-25 MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis Jung, Inuk Kim, Minsu Rhee, Sungmin Lim, Sangsoo Kim, Sun Front Genet Genetics Multi-omics data is frequently measured to enrich the comprehension of biological mechanisms underlying certain phenotypes. However, due to the complex relations and high dimension of multi-omics data, it is difficult to associate omics features to certain biological traits of interest. For example, the clinically valuable breast cancer subtypes are well-defined at the molecular level, but are poorly classified using gene expression data. Here, we propose a multi-omics analysis method called MONTI (Multi-Omics Non-negative Tensor decomposition for Integrative analysis), which goal is to select multi-omics features that are able to represent trait specific characteristics. Here, we demonstrate the strength of multi-omics integrated analysis in terms of cancer subtyping. The multi-omics data are first integrated in a biologically meaningful manner to form a three dimensional tensor, which is then decomposed using a non-negative tensor decomposition method. From the result, MONTI selects highly informative subtype specific multi-omics features. MONTI was applied to three case studies of 597 breast cancer, 314 colon cancer, and 305 stomach cancer cohorts. For all the case studies, we found that the subtype classification accuracy significantly improved when utilizing all available multi-omics data. MONTI was able to detect subtype specific gene sets that showed to be strongly regulated by certain omics, from which correlation between omics types could be inferred. Furthermore, various clinical attributes of nine cancer types were analyzed using MONTI, which showed that some clinical attributes could be well explained using multi-omics data. We demonstrated that integrating multi-omics data in a gene centric manner improves detecting cancer subtype specific features and other clinical features, which may be used to further understand the molecular characteristics of interest. The software and data used in this study are available at: https://github.com/inukj/MONTI. Frontiers Media S.A. 2021-09-10 /pmc/articles/PMC8461247/ /pubmed/34567063 http://dx.doi.org/10.3389/fgene.2021.682841 Text en Copyright © 2021 Jung, Kim, Rhee, Lim and Kim. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Jung, Inuk
Kim, Minsu
Rhee, Sungmin
Lim, Sangsoo
Kim, Sun
MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis
title MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis
title_full MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis
title_fullStr MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis
title_full_unstemmed MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis
title_short MONTI: A Multi-Omics Non-negative Tensor Decomposition Framework for Gene-Level Integrative Analysis
title_sort monti: a multi-omics non-negative tensor decomposition framework for gene-level integrative analysis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8461247/
https://www.ncbi.nlm.nih.gov/pubmed/34567063
http://dx.doi.org/10.3389/fgene.2021.682841
work_keys_str_mv AT junginuk montiamultiomicsnonnegativetensordecompositionframeworkforgenelevelintegrativeanalysis
AT kimminsu montiamultiomicsnonnegativetensordecompositionframeworkforgenelevelintegrativeanalysis
AT rheesungmin montiamultiomicsnonnegativetensordecompositionframeworkforgenelevelintegrativeanalysis
AT limsangsoo montiamultiomicsnonnegativetensordecompositionframeworkforgenelevelintegrativeanalysis
AT kimsun montiamultiomicsnonnegativetensordecompositionframeworkforgenelevelintegrativeanalysis