Cargando…

CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data

Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding genetic data. Each omics technique only provides a limited view of the underlying biological process and integrating heterogeneous omics layers simultaneously would lead to a more c...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhao, Chen, Liu, Anqi, Zhang, Xiao, Cao, Xuewei, Ding, Zhengming, Sha, Qiuying, Shen, Hui, Deng, Hong-Wen, Zhou, Weihua
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cornell University 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120753/ https://www.ncbi.nlm.nih.gov/pubmed/37090237

_version_	1785029233780719616
author	Zhao, Chen Liu, Anqi Zhang, Xiao Cao, Xuewei Ding, Zhengming Sha, Qiuying Shen, Hui Deng, Hong-Wen Zhou, Weihua
author_facet	Zhao, Chen Liu, Anqi Zhang, Xiao Cao, Xuewei Ding, Zhengming Sha, Qiuying Shen, Hui Deng, Hong-Wen Zhou, Weihua
author_sort	Zhao, Chen
collection	PubMed
description	Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding genetic data. Each omics technique only provides a limited view of the underlying biological process and integrating heterogeneous omics layers simultaneously would lead to a more comprehensive and detailed understanding of diseases and phenotypes. However, one obstacle faced when performing multi-omics data integration is the existence of unpaired multi-omics data due to instrument sensitivity and cost. Studies may fail if certain aspects of the subjects are missing or incomplete. In this paper, we propose a deep learning method for multi-omics integration with incomplete data by Cross-omics Linked unified embedding with Contrastive Learning and Self Attention (CLCLSA). Utilizing complete multi-omics data as supervision, the model employs cross-omics autoencoders to learn the feature representation across different types of biological data. The multi-omics contrastive learning, which is used to maximize the mutual information between different types of omics, is employed before latent feature concatenation. In addition, the feature-level self-attention and omics-level self-attention are employed to dynamically identify the most informative features for multi-omics data integration. Extensive experiments were conducted on four public multi-omics datasets. The experimental results indicated that the proposed CLCLSA outperformed the state-of-the-art approaches for multi-omics data classification using incomplete multi-omics data.
format	Online Article Text
id	pubmed-10120753
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Cornell University
record_format	MEDLINE/PubMed
spelling	pubmed-101207532023-04-22 CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data Zhao, Chen Liu, Anqi Zhang, Xiao Cao, Xuewei Ding, Zhengming Sha, Qiuying Shen, Hui Deng, Hong-Wen Zhou, Weihua ArXiv Article Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding genetic data. Each omics technique only provides a limited view of the underlying biological process and integrating heterogeneous omics layers simultaneously would lead to a more comprehensive and detailed understanding of diseases and phenotypes. However, one obstacle faced when performing multi-omics data integration is the existence of unpaired multi-omics data due to instrument sensitivity and cost. Studies may fail if certain aspects of the subjects are missing or incomplete. In this paper, we propose a deep learning method for multi-omics integration with incomplete data by Cross-omics Linked unified embedding with Contrastive Learning and Self Attention (CLCLSA). Utilizing complete multi-omics data as supervision, the model employs cross-omics autoencoders to learn the feature representation across different types of biological data. The multi-omics contrastive learning, which is used to maximize the mutual information between different types of omics, is employed before latent feature concatenation. In addition, the feature-level self-attention and omics-level self-attention are employed to dynamically identify the most informative features for multi-omics data integration. Extensive experiments were conducted on four public multi-omics datasets. The experimental results indicated that the proposed CLCLSA outperformed the state-of-the-art approaches for multi-omics data classification using incomplete multi-omics data. Cornell University 2023-04-12 /pmc/articles/PMC10120753/ /pubmed/37090237 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle	Article Zhao, Chen Liu, Anqi Zhang, Xiao Cao, Xuewei Ding, Zhengming Sha, Qiuying Shen, Hui Deng, Hong-Wen Zhou, Weihua CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data
title	CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data
title_full	CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data
title_fullStr	CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data
title_full_unstemmed	CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data
title_short	CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data
title_sort	clclsa: cross-omics linked embedding with contrastive learning and self attention for multi-omics integration with incomplete multi-omics data
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120753/ https://www.ncbi.nlm.nih.gov/pubmed/37090237
work_keys_str_mv	AT zhaochen clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT liuanqi clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT zhangxiao clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT caoxuewei clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT dingzhengming clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT shaqiuying clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT shenhui clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT denghongwen clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata AT zhouweihua clclsacrossomicslinkedembeddingwithcontrastivelearningandselfattentionformultiomicsintegrationwithincompletemultiomicsdata

CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data

Ejemplares similares