Cargando…

Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease

Multi-omics studies have enabled us to understand the mechanistic drivers behind complex disease states and progressions, thereby providing novel and actionable biological insights into health status. However, integrating data from multiple modalities is challenging due to high dimensionality and di...

Descripción completa

Detalles Bibliográficos
Autores principales: Maitra, Chayan, Seal, Dibyendu B., Das, Vivek, De, Rajat K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10244650/
https://www.ncbi.nlm.nih.gov/pubmed/37293552
http://dx.doi.org/10.3389/fmolb.2023.1184748
_version_ 1785054688017645568
author Maitra, Chayan
Seal, Dibyendu B.
Das, Vivek
De, Rajat K.
author_facet Maitra, Chayan
Seal, Dibyendu B.
Das, Vivek
De, Rajat K.
author_sort Maitra, Chayan
collection PubMed
description Multi-omics studies have enabled us to understand the mechanistic drivers behind complex disease states and progressions, thereby providing novel and actionable biological insights into health status. However, integrating data from multiple modalities is challenging due to high dimensionality and diverse nature of data, and noise associated with each platform. Sparsity in data, non-overlapping features and technical batch effects make the task of learning more complicated. Conventional machine learning (ML) tools are not quite effective against such data integration hazards due to their simplistic nature with less capacity. In addition, existing methods for single cell multi-omics integration are computationally expensive. Therefore, in this work, we have introduced a novel Unsupervised neural network for single cell Multi-omics INTegration (UMINT). UMINT serves as a promising model for integrating variable number of single cell omics layers with high dimensions. It has a light-weight architecture with substantially reduced number of parameters. The proposed model is capable of learning a latent low-dimensional embedding that can extract useful features from the data facilitating further downstream analyses. UMINT has been applied to integrate healthy and disease CITE-seq (paired RNA and surface proteins) datasets including a rare disease Mucosa-Associated Lymphoid Tissue (MALT) tumor. It has been benchmarked against existing state-of-the-art methods for single cell multi-omics integration. Furthermore, UMINT is capable of integrating paired single cell gene expression and ATAC-seq (Transposase-Accessible Chromatin) assays as well.
format Online
Article
Text
id pubmed-10244650
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102446502023-06-08 Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease Maitra, Chayan Seal, Dibyendu B. Das, Vivek De, Rajat K. Front Mol Biosci Molecular Biosciences Multi-omics studies have enabled us to understand the mechanistic drivers behind complex disease states and progressions, thereby providing novel and actionable biological insights into health status. However, integrating data from multiple modalities is challenging due to high dimensionality and diverse nature of data, and noise associated with each platform. Sparsity in data, non-overlapping features and technical batch effects make the task of learning more complicated. Conventional machine learning (ML) tools are not quite effective against such data integration hazards due to their simplistic nature with less capacity. In addition, existing methods for single cell multi-omics integration are computationally expensive. Therefore, in this work, we have introduced a novel Unsupervised neural network for single cell Multi-omics INTegration (UMINT). UMINT serves as a promising model for integrating variable number of single cell omics layers with high dimensions. It has a light-weight architecture with substantially reduced number of parameters. The proposed model is capable of learning a latent low-dimensional embedding that can extract useful features from the data facilitating further downstream analyses. UMINT has been applied to integrate healthy and disease CITE-seq (paired RNA and surface proteins) datasets including a rare disease Mucosa-Associated Lymphoid Tissue (MALT) tumor. It has been benchmarked against existing state-of-the-art methods for single cell multi-omics integration. Furthermore, UMINT is capable of integrating paired single cell gene expression and ATAC-seq (Transposase-Accessible Chromatin) assays as well. Frontiers Media S.A. 2023-05-24 /pmc/articles/PMC10244650/ /pubmed/37293552 http://dx.doi.org/10.3389/fmolb.2023.1184748 Text en Copyright © 2023 Maitra, Seal, Das and De. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Maitra, Chayan
Seal, Dibyendu B.
Das, Vivek
De, Rajat K.
Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease
title Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease
title_full Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease
title_fullStr Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease
title_full_unstemmed Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease
title_short Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease
title_sort unsupervised neural network for single cell multi-omics integration (umint): an application to health and disease
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10244650/
https://www.ncbi.nlm.nih.gov/pubmed/37293552
http://dx.doi.org/10.3389/fmolb.2023.1184748
work_keys_str_mv AT maitrachayan unsupervisedneuralnetworkforsinglecellmultiomicsintegrationumintanapplicationtohealthanddisease
AT sealdibyendub unsupervisedneuralnetworkforsinglecellmultiomicsintegrationumintanapplicationtohealthanddisease
AT dasvivek unsupervisedneuralnetworkforsinglecellmultiomicsintegrationumintanapplicationtohealthanddisease
AT derajatk unsupervisedneuralnetworkforsinglecellmultiomicsintegrationumintanapplicationtohealthanddisease