Cargando…
Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data
Accurate context-specific Gene Regulatory Networks (GRNs) inference from genomics data is a crucial task in computational biology. However, existing methods face limitations, such as reliance on gene expression data alone, lower resolution from bulk data, and data scarcity for specific cellular syst...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418251/ https://www.ncbi.nlm.nih.gov/pubmed/37577525 http://dx.doi.org/10.1101/2023.08.01.551575 |
_version_ | 1785088222782554112 |
---|---|
author | Yuan, Qiuyue Duren, Zhana |
author_facet | Yuan, Qiuyue Duren, Zhana |
author_sort | Yuan, Qiuyue |
collection | PubMed |
description | Accurate context-specific Gene Regulatory Networks (GRNs) inference from genomics data is a crucial task in computational biology. However, existing methods face limitations, such as reliance on gene expression data alone, lower resolution from bulk data, and data scarcity for specific cellular systems. Despite recent technological advancements, including single-cell sequencing and the integration of ATAC-seq and RNA-seq data, learning such complex mechanisms from limited independent data points still presents a daunting challenge, impeding GRN inference accuracy. To overcome this challenge, we present LINGER (LIfelong neural Network for GEne Regulation), a novel deep learning-based method to infer GRNs from single-cell multiome data with paired gene expression and chromatin accessibility data from the same cell. LINGER incorporates both 1) atlas-scale external bulk data across diverse cellular contexts and 2) the knowledge of transcription factor (TF) motif matching to cis-regulatory elements as a manifold regularization to address the challenge of limited data and extensive parameter space in GRN inference. Our results demonstrate that LINGER achieves 2–3 fold higher accuracy over existing methods. LINGER reveals a complex regulatory landscape of genome-wide association studies, enabling enhanced interpretation of disease-associated variants and genes. Additionally, following the GRN inference from a reference sc-multiome data, LINGER allows for the estimation of TF activity solely from bulk or single-cell gene expression data, leveraging the abundance of available gene expression data to identify driver regulators from case-control studies. Overall, LINGER provides a comprehensive tool for robust gene regulation inference from genomics data, empowering deeper insights into cellular mechanisms. |
format | Online Article Text |
id | pubmed-10418251 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104182512023-08-12 Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data Yuan, Qiuyue Duren, Zhana bioRxiv Article Accurate context-specific Gene Regulatory Networks (GRNs) inference from genomics data is a crucial task in computational biology. However, existing methods face limitations, such as reliance on gene expression data alone, lower resolution from bulk data, and data scarcity for specific cellular systems. Despite recent technological advancements, including single-cell sequencing and the integration of ATAC-seq and RNA-seq data, learning such complex mechanisms from limited independent data points still presents a daunting challenge, impeding GRN inference accuracy. To overcome this challenge, we present LINGER (LIfelong neural Network for GEne Regulation), a novel deep learning-based method to infer GRNs from single-cell multiome data with paired gene expression and chromatin accessibility data from the same cell. LINGER incorporates both 1) atlas-scale external bulk data across diverse cellular contexts and 2) the knowledge of transcription factor (TF) motif matching to cis-regulatory elements as a manifold regularization to address the challenge of limited data and extensive parameter space in GRN inference. Our results demonstrate that LINGER achieves 2–3 fold higher accuracy over existing methods. LINGER reveals a complex regulatory landscape of genome-wide association studies, enabling enhanced interpretation of disease-associated variants and genes. Additionally, following the GRN inference from a reference sc-multiome data, LINGER allows for the estimation of TF activity solely from bulk or single-cell gene expression data, leveraging the abundance of available gene expression data to identify driver regulators from case-control studies. Overall, LINGER provides a comprehensive tool for robust gene regulation inference from genomics data, empowering deeper insights into cellular mechanisms. Cold Spring Harbor Laboratory 2023-08-03 /pmc/articles/PMC10418251/ /pubmed/37577525 http://dx.doi.org/10.1101/2023.08.01.551575 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Yuan, Qiuyue Duren, Zhana Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data |
title | Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data |
title_full | Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data |
title_fullStr | Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data |
title_full_unstemmed | Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data |
title_short | Continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data |
title_sort | continuous lifelong learning for modeling of gene regulation from single cell multiome data by leveraging atlas-scale external data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418251/ https://www.ncbi.nlm.nih.gov/pubmed/37577525 http://dx.doi.org/10.1101/2023.08.01.551575 |
work_keys_str_mv | AT yuanqiuyue continuouslifelonglearningformodelingofgeneregulationfromsinglecellmultiomedatabyleveragingatlasscaleexternaldata AT durenzhana continuouslifelonglearningformodelingofgeneregulationfromsinglecellmultiomedatabyleveragingatlasscaleexternaldata |