Cargando…

Cell type–specific interpretation of noncoding variants using deep learning–based methods

Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding...

Descripción completa

Detalles Bibliográficos
Autores principales: Sindeeva, Maria, Chekanov, Nikolay, Avetisian, Manvel, Shashkova, Tatiana I, Baranov, Nikita, Malkin, Elian, Lapin, Alexander, Kardymon, Olga, Fishman, Veniamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041527/
https://www.ncbi.nlm.nih.gov/pubmed/36971292
http://dx.doi.org/10.1093/gigascience/giad015
_version_ 1784912739565568000
author Sindeeva, Maria
Chekanov, Nikolay
Avetisian, Manvel
Shashkova, Tatiana I
Baranov, Nikita
Malkin, Elian
Lapin, Alexander
Kardymon, Olga
Fishman, Veniamin
author_facet Sindeeva, Maria
Chekanov, Nikolay
Avetisian, Manvel
Shashkova, Tatiana I
Baranov, Nikita
Malkin, Elian
Lapin, Alexander
Kardymon, Olga
Fishman, Veniamin
author_sort Sindeeva, Maria
collection PubMed
description Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding mutations. However, these approaches require specific experimental data for training and cannot generalize across cell types where required features were not experimentally measured. We show here that available epigenetic characteristics of human cell types are extremely sparse, limiting those approaches that rely on specific epigenetic input. We propose a new neural network architecture, DeepCT, which can learn complex interconnections of epigenetic features and infer unmeasured data from any available input. Furthermore, we show that DeepCT can learn cell type–specific properties, build biologically meaningful vector representations of cell types, and utilize these representations to generate cell type–specific predictions of the effects of noncoding variations in the human genome.
format Online
Article
Text
id pubmed-10041527
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100415272023-03-28 Cell type–specific interpretation of noncoding variants using deep learning–based methods Sindeeva, Maria Chekanov, Nikolay Avetisian, Manvel Shashkova, Tatiana I Baranov, Nikita Malkin, Elian Lapin, Alexander Kardymon, Olga Fishman, Veniamin Gigascience Research Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding mutations. However, these approaches require specific experimental data for training and cannot generalize across cell types where required features were not experimentally measured. We show here that available epigenetic characteristics of human cell types are extremely sparse, limiting those approaches that rely on specific epigenetic input. We propose a new neural network architecture, DeepCT, which can learn complex interconnections of epigenetic features and infer unmeasured data from any available input. Furthermore, we show that DeepCT can learn cell type–specific properties, build biologically meaningful vector representations of cell types, and utilize these representations to generate cell type–specific predictions of the effects of noncoding variations in the human genome. Oxford University Press 2023-03-27 /pmc/articles/PMC10041527/ /pubmed/36971292 http://dx.doi.org/10.1093/gigascience/giad015 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Sindeeva, Maria
Chekanov, Nikolay
Avetisian, Manvel
Shashkova, Tatiana I
Baranov, Nikita
Malkin, Elian
Lapin, Alexander
Kardymon, Olga
Fishman, Veniamin
Cell type–specific interpretation of noncoding variants using deep learning–based methods
title Cell type–specific interpretation of noncoding variants using deep learning–based methods
title_full Cell type–specific interpretation of noncoding variants using deep learning–based methods
title_fullStr Cell type–specific interpretation of noncoding variants using deep learning–based methods
title_full_unstemmed Cell type–specific interpretation of noncoding variants using deep learning–based methods
title_short Cell type–specific interpretation of noncoding variants using deep learning–based methods
title_sort cell type–specific interpretation of noncoding variants using deep learning–based methods
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041527/
https://www.ncbi.nlm.nih.gov/pubmed/36971292
http://dx.doi.org/10.1093/gigascience/giad015
work_keys_str_mv AT sindeevamaria celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT chekanovnikolay celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT avetisianmanvel celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT shashkovatatianai celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT baranovnikita celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT malkinelian celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT lapinalexander celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT kardymonolga celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods
AT fishmanveniamin celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods