Cargando…
Cell type–specific interpretation of noncoding variants using deep learning–based methods
Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041527/ https://www.ncbi.nlm.nih.gov/pubmed/36971292 http://dx.doi.org/10.1093/gigascience/giad015 |
_version_ | 1784912739565568000 |
---|---|
author | Sindeeva, Maria Chekanov, Nikolay Avetisian, Manvel Shashkova, Tatiana I Baranov, Nikita Malkin, Elian Lapin, Alexander Kardymon, Olga Fishman, Veniamin |
author_facet | Sindeeva, Maria Chekanov, Nikolay Avetisian, Manvel Shashkova, Tatiana I Baranov, Nikita Malkin, Elian Lapin, Alexander Kardymon, Olga Fishman, Veniamin |
author_sort | Sindeeva, Maria |
collection | PubMed |
description | Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding mutations. However, these approaches require specific experimental data for training and cannot generalize across cell types where required features were not experimentally measured. We show here that available epigenetic characteristics of human cell types are extremely sparse, limiting those approaches that rely on specific epigenetic input. We propose a new neural network architecture, DeepCT, which can learn complex interconnections of epigenetic features and infer unmeasured data from any available input. Furthermore, we show that DeepCT can learn cell type–specific properties, build biologically meaningful vector representations of cell types, and utilize these representations to generate cell type–specific predictions of the effects of noncoding variations in the human genome. |
format | Online Article Text |
id | pubmed-10041527 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100415272023-03-28 Cell type–specific interpretation of noncoding variants using deep learning–based methods Sindeeva, Maria Chekanov, Nikolay Avetisian, Manvel Shashkova, Tatiana I Baranov, Nikita Malkin, Elian Lapin, Alexander Kardymon, Olga Fishman, Veniamin Gigascience Research Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding mutations. However, these approaches require specific experimental data for training and cannot generalize across cell types where required features were not experimentally measured. We show here that available epigenetic characteristics of human cell types are extremely sparse, limiting those approaches that rely on specific epigenetic input. We propose a new neural network architecture, DeepCT, which can learn complex interconnections of epigenetic features and infer unmeasured data from any available input. Furthermore, we show that DeepCT can learn cell type–specific properties, build biologically meaningful vector representations of cell types, and utilize these representations to generate cell type–specific predictions of the effects of noncoding variations in the human genome. Oxford University Press 2023-03-27 /pmc/articles/PMC10041527/ /pubmed/36971292 http://dx.doi.org/10.1093/gigascience/giad015 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Sindeeva, Maria Chekanov, Nikolay Avetisian, Manvel Shashkova, Tatiana I Baranov, Nikita Malkin, Elian Lapin, Alexander Kardymon, Olga Fishman, Veniamin Cell type–specific interpretation of noncoding variants using deep learning–based methods |
title | Cell type–specific interpretation of noncoding variants using deep learning–based methods |
title_full | Cell type–specific interpretation of noncoding variants using deep learning–based methods |
title_fullStr | Cell type–specific interpretation of noncoding variants using deep learning–based methods |
title_full_unstemmed | Cell type–specific interpretation of noncoding variants using deep learning–based methods |
title_short | Cell type–specific interpretation of noncoding variants using deep learning–based methods |
title_sort | cell type–specific interpretation of noncoding variants using deep learning–based methods |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041527/ https://www.ncbi.nlm.nih.gov/pubmed/36971292 http://dx.doi.org/10.1093/gigascience/giad015 |
work_keys_str_mv | AT sindeevamaria celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT chekanovnikolay celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT avetisianmanvel celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT shashkovatatianai celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT baranovnikita celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT malkinelian celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT lapinalexander celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT kardymonolga celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods AT fishmanveniamin celltypespecificinterpretationofnoncodingvariantsusingdeeplearningbasedmethods |