Cargando…

Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning

Genomewide association studies (GWAS) have identified a large number of loci associated with neuropsychiatric traits, however, understanding the molecular mechanisms underlying these loci remains difficult. To help prioritize causal variants and interpret their functions, computational methods have...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, Boqiao, Qian, Sheng, Zhang, Hanwei, Zhang, Siwei, Kozlova, Alena, Duan, Jubao, Xu, Jinbo, He, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9135341/
https://www.ncbi.nlm.nih.gov/pubmed/35576194
http://dx.doi.org/10.1371/journal.pcbi.1010011
_version_ 1784713942507978752
author Lai, Boqiao
Qian, Sheng
Zhang, Hanwei
Zhang, Siwei
Kozlova, Alena
Duan, Jubao
Xu, Jinbo
He, Xin
author_facet Lai, Boqiao
Qian, Sheng
Zhang, Hanwei
Zhang, Siwei
Kozlova, Alena
Duan, Jubao
Xu, Jinbo
He, Xin
author_sort Lai, Boqiao
collection PubMed
description Genomewide association studies (GWAS) have identified a large number of loci associated with neuropsychiatric traits, however, understanding the molecular mechanisms underlying these loci remains difficult. To help prioritize causal variants and interpret their functions, computational methods have been developed to predict regulatory effects of non-coding variants. An emerging approach to variant annotation is deep learning models that predict regulatory functions from DNA sequences alone. While such models have been trained on large publicly available dataset such as ENCODE, neuropsychiatric trait-related cell types are under-represented in these datasets, thus there is an urgent need of better tools and resources to annotate variant functions in such cellular contexts. To fill this gap, we collected a large collection of neurodevelopment-related cell/tissue types, and trained deep Convolutional Neural Networks (ResNet) using such data. Furthermore, our model, called MetaChrom, borrows information from public epigenomic consortium to improve the accuracy via transfer learning. We show that MetaChrom is substantially better in predicting experimentally determined chromatin accessibility variants than popular variant annotation tools such as CADD and delta-SVM. By combining GWAS data with MetaChrom predictions, we prioritized 31 SNPs for Schizophrenia, suggesting potential risk genes and the biological contexts where they act. In summary, MetaChrom provides functional annotations of any DNA variants in the neuro-development context and the general method of MetaChrom can also be extended to other disease-related cell or tissue types.
format Online
Article
Text
id pubmed-9135341
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-91353412022-05-27 Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning Lai, Boqiao Qian, Sheng Zhang, Hanwei Zhang, Siwei Kozlova, Alena Duan, Jubao Xu, Jinbo He, Xin PLoS Comput Biol Research Article Genomewide association studies (GWAS) have identified a large number of loci associated with neuropsychiatric traits, however, understanding the molecular mechanisms underlying these loci remains difficult. To help prioritize causal variants and interpret their functions, computational methods have been developed to predict regulatory effects of non-coding variants. An emerging approach to variant annotation is deep learning models that predict regulatory functions from DNA sequences alone. While such models have been trained on large publicly available dataset such as ENCODE, neuropsychiatric trait-related cell types are under-represented in these datasets, thus there is an urgent need of better tools and resources to annotate variant functions in such cellular contexts. To fill this gap, we collected a large collection of neurodevelopment-related cell/tissue types, and trained deep Convolutional Neural Networks (ResNet) using such data. Furthermore, our model, called MetaChrom, borrows information from public epigenomic consortium to improve the accuracy via transfer learning. We show that MetaChrom is substantially better in predicting experimentally determined chromatin accessibility variants than popular variant annotation tools such as CADD and delta-SVM. By combining GWAS data with MetaChrom predictions, we prioritized 31 SNPs for Schizophrenia, suggesting potential risk genes and the biological contexts where they act. In summary, MetaChrom provides functional annotations of any DNA variants in the neuro-development context and the general method of MetaChrom can also be extended to other disease-related cell or tissue types. Public Library of Science 2022-05-16 /pmc/articles/PMC9135341/ /pubmed/35576194 http://dx.doi.org/10.1371/journal.pcbi.1010011 Text en © 2022 Lai et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lai, Boqiao
Qian, Sheng
Zhang, Hanwei
Zhang, Siwei
Kozlova, Alena
Duan, Jubao
Xu, Jinbo
He, Xin
Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
title Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
title_full Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
title_fullStr Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
title_full_unstemmed Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
title_short Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
title_sort annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9135341/
https://www.ncbi.nlm.nih.gov/pubmed/35576194
http://dx.doi.org/10.1371/journal.pcbi.1010011
work_keys_str_mv AT laiboqiao annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning
AT qiansheng annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning
AT zhanghanwei annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning
AT zhangsiwei annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning
AT kozlovaalena annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning
AT duanjubao annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning
AT xujinbo annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning
AT hexin annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning