Cargando…
Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning
Genomewide association studies (GWAS) have identified a large number of loci associated with neuropsychiatric traits, however, understanding the molecular mechanisms underlying these loci remains difficult. To help prioritize causal variants and interpret their functions, computational methods have...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9135341/ https://www.ncbi.nlm.nih.gov/pubmed/35576194 http://dx.doi.org/10.1371/journal.pcbi.1010011 |
_version_ | 1784713942507978752 |
---|---|
author | Lai, Boqiao Qian, Sheng Zhang, Hanwei Zhang, Siwei Kozlova, Alena Duan, Jubao Xu, Jinbo He, Xin |
author_facet | Lai, Boqiao Qian, Sheng Zhang, Hanwei Zhang, Siwei Kozlova, Alena Duan, Jubao Xu, Jinbo He, Xin |
author_sort | Lai, Boqiao |
collection | PubMed |
description | Genomewide association studies (GWAS) have identified a large number of loci associated with neuropsychiatric traits, however, understanding the molecular mechanisms underlying these loci remains difficult. To help prioritize causal variants and interpret their functions, computational methods have been developed to predict regulatory effects of non-coding variants. An emerging approach to variant annotation is deep learning models that predict regulatory functions from DNA sequences alone. While such models have been trained on large publicly available dataset such as ENCODE, neuropsychiatric trait-related cell types are under-represented in these datasets, thus there is an urgent need of better tools and resources to annotate variant functions in such cellular contexts. To fill this gap, we collected a large collection of neurodevelopment-related cell/tissue types, and trained deep Convolutional Neural Networks (ResNet) using such data. Furthermore, our model, called MetaChrom, borrows information from public epigenomic consortium to improve the accuracy via transfer learning. We show that MetaChrom is substantially better in predicting experimentally determined chromatin accessibility variants than popular variant annotation tools such as CADD and delta-SVM. By combining GWAS data with MetaChrom predictions, we prioritized 31 SNPs for Schizophrenia, suggesting potential risk genes and the biological contexts where they act. In summary, MetaChrom provides functional annotations of any DNA variants in the neuro-development context and the general method of MetaChrom can also be extended to other disease-related cell or tissue types. |
format | Online Article Text |
id | pubmed-9135341 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-91353412022-05-27 Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning Lai, Boqiao Qian, Sheng Zhang, Hanwei Zhang, Siwei Kozlova, Alena Duan, Jubao Xu, Jinbo He, Xin PLoS Comput Biol Research Article Genomewide association studies (GWAS) have identified a large number of loci associated with neuropsychiatric traits, however, understanding the molecular mechanisms underlying these loci remains difficult. To help prioritize causal variants and interpret their functions, computational methods have been developed to predict regulatory effects of non-coding variants. An emerging approach to variant annotation is deep learning models that predict regulatory functions from DNA sequences alone. While such models have been trained on large publicly available dataset such as ENCODE, neuropsychiatric trait-related cell types are under-represented in these datasets, thus there is an urgent need of better tools and resources to annotate variant functions in such cellular contexts. To fill this gap, we collected a large collection of neurodevelopment-related cell/tissue types, and trained deep Convolutional Neural Networks (ResNet) using such data. Furthermore, our model, called MetaChrom, borrows information from public epigenomic consortium to improve the accuracy via transfer learning. We show that MetaChrom is substantially better in predicting experimentally determined chromatin accessibility variants than popular variant annotation tools such as CADD and delta-SVM. By combining GWAS data with MetaChrom predictions, we prioritized 31 SNPs for Schizophrenia, suggesting potential risk genes and the biological contexts where they act. In summary, MetaChrom provides functional annotations of any DNA variants in the neuro-development context and the general method of MetaChrom can also be extended to other disease-related cell or tissue types. Public Library of Science 2022-05-16 /pmc/articles/PMC9135341/ /pubmed/35576194 http://dx.doi.org/10.1371/journal.pcbi.1010011 Text en © 2022 Lai et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lai, Boqiao Qian, Sheng Zhang, Hanwei Zhang, Siwei Kozlova, Alena Duan, Jubao Xu, Jinbo He, Xin Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning |
title | Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning |
title_full | Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning |
title_fullStr | Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning |
title_full_unstemmed | Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning |
title_short | Annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning |
title_sort | annotating functional effects of non-coding variants in neuropsychiatric cell types by deep transfer learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9135341/ https://www.ncbi.nlm.nih.gov/pubmed/35576194 http://dx.doi.org/10.1371/journal.pcbi.1010011 |
work_keys_str_mv | AT laiboqiao annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning AT qiansheng annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning AT zhanghanwei annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning AT zhangsiwei annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning AT kozlovaalena annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning AT duanjubao annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning AT xujinbo annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning AT hexin annotatingfunctionaleffectsofnoncodingvariantsinneuropsychiatriccelltypesbydeeptransferlearning |