Cargando…

DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner

More than 90% of the genetic variants identified from genome-wide association studies (GWAS) are located in non-coding regions of the human genome. Here, we present a user-friendly web server, DeepFun (https://bioinfo.uth.edu/deepfun/), to assess the functional activity of non-coding genetic variant...

Descripción completa

Detalles Bibliográficos
Autores principales: Pei, Guangsheng, Hu, Ruifeng, Jia, Peilin, Zhao, Zhongming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8262726/
https://www.ncbi.nlm.nih.gov/pubmed/34048560
http://dx.doi.org/10.1093/nar/gkab429
_version_ 1783719240207958016
author Pei, Guangsheng
Hu, Ruifeng
Jia, Peilin
Zhao, Zhongming
author_facet Pei, Guangsheng
Hu, Ruifeng
Jia, Peilin
Zhao, Zhongming
author_sort Pei, Guangsheng
collection PubMed
description More than 90% of the genetic variants identified from genome-wide association studies (GWAS) are located in non-coding regions of the human genome. Here, we present a user-friendly web server, DeepFun (https://bioinfo.uth.edu/deepfun/), to assess the functional activity of non-coding genetic variants. This new server is built on a convolutional neural network (CNN) framework that has been extensively evaluated. Specifically, we collected chromatin profiles from ENCODE and Roadmap projects to construct the feature space, including 1548 DNase I accessibility, 1536 histone mark, and 4795 transcription factor binding profiles covering 225 tissues or cell types. With such comprehensive epigenomics annotations, DeepFun expands the functionality of existing non-coding variant prioritizing tools to provide a more specific functional assessment on non-coding variants in a tissue- and cell type-specific manner. By using the datasets from various GWAS studies, we conducted independent validations and demonstrated the functions of the DeepFun web server in predicting the effect of a non-coding variant in a specific tissue or cell type, as well as visualizing the potential motifs in the region around variants. We expect our server will be widely used in genetics, functional genomics, and disease studies.
format Online
Article
Text
id pubmed-8262726
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-82627262021-07-08 DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner Pei, Guangsheng Hu, Ruifeng Jia, Peilin Zhao, Zhongming Nucleic Acids Res Web Server Issue More than 90% of the genetic variants identified from genome-wide association studies (GWAS) are located in non-coding regions of the human genome. Here, we present a user-friendly web server, DeepFun (https://bioinfo.uth.edu/deepfun/), to assess the functional activity of non-coding genetic variants. This new server is built on a convolutional neural network (CNN) framework that has been extensively evaluated. Specifically, we collected chromatin profiles from ENCODE and Roadmap projects to construct the feature space, including 1548 DNase I accessibility, 1536 histone mark, and 4795 transcription factor binding profiles covering 225 tissues or cell types. With such comprehensive epigenomics annotations, DeepFun expands the functionality of existing non-coding variant prioritizing tools to provide a more specific functional assessment on non-coding variants in a tissue- and cell type-specific manner. By using the datasets from various GWAS studies, we conducted independent validations and demonstrated the functions of the DeepFun web server in predicting the effect of a non-coding variant in a specific tissue or cell type, as well as visualizing the potential motifs in the region around variants. We expect our server will be widely used in genetics, functional genomics, and disease studies. Oxford University Press 2021-05-28 /pmc/articles/PMC8262726/ /pubmed/34048560 http://dx.doi.org/10.1093/nar/gkab429 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Web Server Issue
Pei, Guangsheng
Hu, Ruifeng
Jia, Peilin
Zhao, Zhongming
DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner
title DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner
title_full DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner
title_fullStr DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner
title_full_unstemmed DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner
title_short DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner
title_sort deepfun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner
topic Web Server Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8262726/
https://www.ncbi.nlm.nih.gov/pubmed/34048560
http://dx.doi.org/10.1093/nar/gkab429
work_keys_str_mv AT peiguangsheng deepfunadeeplearningsequencebasedmodeltodeciphernoncodingvarianteffectinatissueandcelltypespecificmanner
AT huruifeng deepfunadeeplearningsequencebasedmodeltodeciphernoncodingvarianteffectinatissueandcelltypespecificmanner
AT jiapeilin deepfunadeeplearningsequencebasedmodeltodeciphernoncodingvarianteffectinatissueandcelltypespecificmanner
AT zhaozhongming deepfunadeeplearningsequencebasedmodeltodeciphernoncodingvarianteffectinatissueandcelltypespecificmanner