Cargando…

A statistical framework for predicting critical regions of p53-dependent enhancers

P53 is the ‘guardian of the genome’ and is responsible for regulating cell cycle and apoptosis. The genomic p53 binding regions, where activating transcriptional factors and cofactors like p300 simultaneously bind, are called ‘p53-dependent enhancers’, which play an important role in tumorigenesis....

Descripción completa

Detalles Bibliográficos
Autores principales: Niu, Xiaohui, Deng, Kaixuan, Liu, Lifen, Yang, Kun, Hu, Xuehai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138796/
https://www.ncbi.nlm.nih.gov/pubmed/32392580
http://dx.doi.org/10.1093/bib/bbaa053
_version_ 1783695879449870336
author Niu, Xiaohui
Deng, Kaixuan
Liu, Lifen
Yang, Kun
Hu, Xuehai
author_facet Niu, Xiaohui
Deng, Kaixuan
Liu, Lifen
Yang, Kun
Hu, Xuehai
author_sort Niu, Xiaohui
collection PubMed
description P53 is the ‘guardian of the genome’ and is responsible for regulating cell cycle and apoptosis. The genomic p53 binding regions, where activating transcriptional factors and cofactors like p300 simultaneously bind, are called ‘p53-dependent enhancers’, which play an important role in tumorigenesis. Current experimental assays generally provide a broad peak of each enhancer element, leaving our knowledge about critical enhancer regions (CERs) limited. Under the inspiration of enhancer dissection by CRISPR-Cas9 screen library on genome-wide p53 binding sites, here we introduce a statistical framework called ‘Computational CRISPR Strategy’ (CCS), to predict whether a given DNA fragment will be a p53-dependent CER by employing 7-mer as feature extractions along with random forest as the regressor. When training on a p53 CRISPR enhancer dataset, CCS not only accurately fitted the top-ranked enriched single guide RNAs (sgRNAs) but also successfully reproduced two known CERs that were validated by experiments. When applying it to an independent testing dataset on a tilling of a 2K-b genomic region of CRISPR-deCDKN1A-Lib, the trained model shows great generalizability by identifying a CER containing five top-ranked sgRNAs. A feature importance analysis further indicates that top-ranked 7-mers are mapped onto informative TF motifs including POU5F1 and SOX5, which are differentially enriched in p53-dependent CERs and are potential factors to make a general p53 binding site to form a p53-dependent CER, providing the interpretability of the trained model. Our results demonstrate that CCS is an alternative way of the CRISPR experiment to screen the genome for mapping p53-dependent CERs.
format Online
Article
Text
id pubmed-8138796
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81387962021-05-25 A statistical framework for predicting critical regions of p53-dependent enhancers Niu, Xiaohui Deng, Kaixuan Liu, Lifen Yang, Kun Hu, Xuehai Brief Bioinform Problem Solving Protocol P53 is the ‘guardian of the genome’ and is responsible for regulating cell cycle and apoptosis. The genomic p53 binding regions, where activating transcriptional factors and cofactors like p300 simultaneously bind, are called ‘p53-dependent enhancers’, which play an important role in tumorigenesis. Current experimental assays generally provide a broad peak of each enhancer element, leaving our knowledge about critical enhancer regions (CERs) limited. Under the inspiration of enhancer dissection by CRISPR-Cas9 screen library on genome-wide p53 binding sites, here we introduce a statistical framework called ‘Computational CRISPR Strategy’ (CCS), to predict whether a given DNA fragment will be a p53-dependent CER by employing 7-mer as feature extractions along with random forest as the regressor. When training on a p53 CRISPR enhancer dataset, CCS not only accurately fitted the top-ranked enriched single guide RNAs (sgRNAs) but also successfully reproduced two known CERs that were validated by experiments. When applying it to an independent testing dataset on a tilling of a 2K-b genomic region of CRISPR-deCDKN1A-Lib, the trained model shows great generalizability by identifying a CER containing five top-ranked sgRNAs. A feature importance analysis further indicates that top-ranked 7-mers are mapped onto informative TF motifs including POU5F1 and SOX5, which are differentially enriched in p53-dependent CERs and are potential factors to make a general p53 binding site to form a p53-dependent CER, providing the interpretability of the trained model. Our results demonstrate that CCS is an alternative way of the CRISPR experiment to screen the genome for mapping p53-dependent CERs. Oxford University Press 2020-05-11 /pmc/articles/PMC8138796/ /pubmed/32392580 http://dx.doi.org/10.1093/bib/bbaa053 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Niu, Xiaohui
Deng, Kaixuan
Liu, Lifen
Yang, Kun
Hu, Xuehai
A statistical framework for predicting critical regions of p53-dependent enhancers
title A statistical framework for predicting critical regions of p53-dependent enhancers
title_full A statistical framework for predicting critical regions of p53-dependent enhancers
title_fullStr A statistical framework for predicting critical regions of p53-dependent enhancers
title_full_unstemmed A statistical framework for predicting critical regions of p53-dependent enhancers
title_short A statistical framework for predicting critical regions of p53-dependent enhancers
title_sort statistical framework for predicting critical regions of p53-dependent enhancers
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138796/
https://www.ncbi.nlm.nih.gov/pubmed/32392580
http://dx.doi.org/10.1093/bib/bbaa053
work_keys_str_mv AT niuxiaohui astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT dengkaixuan astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT liulifen astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT yangkun astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT huxuehai astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT niuxiaohui statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT dengkaixuan statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT liulifen statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT yangkun statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers
AT huxuehai statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers