Cargando…
A statistical framework for predicting critical regions of p53-dependent enhancers
P53 is the ‘guardian of the genome’ and is responsible for regulating cell cycle and apoptosis. The genomic p53 binding regions, where activating transcriptional factors and cofactors like p300 simultaneously bind, are called ‘p53-dependent enhancers’, which play an important role in tumorigenesis....
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138796/ https://www.ncbi.nlm.nih.gov/pubmed/32392580 http://dx.doi.org/10.1093/bib/bbaa053 |
_version_ | 1783695879449870336 |
---|---|
author | Niu, Xiaohui Deng, Kaixuan Liu, Lifen Yang, Kun Hu, Xuehai |
author_facet | Niu, Xiaohui Deng, Kaixuan Liu, Lifen Yang, Kun Hu, Xuehai |
author_sort | Niu, Xiaohui |
collection | PubMed |
description | P53 is the ‘guardian of the genome’ and is responsible for regulating cell cycle and apoptosis. The genomic p53 binding regions, where activating transcriptional factors and cofactors like p300 simultaneously bind, are called ‘p53-dependent enhancers’, which play an important role in tumorigenesis. Current experimental assays generally provide a broad peak of each enhancer element, leaving our knowledge about critical enhancer regions (CERs) limited. Under the inspiration of enhancer dissection by CRISPR-Cas9 screen library on genome-wide p53 binding sites, here we introduce a statistical framework called ‘Computational CRISPR Strategy’ (CCS), to predict whether a given DNA fragment will be a p53-dependent CER by employing 7-mer as feature extractions along with random forest as the regressor. When training on a p53 CRISPR enhancer dataset, CCS not only accurately fitted the top-ranked enriched single guide RNAs (sgRNAs) but also successfully reproduced two known CERs that were validated by experiments. When applying it to an independent testing dataset on a tilling of a 2K-b genomic region of CRISPR-deCDKN1A-Lib, the trained model shows great generalizability by identifying a CER containing five top-ranked sgRNAs. A feature importance analysis further indicates that top-ranked 7-mers are mapped onto informative TF motifs including POU5F1 and SOX5, which are differentially enriched in p53-dependent CERs and are potential factors to make a general p53 binding site to form a p53-dependent CER, providing the interpretability of the trained model. Our results demonstrate that CCS is an alternative way of the CRISPR experiment to screen the genome for mapping p53-dependent CERs. |
format | Online Article Text |
id | pubmed-8138796 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-81387962021-05-25 A statistical framework for predicting critical regions of p53-dependent enhancers Niu, Xiaohui Deng, Kaixuan Liu, Lifen Yang, Kun Hu, Xuehai Brief Bioinform Problem Solving Protocol P53 is the ‘guardian of the genome’ and is responsible for regulating cell cycle and apoptosis. The genomic p53 binding regions, where activating transcriptional factors and cofactors like p300 simultaneously bind, are called ‘p53-dependent enhancers’, which play an important role in tumorigenesis. Current experimental assays generally provide a broad peak of each enhancer element, leaving our knowledge about critical enhancer regions (CERs) limited. Under the inspiration of enhancer dissection by CRISPR-Cas9 screen library on genome-wide p53 binding sites, here we introduce a statistical framework called ‘Computational CRISPR Strategy’ (CCS), to predict whether a given DNA fragment will be a p53-dependent CER by employing 7-mer as feature extractions along with random forest as the regressor. When training on a p53 CRISPR enhancer dataset, CCS not only accurately fitted the top-ranked enriched single guide RNAs (sgRNAs) but also successfully reproduced two known CERs that were validated by experiments. When applying it to an independent testing dataset on a tilling of a 2K-b genomic region of CRISPR-deCDKN1A-Lib, the trained model shows great generalizability by identifying a CER containing five top-ranked sgRNAs. A feature importance analysis further indicates that top-ranked 7-mers are mapped onto informative TF motifs including POU5F1 and SOX5, which are differentially enriched in p53-dependent CERs and are potential factors to make a general p53 binding site to form a p53-dependent CER, providing the interpretability of the trained model. Our results demonstrate that CCS is an alternative way of the CRISPR experiment to screen the genome for mapping p53-dependent CERs. Oxford University Press 2020-05-11 /pmc/articles/PMC8138796/ /pubmed/32392580 http://dx.doi.org/10.1093/bib/bbaa053 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Problem Solving Protocol Niu, Xiaohui Deng, Kaixuan Liu, Lifen Yang, Kun Hu, Xuehai A statistical framework for predicting critical regions of p53-dependent enhancers |
title | A statistical framework for predicting critical regions of p53-dependent enhancers |
title_full | A statistical framework for predicting critical regions of p53-dependent enhancers |
title_fullStr | A statistical framework for predicting critical regions of p53-dependent enhancers |
title_full_unstemmed | A statistical framework for predicting critical regions of p53-dependent enhancers |
title_short | A statistical framework for predicting critical regions of p53-dependent enhancers |
title_sort | statistical framework for predicting critical regions of p53-dependent enhancers |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138796/ https://www.ncbi.nlm.nih.gov/pubmed/32392580 http://dx.doi.org/10.1093/bib/bbaa053 |
work_keys_str_mv | AT niuxiaohui astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT dengkaixuan astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT liulifen astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT yangkun astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT huxuehai astatisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT niuxiaohui statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT dengkaixuan statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT liulifen statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT yangkun statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers AT huxuehai statisticalframeworkforpredictingcriticalregionsofp53dependentenhancers |