Cargando…
Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847699/ https://www.ncbi.nlm.nih.gov/pubmed/17411336 http://dx.doi.org/10.1371/journal.pcbi.0030063 |
_version_ | 1782132914099585024 |
---|---|
author | Chen, Yili Blackwell, Thomas W Chen, Ji Gao, Jing Lee, Angel W States, David J |
author_facet | Chen, Yili Blackwell, Thomas W Chen, Ji Gao, Jing Lee, Angel W States, David J |
author_sort | Chen, Yili |
collection | PubMed |
description | The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c-MYC recognition sites can be identified in c-MYC responsive genes, other motif matches—even experimentally confirmed sites—are associated with genes showing no c-MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c-MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c-MYC targets. We assess the performance of our prediction results by comparing them with the c-myc targets identified in the biomedical literature. In total, we predict 460 likely c-MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c-MYC targets and suggests many novel sites. Our findings suggest that to identify c-MYC genomic targets, integration of different data sources helps to improve the accuracy. |
format | Text |
id | pubmed-1847699 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-18476992007-04-06 Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function Chen, Yili Blackwell, Thomas W Chen, Ji Gao, Jing Lee, Angel W States, David J PLoS Comput Biol Research Article The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c-MYC recognition sites can be identified in c-MYC responsive genes, other motif matches—even experimentally confirmed sites—are associated with genes showing no c-MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c-MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c-MYC targets. We assess the performance of our prediction results by comparing them with the c-myc targets identified in the biomedical literature. In total, we predict 460 likely c-MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c-MYC targets and suggests many novel sites. Our findings suggest that to identify c-MYC genomic targets, integration of different data sources helps to improve the accuracy. Public Library of Science 2007-04 2007-04-06 /pmc/articles/PMC1847699/ /pubmed/17411336 http://dx.doi.org/10.1371/journal.pcbi.0030063 Text en © 2007 Chen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Chen, Yili Blackwell, Thomas W Chen, Ji Gao, Jing Lee, Angel W States, David J Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function |
title | Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function |
title_full | Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function |
title_fullStr | Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function |
title_full_unstemmed | Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function |
title_short | Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function |
title_sort | integration of genome and chromatin structure with gene expression profiles to predict c-myc recognition site binding and function |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847699/ https://www.ncbi.nlm.nih.gov/pubmed/17411336 http://dx.doi.org/10.1371/journal.pcbi.0030063 |
work_keys_str_mv | AT chenyili integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction AT blackwellthomasw integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction AT chenji integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction AT gaojing integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction AT leeangelw integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction AT statesdavidj integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction |