Cargando…

Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function

The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yili, Blackwell, Thomas W, Chen, Ji, Gao, Jing, Lee, Angel W, States, David J
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847699/
https://www.ncbi.nlm.nih.gov/pubmed/17411336
http://dx.doi.org/10.1371/journal.pcbi.0030063
_version_ 1782132914099585024
author Chen, Yili
Blackwell, Thomas W
Chen, Ji
Gao, Jing
Lee, Angel W
States, David J
author_facet Chen, Yili
Blackwell, Thomas W
Chen, Ji
Gao, Jing
Lee, Angel W
States, David J
author_sort Chen, Yili
collection PubMed
description The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c-MYC recognition sites can be identified in c-MYC responsive genes, other motif matches—even experimentally confirmed sites—are associated with genes showing no c-MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c-MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c-MYC targets. We assess the performance of our prediction results by comparing them with the c-myc targets identified in the biomedical literature. In total, we predict 460 likely c-MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c-MYC targets and suggests many novel sites. Our findings suggest that to identify c-MYC genomic targets, integration of different data sources helps to improve the accuracy.
format Text
id pubmed-1847699
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-18476992007-04-06 Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function Chen, Yili Blackwell, Thomas W Chen, Ji Gao, Jing Lee, Angel W States, David J PLoS Comput Biol Research Article The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c-MYC recognition sites can be identified in c-MYC responsive genes, other motif matches—even experimentally confirmed sites—are associated with genes showing no c-MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c-MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c-MYC targets. We assess the performance of our prediction results by comparing them with the c-myc targets identified in the biomedical literature. In total, we predict 460 likely c-MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c-MYC targets and suggests many novel sites. Our findings suggest that to identify c-MYC genomic targets, integration of different data sources helps to improve the accuracy. Public Library of Science 2007-04 2007-04-06 /pmc/articles/PMC1847699/ /pubmed/17411336 http://dx.doi.org/10.1371/journal.pcbi.0030063 Text en © 2007 Chen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chen, Yili
Blackwell, Thomas W
Chen, Ji
Gao, Jing
Lee, Angel W
States, David J
Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
title Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
title_full Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
title_fullStr Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
title_full_unstemmed Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
title_short Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
title_sort integration of genome and chromatin structure with gene expression profiles to predict c-myc recognition site binding and function
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847699/
https://www.ncbi.nlm.nih.gov/pubmed/17411336
http://dx.doi.org/10.1371/journal.pcbi.0030063
work_keys_str_mv AT chenyili integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction
AT blackwellthomasw integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction
AT chenji integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction
AT gaojing integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction
AT leeangelw integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction
AT statesdavidj integrationofgenomeandchromatinstructurewithgeneexpressionprofilestopredictcmycrecognitionsitebindingandfunction