Cargando…
Predicting promoter activities of primary human DNA sequences
We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correla...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3113590/ https://www.ncbi.nlm.nih.gov/pubmed/21486745 http://dx.doi.org/10.1093/nar/gkr173 |
_version_ | 1782205950426349568 |
---|---|
author | Irie, Takuma Park, Sung-Joon Yamashita, Riu Seki, Masahide Yada, Tetsushi Sugano, Sumio Nakai, Kenta Suzuki, Yutaka |
author_facet | Irie, Takuma Park, Sung-Joon Yamashita, Riu Seki, Masahide Yada, Tetsushi Sugano, Sumio Nakai, Kenta Suzuki, Yutaka |
author_sort | Irie, Takuma |
collection | PubMed |
description | We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correlation coefficient of 0.87 between the predicted and observed promoter activities. We evaluated the prediction accuracy of the program using massive sequencing analysis of transcriptional start sites in vivo. We found that it is still difficult to predict transcript levels in a strictly quantitative manner in vivo; however, it was possible to select active promoters in a given cell from the other silent promoters. Using this program, we analyzed the transcriptional landscape of the entire human genome. We demonstrate that many human genomic regions have potential promoter activity, and the expression of some previously uncharacterized putatively non-protein-coding transcripts can be explained by our prediction model. Furthermore, we found that nucleosomes occasionally formed open chromatin structures with RNA polymerase II recruitment where the program predicted significant promoter activities, although no transcripts were observed. |
format | Online Article Text |
id | pubmed-3113590 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31135902011-06-14 Predicting promoter activities of primary human DNA sequences Irie, Takuma Park, Sung-Joon Yamashita, Riu Seki, Masahide Yada, Tetsushi Sugano, Sumio Nakai, Kenta Suzuki, Yutaka Nucleic Acids Res Methods Online We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correlation coefficient of 0.87 between the predicted and observed promoter activities. We evaluated the prediction accuracy of the program using massive sequencing analysis of transcriptional start sites in vivo. We found that it is still difficult to predict transcript levels in a strictly quantitative manner in vivo; however, it was possible to select active promoters in a given cell from the other silent promoters. Using this program, we analyzed the transcriptional landscape of the entire human genome. We demonstrate that many human genomic regions have potential promoter activity, and the expression of some previously uncharacterized putatively non-protein-coding transcripts can be explained by our prediction model. Furthermore, we found that nucleosomes occasionally formed open chromatin structures with RNA polymerase II recruitment where the program predicted significant promoter activities, although no transcripts were observed. Oxford University Press 2011-06 2011-04-12 /pmc/articles/PMC3113590/ /pubmed/21486745 http://dx.doi.org/10.1093/nar/gkr173 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Irie, Takuma Park, Sung-Joon Yamashita, Riu Seki, Masahide Yada, Tetsushi Sugano, Sumio Nakai, Kenta Suzuki, Yutaka Predicting promoter activities of primary human DNA sequences |
title | Predicting promoter activities of primary human DNA sequences |
title_full | Predicting promoter activities of primary human DNA sequences |
title_fullStr | Predicting promoter activities of primary human DNA sequences |
title_full_unstemmed | Predicting promoter activities of primary human DNA sequences |
title_short | Predicting promoter activities of primary human DNA sequences |
title_sort | predicting promoter activities of primary human dna sequences |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3113590/ https://www.ncbi.nlm.nih.gov/pubmed/21486745 http://dx.doi.org/10.1093/nar/gkr173 |
work_keys_str_mv | AT irietakuma predictingpromoteractivitiesofprimaryhumandnasequences AT parksungjoon predictingpromoteractivitiesofprimaryhumandnasequences AT yamashitariu predictingpromoteractivitiesofprimaryhumandnasequences AT sekimasahide predictingpromoteractivitiesofprimaryhumandnasequences AT yadatetsushi predictingpromoteractivitiesofprimaryhumandnasequences AT suganosumio predictingpromoteractivitiesofprimaryhumandnasequences AT nakaikenta predictingpromoteractivitiesofprimaryhumandnasequences AT suzukiyutaka predictingpromoteractivitiesofprimaryhumandnasequences |