Cargando…

Predicting promoter activities of primary human DNA sequences

We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correla...

Descripción completa

Detalles Bibliográficos
Autores principales: Irie, Takuma, Park, Sung-Joon, Yamashita, Riu, Seki, Masahide, Yada, Tetsushi, Sugano, Sumio, Nakai, Kenta, Suzuki, Yutaka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3113590/
https://www.ncbi.nlm.nih.gov/pubmed/21486745
http://dx.doi.org/10.1093/nar/gkr173
_version_ 1782205950426349568
author Irie, Takuma
Park, Sung-Joon
Yamashita, Riu
Seki, Masahide
Yada, Tetsushi
Sugano, Sumio
Nakai, Kenta
Suzuki, Yutaka
author_facet Irie, Takuma
Park, Sung-Joon
Yamashita, Riu
Seki, Masahide
Yada, Tetsushi
Sugano, Sumio
Nakai, Kenta
Suzuki, Yutaka
author_sort Irie, Takuma
collection PubMed
description We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correlation coefficient of 0.87 between the predicted and observed promoter activities. We evaluated the prediction accuracy of the program using massive sequencing analysis of transcriptional start sites in vivo. We found that it is still difficult to predict transcript levels in a strictly quantitative manner in vivo; however, it was possible to select active promoters in a given cell from the other silent promoters. Using this program, we analyzed the transcriptional landscape of the entire human genome. We demonstrate that many human genomic regions have potential promoter activity, and the expression of some previously uncharacterized putatively non-protein-coding transcripts can be explained by our prediction model. Furthermore, we found that nucleosomes occasionally formed open chromatin structures with RNA polymerase II recruitment where the program predicted significant promoter activities, although no transcripts were observed.
format Online
Article
Text
id pubmed-3113590
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31135902011-06-14 Predicting promoter activities of primary human DNA sequences Irie, Takuma Park, Sung-Joon Yamashita, Riu Seki, Masahide Yada, Tetsushi Sugano, Sumio Nakai, Kenta Suzuki, Yutaka Nucleic Acids Res Methods Online We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correlation coefficient of 0.87 between the predicted and observed promoter activities. We evaluated the prediction accuracy of the program using massive sequencing analysis of transcriptional start sites in vivo. We found that it is still difficult to predict transcript levels in a strictly quantitative manner in vivo; however, it was possible to select active promoters in a given cell from the other silent promoters. Using this program, we analyzed the transcriptional landscape of the entire human genome. We demonstrate that many human genomic regions have potential promoter activity, and the expression of some previously uncharacterized putatively non-protein-coding transcripts can be explained by our prediction model. Furthermore, we found that nucleosomes occasionally formed open chromatin structures with RNA polymerase II recruitment where the program predicted significant promoter activities, although no transcripts were observed. Oxford University Press 2011-06 2011-04-12 /pmc/articles/PMC3113590/ /pubmed/21486745 http://dx.doi.org/10.1093/nar/gkr173 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Irie, Takuma
Park, Sung-Joon
Yamashita, Riu
Seki, Masahide
Yada, Tetsushi
Sugano, Sumio
Nakai, Kenta
Suzuki, Yutaka
Predicting promoter activities of primary human DNA sequences
title Predicting promoter activities of primary human DNA sequences
title_full Predicting promoter activities of primary human DNA sequences
title_fullStr Predicting promoter activities of primary human DNA sequences
title_full_unstemmed Predicting promoter activities of primary human DNA sequences
title_short Predicting promoter activities of primary human DNA sequences
title_sort predicting promoter activities of primary human dna sequences
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3113590/
https://www.ncbi.nlm.nih.gov/pubmed/21486745
http://dx.doi.org/10.1093/nar/gkr173
work_keys_str_mv AT irietakuma predictingpromoteractivitiesofprimaryhumandnasequences
AT parksungjoon predictingpromoteractivitiesofprimaryhumandnasequences
AT yamashitariu predictingpromoteractivitiesofprimaryhumandnasequences
AT sekimasahide predictingpromoteractivitiesofprimaryhumandnasequences
AT yadatetsushi predictingpromoteractivitiesofprimaryhumandnasequences
AT suganosumio predictingpromoteractivitiesofprimaryhumandnasequences
AT nakaikenta predictingpromoteractivitiesofprimaryhumandnasequences
AT suzukiyutaka predictingpromoteractivitiesofprimaryhumandnasequences