Cargando…

Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data

MicroRNAs (miRNAs) are critical small non-coding RNAs that regulate gene expression by hybridizing to the 3′-untranslated regions (3′-UTR) of target mRNAs, subsequently controlling diverse biological processes at post-transcriptional level. How miRNA genes are regulated receives considerable attenti...

Descripción completa

Detalles Bibliográficos
Autores principales: Chien, Chia-Hung, Sun, Yi-Ming, Chang, Wen-Chi, Chiang-Hsieh, Pei-Yun, Lee, Tzong-Yi, Tsai, Wei-Chih, Horng, Jorng-Tzong, Tsou, Ann-Ping, Huang, Hsien-Da
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
RNA
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3241639/
https://www.ncbi.nlm.nih.gov/pubmed/21821656
http://dx.doi.org/10.1093/nar/gkr604
_version_ 1782219540994719744
author Chien, Chia-Hung
Sun, Yi-Ming
Chang, Wen-Chi
Chiang-Hsieh, Pei-Yun
Lee, Tzong-Yi
Tsai, Wei-Chih
Horng, Jorng-Tzong
Tsou, Ann-Ping
Huang, Hsien-Da
author_facet Chien, Chia-Hung
Sun, Yi-Ming
Chang, Wen-Chi
Chiang-Hsieh, Pei-Yun
Lee, Tzong-Yi
Tsai, Wei-Chih
Horng, Jorng-Tzong
Tsou, Ann-Ping
Huang, Hsien-Da
author_sort Chien, Chia-Hung
collection PubMed
description MicroRNAs (miRNAs) are critical small non-coding RNAs that regulate gene expression by hybridizing to the 3′-untranslated regions (3′-UTR) of target mRNAs, subsequently controlling diverse biological processes at post-transcriptional level. How miRNA genes are regulated receives considerable attention because it directly affects miRNA-mediated gene regulatory networks. Although numerous prediction models were developed for identifying miRNA promoters or transcriptional start sites (TSSs), most of them lack experimental validation and are inadequate to elucidate relationships between miRNA genes and transcription factors (TFs). Here, we integrate three experimental datasets, including cap analysis of gene expression (CAGE) tags, TSS Seq libraries and H3K4me3 chromatin signature derived from high-throughput sequencing analysis of gene initiation, to provide direct evidence of miRNA TSSs, thus establishing an experimental-based resource of human miRNA TSSs, named miRStart. Moreover, a machine-learning-based Support Vector Machine (SVM) model is developed to systematically identify representative TSSs for each miRNA gene. Finally, to demonstrate the effectiveness of the proposed resource, an important human intergenic miRNA, hsa-miR-122, is selected to experimentally validate putative TSS owing to its high expression in a normal liver. In conclusion, this work successfully identified 847 human miRNA TSSs (292 of them are clustered to 70 TSSs of miRNA clusters) based on the utilization of high-throughput sequencing data from TSS-relevant experiments, and establish a valuable resource for biologists in advanced research in miRNA-mediated regulatory networks.
format Online
Article
Text
id pubmed-3241639
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-32416392011-12-19 Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data Chien, Chia-Hung Sun, Yi-Ming Chang, Wen-Chi Chiang-Hsieh, Pei-Yun Lee, Tzong-Yi Tsai, Wei-Chih Horng, Jorng-Tzong Tsou, Ann-Ping Huang, Hsien-Da Nucleic Acids Res RNA MicroRNAs (miRNAs) are critical small non-coding RNAs that regulate gene expression by hybridizing to the 3′-untranslated regions (3′-UTR) of target mRNAs, subsequently controlling diverse biological processes at post-transcriptional level. How miRNA genes are regulated receives considerable attention because it directly affects miRNA-mediated gene regulatory networks. Although numerous prediction models were developed for identifying miRNA promoters or transcriptional start sites (TSSs), most of them lack experimental validation and are inadequate to elucidate relationships between miRNA genes and transcription factors (TFs). Here, we integrate three experimental datasets, including cap analysis of gene expression (CAGE) tags, TSS Seq libraries and H3K4me3 chromatin signature derived from high-throughput sequencing analysis of gene initiation, to provide direct evidence of miRNA TSSs, thus establishing an experimental-based resource of human miRNA TSSs, named miRStart. Moreover, a machine-learning-based Support Vector Machine (SVM) model is developed to systematically identify representative TSSs for each miRNA gene. Finally, to demonstrate the effectiveness of the proposed resource, an important human intergenic miRNA, hsa-miR-122, is selected to experimentally validate putative TSS owing to its high expression in a normal liver. In conclusion, this work successfully identified 847 human miRNA TSSs (292 of them are clustered to 70 TSSs of miRNA clusters) based on the utilization of high-throughput sequencing data from TSS-relevant experiments, and establish a valuable resource for biologists in advanced research in miRNA-mediated regulatory networks. Oxford University Press 2011-11 2011-08-05 /pmc/articles/PMC3241639/ /pubmed/21821656 http://dx.doi.org/10.1093/nar/gkr604 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle RNA
Chien, Chia-Hung
Sun, Yi-Ming
Chang, Wen-Chi
Chiang-Hsieh, Pei-Yun
Lee, Tzong-Yi
Tsai, Wei-Chih
Horng, Jorng-Tzong
Tsou, Ann-Ping
Huang, Hsien-Da
Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data
title Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data
title_full Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data
title_fullStr Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data
title_full_unstemmed Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data
title_short Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data
title_sort identifying transcriptional start sites of human micrornas based on high-throughput sequencing data
topic RNA
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3241639/
https://www.ncbi.nlm.nih.gov/pubmed/21821656
http://dx.doi.org/10.1093/nar/gkr604
work_keys_str_mv AT chienchiahung identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT sunyiming identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT changwenchi identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT chianghsiehpeiyun identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT leetzongyi identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT tsaiweichih identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT horngjorngtzong identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT tsouannping identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata
AT huanghsienda identifyingtranscriptionalstartsitesofhumanmicrornasbasedonhighthroughputsequencingdata