Cargando…
Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features
In silico prediction of genomic long non-coding RNAs (lncRNAs) is prerequisite to the construction and elucidation of non-coding regulatory network. Chromatin modifications marked by chromatin regulators are important epigenetic features, which can be captured by prevailing high-throughput approache...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3905897/ https://www.ncbi.nlm.nih.gov/pubmed/24038472 http://dx.doi.org/10.1093/nar/gkt818 |
_version_ | 1782301401630638080 |
---|---|
author | Lv, Jie Liu, Hongbo Huang, Zhijun Su, Jianzhong He, Hongjuan Xiu, Youcheng Zhang, Yan Wu, Qiong |
author_facet | Lv, Jie Liu, Hongbo Huang, Zhijun Su, Jianzhong He, Hongjuan Xiu, Youcheng Zhang, Yan Wu, Qiong |
author_sort | Lv, Jie |
collection | PubMed |
description | In silico prediction of genomic long non-coding RNAs (lncRNAs) is prerequisite to the construction and elucidation of non-coding regulatory network. Chromatin modifications marked by chromatin regulators are important epigenetic features, which can be captured by prevailing high-throughput approaches such as ChIP sequencing. We demonstrate that the accuracy of lncRNA predictions can be greatly improved when incorporating high-throughput chromatin modifications over mouse embryonic stem differentiation toward adult Cerebellum by logistic regression with LASSO regularization. The discriminating features include H3K9me3, H3K27ac, H3K4me1, open reading frames and several repeat elements. Importantly, chromatin information is suggested to be complementary to genomic sequence information, highlighting the importance of an integrated model. Applying integrated model, we obtain a list of putative lncRNAs based on uncharacterized fragments from transcriptome assembly. We demonstrate that the putative lncRNAs have regulatory roles in vicinity of known gene loci by expression and Gene Ontology enrichment analysis. We also show that the lncRNA expression specificity can be efficiently modeled by the chromatin data with same developmental stage. The study not only supports the biological hypothesis that chromatin can regulate expression of tissue-specific or developmental stage-specific lncRNAs but also reveals the discriminating features between lncRNA and coding genes, which would guide further lncRNA identifications and characterizations. |
format | Online Article Text |
id | pubmed-3905897 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-39058972014-01-29 Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features Lv, Jie Liu, Hongbo Huang, Zhijun Su, Jianzhong He, Hongjuan Xiu, Youcheng Zhang, Yan Wu, Qiong Nucleic Acids Res Computational Biology In silico prediction of genomic long non-coding RNAs (lncRNAs) is prerequisite to the construction and elucidation of non-coding regulatory network. Chromatin modifications marked by chromatin regulators are important epigenetic features, which can be captured by prevailing high-throughput approaches such as ChIP sequencing. We demonstrate that the accuracy of lncRNA predictions can be greatly improved when incorporating high-throughput chromatin modifications over mouse embryonic stem differentiation toward adult Cerebellum by logistic regression with LASSO regularization. The discriminating features include H3K9me3, H3K27ac, H3K4me1, open reading frames and several repeat elements. Importantly, chromatin information is suggested to be complementary to genomic sequence information, highlighting the importance of an integrated model. Applying integrated model, we obtain a list of putative lncRNAs based on uncharacterized fragments from transcriptome assembly. We demonstrate that the putative lncRNAs have regulatory roles in vicinity of known gene loci by expression and Gene Ontology enrichment analysis. We also show that the lncRNA expression specificity can be efficiently modeled by the chromatin data with same developmental stage. The study not only supports the biological hypothesis that chromatin can regulate expression of tissue-specific or developmental stage-specific lncRNAs but also reveals the discriminating features between lncRNA and coding genes, which would guide further lncRNA identifications and characterizations. Oxford University Press 2013-12 2013-09-13 /pmc/articles/PMC3905897/ /pubmed/24038472 http://dx.doi.org/10.1093/nar/gkt818 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Lv, Jie Liu, Hongbo Huang, Zhijun Su, Jianzhong He, Hongjuan Xiu, Youcheng Zhang, Yan Wu, Qiong Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features |
title | Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features |
title_full | Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features |
title_fullStr | Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features |
title_full_unstemmed | Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features |
title_short | Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features |
title_sort | long non-coding rna identification over mouse brain development by integrative modeling of chromatin and genomic features |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3905897/ https://www.ncbi.nlm.nih.gov/pubmed/24038472 http://dx.doi.org/10.1093/nar/gkt818 |
work_keys_str_mv | AT lvjie longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures AT liuhongbo longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures AT huangzhijun longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures AT sujianzhong longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures AT hehongjuan longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures AT xiuyoucheng longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures AT zhangyan longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures AT wuqiong longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures |