Cargando…

Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features

In silico prediction of genomic long non-coding RNAs (lncRNAs) is prerequisite to the construction and elucidation of non-coding regulatory network. Chromatin modifications marked by chromatin regulators are important epigenetic features, which can be captured by prevailing high-throughput approache...

Descripción completa

Detalles Bibliográficos
Autores principales: Lv, Jie, Liu, Hongbo, Huang, Zhijun, Su, Jianzhong, He, Hongjuan, Xiu, Youcheng, Zhang, Yan, Wu, Qiong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3905897/
https://www.ncbi.nlm.nih.gov/pubmed/24038472
http://dx.doi.org/10.1093/nar/gkt818
_version_ 1782301401630638080
author Lv, Jie
Liu, Hongbo
Huang, Zhijun
Su, Jianzhong
He, Hongjuan
Xiu, Youcheng
Zhang, Yan
Wu, Qiong
author_facet Lv, Jie
Liu, Hongbo
Huang, Zhijun
Su, Jianzhong
He, Hongjuan
Xiu, Youcheng
Zhang, Yan
Wu, Qiong
author_sort Lv, Jie
collection PubMed
description In silico prediction of genomic long non-coding RNAs (lncRNAs) is prerequisite to the construction and elucidation of non-coding regulatory network. Chromatin modifications marked by chromatin regulators are important epigenetic features, which can be captured by prevailing high-throughput approaches such as ChIP sequencing. We demonstrate that the accuracy of lncRNA predictions can be greatly improved when incorporating high-throughput chromatin modifications over mouse embryonic stem differentiation toward adult Cerebellum by logistic regression with LASSO regularization. The discriminating features include H3K9me3, H3K27ac, H3K4me1, open reading frames and several repeat elements. Importantly, chromatin information is suggested to be complementary to genomic sequence information, highlighting the importance of an integrated model. Applying integrated model, we obtain a list of putative lncRNAs based on uncharacterized fragments from transcriptome assembly. We demonstrate that the putative lncRNAs have regulatory roles in vicinity of known gene loci by expression and Gene Ontology enrichment analysis. We also show that the lncRNA expression specificity can be efficiently modeled by the chromatin data with same developmental stage. The study not only supports the biological hypothesis that chromatin can regulate expression of tissue-specific or developmental stage-specific lncRNAs but also reveals the discriminating features between lncRNA and coding genes, which would guide further lncRNA identifications and characterizations.
format Online
Article
Text
id pubmed-3905897
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39058972014-01-29 Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features Lv, Jie Liu, Hongbo Huang, Zhijun Su, Jianzhong He, Hongjuan Xiu, Youcheng Zhang, Yan Wu, Qiong Nucleic Acids Res Computational Biology In silico prediction of genomic long non-coding RNAs (lncRNAs) is prerequisite to the construction and elucidation of non-coding regulatory network. Chromatin modifications marked by chromatin regulators are important epigenetic features, which can be captured by prevailing high-throughput approaches such as ChIP sequencing. We demonstrate that the accuracy of lncRNA predictions can be greatly improved when incorporating high-throughput chromatin modifications over mouse embryonic stem differentiation toward adult Cerebellum by logistic regression with LASSO regularization. The discriminating features include H3K9me3, H3K27ac, H3K4me1, open reading frames and several repeat elements. Importantly, chromatin information is suggested to be complementary to genomic sequence information, highlighting the importance of an integrated model. Applying integrated model, we obtain a list of putative lncRNAs based on uncharacterized fragments from transcriptome assembly. We demonstrate that the putative lncRNAs have regulatory roles in vicinity of known gene loci by expression and Gene Ontology enrichment analysis. We also show that the lncRNA expression specificity can be efficiently modeled by the chromatin data with same developmental stage. The study not only supports the biological hypothesis that chromatin can regulate expression of tissue-specific or developmental stage-specific lncRNAs but also reveals the discriminating features between lncRNA and coding genes, which would guide further lncRNA identifications and characterizations. Oxford University Press 2013-12 2013-09-13 /pmc/articles/PMC3905897/ /pubmed/24038472 http://dx.doi.org/10.1093/nar/gkt818 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Lv, Jie
Liu, Hongbo
Huang, Zhijun
Su, Jianzhong
He, Hongjuan
Xiu, Youcheng
Zhang, Yan
Wu, Qiong
Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features
title Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features
title_full Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features
title_fullStr Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features
title_full_unstemmed Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features
title_short Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features
title_sort long non-coding rna identification over mouse brain development by integrative modeling of chromatin and genomic features
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3905897/
https://www.ncbi.nlm.nih.gov/pubmed/24038472
http://dx.doi.org/10.1093/nar/gkt818
work_keys_str_mv AT lvjie longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures
AT liuhongbo longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures
AT huangzhijun longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures
AT sujianzhong longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures
AT hehongjuan longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures
AT xiuyoucheng longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures
AT zhangyan longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures
AT wuqiong longnoncodingrnaidentificationovermousebraindevelopmentbyintegrativemodelingofchromatinandgenomicfeatures