Cargando…

Computational prediction of novel non-coding RNAs in Arabidopsis thaliana

BACKGROUND: Non-coding RNA (ncRNA) genes do not encode proteins but produce functional RNA molecules that play crucial roles in many key biological processes. Recent genome-wide transcriptional profiling studies using tiling arrays in organisms such as human and Arabidopsis have revealed a great num...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Dandan, Yang, Yang, Yu, Bin, Zheng, Binglian, Deng, Zhidong, Lu, Bao-Liang, Chen, Xuemei, Jiang, Tao
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648795/
https://www.ncbi.nlm.nih.gov/pubmed/19208137
http://dx.doi.org/10.1186/1471-2105-10-S1-S36
_version_ 1782164990119116800
author Song, Dandan
Yang, Yang
Yu, Bin
Zheng, Binglian
Deng, Zhidong
Lu, Bao-Liang
Chen, Xuemei
Jiang, Tao
author_facet Song, Dandan
Yang, Yang
Yu, Bin
Zheng, Binglian
Deng, Zhidong
Lu, Bao-Liang
Chen, Xuemei
Jiang, Tao
author_sort Song, Dandan
collection PubMed
description BACKGROUND: Non-coding RNA (ncRNA) genes do not encode proteins but produce functional RNA molecules that play crucial roles in many key biological processes. Recent genome-wide transcriptional profiling studies using tiling arrays in organisms such as human and Arabidopsis have revealed a great number of transcripts, a large portion of which have little or no capability to encode proteins. This unexpected finding suggests that the currently known repertoire of ncRNAs may only represent a small fraction of ncRNAs of the organisms. Thus, efficient and effective prediction of ncRNAs has become an important task in bioinformatics in recent years. Among the available computational methods, the comparative genomic approach seems to be the most powerful to detect ncRNAs. The recent completion of the sequencing of several major plant genomes has made the approach possible for plants. RESULTS: We have developed a pipeline to predict novel ncRNAs in the Arabidopsis (Arabidopsis thaliana) genome. It starts by comparing the expressed intergenic regions of Arabidopsis as provided in two whole-genome high-density oligo-probe arrays from the literature with the intergenic nucleotide sequences of all completely sequenced plant genomes including rice (Oryza sativa), poplar (Populus trichocarpa), grape (Vitis vinifera), and papaya (Carica papaya). By using multiple sequence alignment, a popular ncRNA prediction program (RNAz), wet-bench experimental validation, protein-coding potential analysis, and stringent screening against various ncRNA databases, the pipeline resulted in 16 families of novel ncRNAs (with a total of 21 ncRNAs). CONCLUSION: In this paper, we undertake a genome-wide search for novel ncRNAs in the genome of Arabidopsis by a comparative genomics approach. The identified novel ncRNAs are evolutionarily conserved between Arabidopsis and other recently sequenced plants, and may conduct interesting novel biological functions.
format Text
id pubmed-2648795
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487952009-02-28 Computational prediction of novel non-coding RNAs in Arabidopsis thaliana Song, Dandan Yang, Yang Yu, Bin Zheng, Binglian Deng, Zhidong Lu, Bao-Liang Chen, Xuemei Jiang, Tao BMC Bioinformatics Research BACKGROUND: Non-coding RNA (ncRNA) genes do not encode proteins but produce functional RNA molecules that play crucial roles in many key biological processes. Recent genome-wide transcriptional profiling studies using tiling arrays in organisms such as human and Arabidopsis have revealed a great number of transcripts, a large portion of which have little or no capability to encode proteins. This unexpected finding suggests that the currently known repertoire of ncRNAs may only represent a small fraction of ncRNAs of the organisms. Thus, efficient and effective prediction of ncRNAs has become an important task in bioinformatics in recent years. Among the available computational methods, the comparative genomic approach seems to be the most powerful to detect ncRNAs. The recent completion of the sequencing of several major plant genomes has made the approach possible for plants. RESULTS: We have developed a pipeline to predict novel ncRNAs in the Arabidopsis (Arabidopsis thaliana) genome. It starts by comparing the expressed intergenic regions of Arabidopsis as provided in two whole-genome high-density oligo-probe arrays from the literature with the intergenic nucleotide sequences of all completely sequenced plant genomes including rice (Oryza sativa), poplar (Populus trichocarpa), grape (Vitis vinifera), and papaya (Carica papaya). By using multiple sequence alignment, a popular ncRNA prediction program (RNAz), wet-bench experimental validation, protein-coding potential analysis, and stringent screening against various ncRNA databases, the pipeline resulted in 16 families of novel ncRNAs (with a total of 21 ncRNAs). CONCLUSION: In this paper, we undertake a genome-wide search for novel ncRNAs in the genome of Arabidopsis by a comparative genomics approach. The identified novel ncRNAs are evolutionarily conserved between Arabidopsis and other recently sequenced plants, and may conduct interesting novel biological functions. BioMed Central 2009-01-30 /pmc/articles/PMC2648795/ /pubmed/19208137 http://dx.doi.org/10.1186/1471-2105-10-S1-S36 Text en Copyright © 2009 Song et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Song, Dandan
Yang, Yang
Yu, Bin
Zheng, Binglian
Deng, Zhidong
Lu, Bao-Liang
Chen, Xuemei
Jiang, Tao
Computational prediction of novel non-coding RNAs in Arabidopsis thaliana
title Computational prediction of novel non-coding RNAs in Arabidopsis thaliana
title_full Computational prediction of novel non-coding RNAs in Arabidopsis thaliana
title_fullStr Computational prediction of novel non-coding RNAs in Arabidopsis thaliana
title_full_unstemmed Computational prediction of novel non-coding RNAs in Arabidopsis thaliana
title_short Computational prediction of novel non-coding RNAs in Arabidopsis thaliana
title_sort computational prediction of novel non-coding rnas in arabidopsis thaliana
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648795/
https://www.ncbi.nlm.nih.gov/pubmed/19208137
http://dx.doi.org/10.1186/1471-2105-10-S1-S36
work_keys_str_mv AT songdandan computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana
AT yangyang computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana
AT yubin computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana
AT zhengbinglian computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana
AT dengzhidong computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana
AT lubaoliang computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana
AT chenxuemei computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana
AT jiangtao computationalpredictionofnovelnoncodingrnasinarabidopsisthaliana