Cargando…

Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes

BACKGROUND: MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expressio...

Descripción completa

Detalles Bibliográficos
Autores principales: Richardson, Casey R., Luo, Qing-Jun, Gontcharova, Viktoria, Jiang, Ying-Wen, Samanta, Manoj, Youn, Eunseog, Rock, Christopher D.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2877095/
https://www.ncbi.nlm.nih.gov/pubmed/20520764
http://dx.doi.org/10.1371/journal.pone.0010710
_version_ 1782181765039783936
author Richardson, Casey R.
Luo, Qing-Jun
Gontcharova, Viktoria
Jiang, Ying-Wen
Samanta, Manoj
Youn, Eunseog
Rock, Christopher D.
author_facet Richardson, Casey R.
Luo, Qing-Jun
Gontcharova, Viktoria
Jiang, Ying-Wen
Samanta, Manoj
Youn, Eunseog
Rock, Christopher D.
author_sort Richardson, Casey R.
collection PubMed
description BACKGROUND: MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. PRINCIPAL FINDINGS: We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. CONCLUSIONS: Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation.
format Text
id pubmed-2877095
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28770952010-06-02 Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes Richardson, Casey R. Luo, Qing-Jun Gontcharova, Viktoria Jiang, Ying-Wen Samanta, Manoj Youn, Eunseog Rock, Christopher D. PLoS One Research Article BACKGROUND: MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. PRINCIPAL FINDINGS: We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. CONCLUSIONS: Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation. Public Library of Science 2010-05-26 /pmc/articles/PMC2877095/ /pubmed/20520764 http://dx.doi.org/10.1371/journal.pone.0010710 Text en Richardson et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Richardson, Casey R.
Luo, Qing-Jun
Gontcharova, Viktoria
Jiang, Ying-Wen
Samanta, Manoj
Youn, Eunseog
Rock, Christopher D.
Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes
title Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes
title_full Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes
title_fullStr Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes
title_full_unstemmed Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes
title_short Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes
title_sort analysis of antisense expression by whole genome tiling microarrays and sirnas suggests mis-annotation of arabidopsis orphan protein-coding genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2877095/
https://www.ncbi.nlm.nih.gov/pubmed/20520764
http://dx.doi.org/10.1371/journal.pone.0010710
work_keys_str_mv AT richardsoncaseyr analysisofantisenseexpressionbywholegenometilingmicroarraysandsirnassuggestsmisannotationofarabidopsisorphanproteincodinggenes
AT luoqingjun analysisofantisenseexpressionbywholegenometilingmicroarraysandsirnassuggestsmisannotationofarabidopsisorphanproteincodinggenes
AT gontcharovaviktoria analysisofantisenseexpressionbywholegenometilingmicroarraysandsirnassuggestsmisannotationofarabidopsisorphanproteincodinggenes
AT jiangyingwen analysisofantisenseexpressionbywholegenometilingmicroarraysandsirnassuggestsmisannotationofarabidopsisorphanproteincodinggenes
AT samantamanoj analysisofantisenseexpressionbywholegenometilingmicroarraysandsirnassuggestsmisannotationofarabidopsisorphanproteincodinggenes
AT youneunseog analysisofantisenseexpressionbywholegenometilingmicroarraysandsirnassuggestsmisannotationofarabidopsisorphanproteincodinggenes
AT rockchristopherd analysisofantisenseexpressionbywholegenometilingmicroarraysandsirnassuggestsmisannotationofarabidopsisorphanproteincodinggenes