Cargando…
miRA: adaptable novel miRNA identification in plants using small RNA sequencing data
BACKGROUND: MicroRNAs (miRNAs) are short regulatory RNAs derived from longer precursor RNAs. miRNA biogenesis has been studied in animals and plants, recently elucidating more complex aspects, such as non-conserved, species-specific, and heterogeneous miRNA precursor populations. Small RNA sequencin...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4635600/ https://www.ncbi.nlm.nih.gov/pubmed/26542525 http://dx.doi.org/10.1186/s12859-015-0798-3 |
_version_ | 1782399529689022464 |
---|---|
author | Evers, Maurits Huttner, Michael Dueck, Anne Meister, Gunter Engelmann, Julia C. |
author_facet | Evers, Maurits Huttner, Michael Dueck, Anne Meister, Gunter Engelmann, Julia C. |
author_sort | Evers, Maurits |
collection | PubMed |
description | BACKGROUND: MicroRNAs (miRNAs) are short regulatory RNAs derived from longer precursor RNAs. miRNA biogenesis has been studied in animals and plants, recently elucidating more complex aspects, such as non-conserved, species-specific, and heterogeneous miRNA precursor populations. Small RNA sequencing data can help in computationally identifying genomic loci of miRNA precursors. The challenge is to predict a valid miRNA precursor from inhomogeneous read coverage from a complex RNA library: while the mature miRNA typically produces many sequence reads, the remaining part of the precursor is covered very sparsely. As recent results suggest, alternative miRNA biogenesis pathways may lead to a more diverse miRNA precursor population than previously assumed. In plants, the latter manifests itself in e.g. complex secondary structures and expression from multiple loci within precursors. Current miRNA identification algorithms often depend on already existing gene annotation, and/or make use of specific miRNA precursor features such as precursor lengths, secondary structures etc. Consequently and in view of the emerging new understanding of a more complex miRNA biogenesis in plants, current tools may fail to characterise organism-specific and heterogeneous miRNA populations. RESULTS: miRA is a new tool to identify miRNA precursors in plants, allowing for heterogeneous and complex precursor populations. miRA requires small RNA sequencing data and a corresponding reference genome, and evaluates precursor secondary structures and precursor processing accuracy; key parameters can be adapted based on the specific organism under investigation. We show that miRA outperforms the currently best plant miRNA prediction tools both in sensitivity and specificity, for data involving Arabidopsis thaliana and the Volvocine algae Chlamydomonas reinhardtii; the latter organism has been shown to exhibit a heterogeneous and complex precursor population with little cross-species miRNA sequence conservation, and therefore constitutes an ideal model organism. Furthermore we identify novel miRNAs in the Chlamydomonas-related organism Volvox carteri. CONCLUSIONS: We propose miRA, a new plant miRNA identification tool that is well adapted to complex precursor populations. miRA is particularly suited for organisms with no existing miRNA annotation, or without a known related organism with well characterized miRNAs. Moreover, miRA has proven its ability to identify species-specific miRNAs. miRA is flexible in its parameter settings, and produces user-friendly output files in various formats (pdf, csv, genome-browser-suitable annotation files, etc.). It is freely available at https://github.com/mhuttner/miRA. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0798-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4635600 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46356002015-11-07 miRA: adaptable novel miRNA identification in plants using small RNA sequencing data Evers, Maurits Huttner, Michael Dueck, Anne Meister, Gunter Engelmann, Julia C. BMC Bioinformatics Software BACKGROUND: MicroRNAs (miRNAs) are short regulatory RNAs derived from longer precursor RNAs. miRNA biogenesis has been studied in animals and plants, recently elucidating more complex aspects, such as non-conserved, species-specific, and heterogeneous miRNA precursor populations. Small RNA sequencing data can help in computationally identifying genomic loci of miRNA precursors. The challenge is to predict a valid miRNA precursor from inhomogeneous read coverage from a complex RNA library: while the mature miRNA typically produces many sequence reads, the remaining part of the precursor is covered very sparsely. As recent results suggest, alternative miRNA biogenesis pathways may lead to a more diverse miRNA precursor population than previously assumed. In plants, the latter manifests itself in e.g. complex secondary structures and expression from multiple loci within precursors. Current miRNA identification algorithms often depend on already existing gene annotation, and/or make use of specific miRNA precursor features such as precursor lengths, secondary structures etc. Consequently and in view of the emerging new understanding of a more complex miRNA biogenesis in plants, current tools may fail to characterise organism-specific and heterogeneous miRNA populations. RESULTS: miRA is a new tool to identify miRNA precursors in plants, allowing for heterogeneous and complex precursor populations. miRA requires small RNA sequencing data and a corresponding reference genome, and evaluates precursor secondary structures and precursor processing accuracy; key parameters can be adapted based on the specific organism under investigation. We show that miRA outperforms the currently best plant miRNA prediction tools both in sensitivity and specificity, for data involving Arabidopsis thaliana and the Volvocine algae Chlamydomonas reinhardtii; the latter organism has been shown to exhibit a heterogeneous and complex precursor population with little cross-species miRNA sequence conservation, and therefore constitutes an ideal model organism. Furthermore we identify novel miRNAs in the Chlamydomonas-related organism Volvox carteri. CONCLUSIONS: We propose miRA, a new plant miRNA identification tool that is well adapted to complex precursor populations. miRA is particularly suited for organisms with no existing miRNA annotation, or without a known related organism with well characterized miRNAs. Moreover, miRA has proven its ability to identify species-specific miRNAs. miRA is flexible in its parameter settings, and produces user-friendly output files in various formats (pdf, csv, genome-browser-suitable annotation files, etc.). It is freely available at https://github.com/mhuttner/miRA. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0798-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-05 /pmc/articles/PMC4635600/ /pubmed/26542525 http://dx.doi.org/10.1186/s12859-015-0798-3 Text en © Evers et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Evers, Maurits Huttner, Michael Dueck, Anne Meister, Gunter Engelmann, Julia C. miRA: adaptable novel miRNA identification in plants using small RNA sequencing data |
title | miRA: adaptable novel miRNA identification in plants using small RNA sequencing data |
title_full | miRA: adaptable novel miRNA identification in plants using small RNA sequencing data |
title_fullStr | miRA: adaptable novel miRNA identification in plants using small RNA sequencing data |
title_full_unstemmed | miRA: adaptable novel miRNA identification in plants using small RNA sequencing data |
title_short | miRA: adaptable novel miRNA identification in plants using small RNA sequencing data |
title_sort | mira: adaptable novel mirna identification in plants using small rna sequencing data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4635600/ https://www.ncbi.nlm.nih.gov/pubmed/26542525 http://dx.doi.org/10.1186/s12859-015-0798-3 |
work_keys_str_mv | AT eversmaurits miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata AT huttnermichael miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata AT dueckanne miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata AT meistergunter miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata AT engelmannjuliac miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata |