Cargando…

miRA: adaptable novel miRNA identification in plants using small RNA sequencing data

BACKGROUND: MicroRNAs (miRNAs) are short regulatory RNAs derived from longer precursor RNAs. miRNA biogenesis has been studied in animals and plants, recently elucidating more complex aspects, such as non-conserved, species-specific, and heterogeneous miRNA precursor populations. Small RNA sequencin...

Descripción completa

Detalles Bibliográficos
Autores principales: Evers, Maurits, Huttner, Michael, Dueck, Anne, Meister, Gunter, Engelmann, Julia C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4635600/
https://www.ncbi.nlm.nih.gov/pubmed/26542525
http://dx.doi.org/10.1186/s12859-015-0798-3
_version_ 1782399529689022464
author Evers, Maurits
Huttner, Michael
Dueck, Anne
Meister, Gunter
Engelmann, Julia C.
author_facet Evers, Maurits
Huttner, Michael
Dueck, Anne
Meister, Gunter
Engelmann, Julia C.
author_sort Evers, Maurits
collection PubMed
description BACKGROUND: MicroRNAs (miRNAs) are short regulatory RNAs derived from longer precursor RNAs. miRNA biogenesis has been studied in animals and plants, recently elucidating more complex aspects, such as non-conserved, species-specific, and heterogeneous miRNA precursor populations. Small RNA sequencing data can help in computationally identifying genomic loci of miRNA precursors. The challenge is to predict a valid miRNA precursor from inhomogeneous read coverage from a complex RNA library: while the mature miRNA typically produces many sequence reads, the remaining part of the precursor is covered very sparsely. As recent results suggest, alternative miRNA biogenesis pathways may lead to a more diverse miRNA precursor population than previously assumed. In plants, the latter manifests itself in e.g. complex secondary structures and expression from multiple loci within precursors. Current miRNA identification algorithms often depend on already existing gene annotation, and/or make use of specific miRNA precursor features such as precursor lengths, secondary structures etc. Consequently and in view of the emerging new understanding of a more complex miRNA biogenesis in plants, current tools may fail to characterise organism-specific and heterogeneous miRNA populations. RESULTS: miRA is a new tool to identify miRNA precursors in plants, allowing for heterogeneous and complex precursor populations. miRA requires small RNA sequencing data and a corresponding reference genome, and evaluates precursor secondary structures and precursor processing accuracy; key parameters can be adapted based on the specific organism under investigation. We show that miRA outperforms the currently best plant miRNA prediction tools both in sensitivity and specificity, for data involving Arabidopsis thaliana and the Volvocine algae Chlamydomonas reinhardtii; the latter organism has been shown to exhibit a heterogeneous and complex precursor population with little cross-species miRNA sequence conservation, and therefore constitutes an ideal model organism. Furthermore we identify novel miRNAs in the Chlamydomonas-related organism Volvox carteri. CONCLUSIONS: We propose miRA, a new plant miRNA identification tool that is well adapted to complex precursor populations. miRA is particularly suited for organisms with no existing miRNA annotation, or without a known related organism with well characterized miRNAs. Moreover, miRA has proven its ability to identify species-specific miRNAs. miRA is flexible in its parameter settings, and produces user-friendly output files in various formats (pdf, csv, genome-browser-suitable annotation files, etc.). It is freely available at https://github.com/mhuttner/miRA. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0798-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4635600
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46356002015-11-07 miRA: adaptable novel miRNA identification in plants using small RNA sequencing data Evers, Maurits Huttner, Michael Dueck, Anne Meister, Gunter Engelmann, Julia C. BMC Bioinformatics Software BACKGROUND: MicroRNAs (miRNAs) are short regulatory RNAs derived from longer precursor RNAs. miRNA biogenesis has been studied in animals and plants, recently elucidating more complex aspects, such as non-conserved, species-specific, and heterogeneous miRNA precursor populations. Small RNA sequencing data can help in computationally identifying genomic loci of miRNA precursors. The challenge is to predict a valid miRNA precursor from inhomogeneous read coverage from a complex RNA library: while the mature miRNA typically produces many sequence reads, the remaining part of the precursor is covered very sparsely. As recent results suggest, alternative miRNA biogenesis pathways may lead to a more diverse miRNA precursor population than previously assumed. In plants, the latter manifests itself in e.g. complex secondary structures and expression from multiple loci within precursors. Current miRNA identification algorithms often depend on already existing gene annotation, and/or make use of specific miRNA precursor features such as precursor lengths, secondary structures etc. Consequently and in view of the emerging new understanding of a more complex miRNA biogenesis in plants, current tools may fail to characterise organism-specific and heterogeneous miRNA populations. RESULTS: miRA is a new tool to identify miRNA precursors in plants, allowing for heterogeneous and complex precursor populations. miRA requires small RNA sequencing data and a corresponding reference genome, and evaluates precursor secondary structures and precursor processing accuracy; key parameters can be adapted based on the specific organism under investigation. We show that miRA outperforms the currently best plant miRNA prediction tools both in sensitivity and specificity, for data involving Arabidopsis thaliana and the Volvocine algae Chlamydomonas reinhardtii; the latter organism has been shown to exhibit a heterogeneous and complex precursor population with little cross-species miRNA sequence conservation, and therefore constitutes an ideal model organism. Furthermore we identify novel miRNAs in the Chlamydomonas-related organism Volvox carteri. CONCLUSIONS: We propose miRA, a new plant miRNA identification tool that is well adapted to complex precursor populations. miRA is particularly suited for organisms with no existing miRNA annotation, or without a known related organism with well characterized miRNAs. Moreover, miRA has proven its ability to identify species-specific miRNAs. miRA is flexible in its parameter settings, and produces user-friendly output files in various formats (pdf, csv, genome-browser-suitable annotation files, etc.). It is freely available at https://github.com/mhuttner/miRA. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0798-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-05 /pmc/articles/PMC4635600/ /pubmed/26542525 http://dx.doi.org/10.1186/s12859-015-0798-3 Text en © Evers et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Evers, Maurits
Huttner, Michael
Dueck, Anne
Meister, Gunter
Engelmann, Julia C.
miRA: adaptable novel miRNA identification in plants using small RNA sequencing data
title miRA: adaptable novel miRNA identification in plants using small RNA sequencing data
title_full miRA: adaptable novel miRNA identification in plants using small RNA sequencing data
title_fullStr miRA: adaptable novel miRNA identification in plants using small RNA sequencing data
title_full_unstemmed miRA: adaptable novel miRNA identification in plants using small RNA sequencing data
title_short miRA: adaptable novel miRNA identification in plants using small RNA sequencing data
title_sort mira: adaptable novel mirna identification in plants using small rna sequencing data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4635600/
https://www.ncbi.nlm.nih.gov/pubmed/26542525
http://dx.doi.org/10.1186/s12859-015-0798-3
work_keys_str_mv AT eversmaurits miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata
AT huttnermichael miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata
AT dueckanne miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata
AT meistergunter miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata
AT engelmannjuliac miraadaptablenovelmirnaidentificationinplantsusingsmallrnasequencingdata