Cargando…

A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction

BACKGROUND: Deep-sequencing has enabled the identification of large numbers of miRNAs and siRNAs, making the high-throughput target identification a main limiting factor in defining their function. In plants, several tools have been developed to predict targets, majority of them being trained on Ara...

Descripción completa

Detalles Bibliográficos
Autores principales: Srivastava, Prashant K, Moturu, Taraka Ramji, Pandey, Priyanka, Baldwin, Ian T, Pandey, Shree P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4035075/
https://www.ncbi.nlm.nih.gov/pubmed/24885295
http://dx.doi.org/10.1186/1471-2164-15-348
_version_ 1782318018191163392
author Srivastava, Prashant K
Moturu, Taraka Ramji
Pandey, Priyanka
Baldwin, Ian T
Pandey, Shree P
author_facet Srivastava, Prashant K
Moturu, Taraka Ramji
Pandey, Priyanka
Baldwin, Ian T
Pandey, Shree P
author_sort Srivastava, Prashant K
collection PubMed
description BACKGROUND: Deep-sequencing has enabled the identification of large numbers of miRNAs and siRNAs, making the high-throughput target identification a main limiting factor in defining their function. In plants, several tools have been developed to predict targets, majority of them being trained on Arabidopsis datasets. An extensive and systematic evaluation has not been made for their suitability for predicting targets in species other than Arabidopsis. Nor, these have not been evaluated for their suitability for high-throughput target prediction at genome level. RESULTS: We evaluated the performance of 11 computational tools in identifying genome-wide targets in Arabidopsis and other plants with procedures that optimized score-cutoffs for estimating targets. Targetfinder was most efficient [89% ‘precision’ (accuracy of prediction), 97% ‘recall’ (sensitivity)] in predicting ‘true-positive’ targets in Arabidopsis miRNA-mRNA interactions. In contrast, only 46% of true positive interactions from non-Arabidopsis species were detected, indicating low ‘recall’ values. Score optimizations increased the ‘recall’ to only 70% (corresponding ‘precision’: 65%) for datasets of true miRNA-mRNA interactions in species other than Arabidopsis. Combining the results of Targetfinder and psRNATarget delivers high true positive coverage, whereas the intersection of psRNATarget and Tapirhybrid outputs deliver highly ‘precise’ predictions. The large number of ‘false negative’ predictions delivered from non-Arabidopsis datasets by all the available tools indicate the diversity in miRNAs-mRNA interaction features between Arabidopsis and other species. A subset of miRNA-mRNA interactions differed significantly for features in seed regions as well as the total number of matches/mismatches. CONCLUSION: Although, many plant miRNA target prediction tools may be optimized to predict targets with high specificity in Arabidopsis, such optimized thresholds may not be suitable for many targets in non-Arabidopsis species. More importantly, non-conventional features of miRNA-mRNA interaction may exist in plants indicating alternate mode of miRNA target recognition. Incorporation of these divergent features would enable next-generation of algorithms to better identify target interactions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-348) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4035075
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40350752014-06-06 A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction Srivastava, Prashant K Moturu, Taraka Ramji Pandey, Priyanka Baldwin, Ian T Pandey, Shree P BMC Genomics Research Article BACKGROUND: Deep-sequencing has enabled the identification of large numbers of miRNAs and siRNAs, making the high-throughput target identification a main limiting factor in defining their function. In plants, several tools have been developed to predict targets, majority of them being trained on Arabidopsis datasets. An extensive and systematic evaluation has not been made for their suitability for predicting targets in species other than Arabidopsis. Nor, these have not been evaluated for their suitability for high-throughput target prediction at genome level. RESULTS: We evaluated the performance of 11 computational tools in identifying genome-wide targets in Arabidopsis and other plants with procedures that optimized score-cutoffs for estimating targets. Targetfinder was most efficient [89% ‘precision’ (accuracy of prediction), 97% ‘recall’ (sensitivity)] in predicting ‘true-positive’ targets in Arabidopsis miRNA-mRNA interactions. In contrast, only 46% of true positive interactions from non-Arabidopsis species were detected, indicating low ‘recall’ values. Score optimizations increased the ‘recall’ to only 70% (corresponding ‘precision’: 65%) for datasets of true miRNA-mRNA interactions in species other than Arabidopsis. Combining the results of Targetfinder and psRNATarget delivers high true positive coverage, whereas the intersection of psRNATarget and Tapirhybrid outputs deliver highly ‘precise’ predictions. The large number of ‘false negative’ predictions delivered from non-Arabidopsis datasets by all the available tools indicate the diversity in miRNAs-mRNA interaction features between Arabidopsis and other species. A subset of miRNA-mRNA interactions differed significantly for features in seed regions as well as the total number of matches/mismatches. CONCLUSION: Although, many plant miRNA target prediction tools may be optimized to predict targets with high specificity in Arabidopsis, such optimized thresholds may not be suitable for many targets in non-Arabidopsis species. More importantly, non-conventional features of miRNA-mRNA interaction may exist in plants indicating alternate mode of miRNA target recognition. Incorporation of these divergent features would enable next-generation of algorithms to better identify target interactions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-348) contains supplementary material, which is available to authorized users. BioMed Central 2014-05-08 /pmc/articles/PMC4035075/ /pubmed/24885295 http://dx.doi.org/10.1186/1471-2164-15-348 Text en © Srivastava et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Srivastava, Prashant K
Moturu, Taraka Ramji
Pandey, Priyanka
Baldwin, Ian T
Pandey, Shree P
A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction
title A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction
title_full A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction
title_fullStr A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction
title_full_unstemmed A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction
title_short A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction
title_sort comparison of performance of plant mirna target prediction tools and the characterization of features for genome-wide target prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4035075/
https://www.ncbi.nlm.nih.gov/pubmed/24885295
http://dx.doi.org/10.1186/1471-2164-15-348
work_keys_str_mv AT srivastavaprashantk acomparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT moturutarakaramji acomparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT pandeypriyanka acomparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT baldwiniant acomparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT pandeyshreep acomparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT srivastavaprashantk comparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT moturutarakaramji comparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT pandeypriyanka comparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT baldwiniant comparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction
AT pandeyshreep comparisonofperformanceofplantmirnatargetpredictiontoolsandthecharacterizationoffeaturesforgenomewidetargetprediction