Cargando…

Improving performance of mammalian microRNA target prediction

BACKGROUND: MicroRNAs (miRNAs) are single-stranded non-coding RNAs known to regulate a wide range of cellular processes by silencing the gene expression at the protein and/or mRNA levels. Computational prediction of miRNA targets is essential for elucidating the detailed functions of miRNA. However,...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Hui, Yue, Dong, Chen, Yidong, Gao, Shou-Jiang, Huang, Yufei
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2955701/
https://www.ncbi.nlm.nih.gov/pubmed/20860840
http://dx.doi.org/10.1186/1471-2105-11-476
_version_ 1782188072689991680
author Liu, Hui
Yue, Dong
Chen, Yidong
Gao, Shou-Jiang
Huang, Yufei
author_facet Liu, Hui
Yue, Dong
Chen, Yidong
Gao, Shou-Jiang
Huang, Yufei
author_sort Liu, Hui
collection PubMed
description BACKGROUND: MicroRNAs (miRNAs) are single-stranded non-coding RNAs known to regulate a wide range of cellular processes by silencing the gene expression at the protein and/or mRNA levels. Computational prediction of miRNA targets is essential for elucidating the detailed functions of miRNA. However, the prediction specificity and sensitivity of the existing algorithms are still poor to generate meaningful, workable hypotheses for subsequent experimental testing. Constructing a richer and more reliable training data set and developing an algorithm that properly exploits this data set would be the key to improve the performance current prediction algorithms. RESULTS: A comprehensive training data set is constructed for mammalian miRNAs with its positive targets obtained from the most up-to-date miRNA target depository called miRecords and its negative targets derived from 20 microarray data. A new algorithm SVMicrO is developed, which assumes a 2-stage structure including a site support vector machine (SVM) followed by a UTR-SVM. SVMicrO makes prediction based on 21 optimal site features and 18 optimal UTR features, selected by training from a comprehensive collection of 113 site and 30 UTR features. Comprehensive evaluation of SVMicrO performance has been carried out on the training data, proteomics data, and immunoprecipitation (IP) pull-down data. Comparisons with some popular algorithms demonstrate consistent improvements in prediction specificity, sensitivity and precision in all tested cases. All the related materials including source code and genome-wide prediction of human targets are available at http://compgenomics.utsa.edu/svmicro.html. CONCLUSIONS: A 2-stage SVM based new miRNA target prediction algorithm called SVMicrO is developed. SVMicrO is shown to be able to achieve robust performance. It holds the promise to achieve continuing improvement whenever better training data that contain additional verified or high confidence positive targets and properly selected negative targets are available.
format Text
id pubmed-2955701
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29557012010-10-18 Improving performance of mammalian microRNA target prediction Liu, Hui Yue, Dong Chen, Yidong Gao, Shou-Jiang Huang, Yufei BMC Bioinformatics Research Article BACKGROUND: MicroRNAs (miRNAs) are single-stranded non-coding RNAs known to regulate a wide range of cellular processes by silencing the gene expression at the protein and/or mRNA levels. Computational prediction of miRNA targets is essential for elucidating the detailed functions of miRNA. However, the prediction specificity and sensitivity of the existing algorithms are still poor to generate meaningful, workable hypotheses for subsequent experimental testing. Constructing a richer and more reliable training data set and developing an algorithm that properly exploits this data set would be the key to improve the performance current prediction algorithms. RESULTS: A comprehensive training data set is constructed for mammalian miRNAs with its positive targets obtained from the most up-to-date miRNA target depository called miRecords and its negative targets derived from 20 microarray data. A new algorithm SVMicrO is developed, which assumes a 2-stage structure including a site support vector machine (SVM) followed by a UTR-SVM. SVMicrO makes prediction based on 21 optimal site features and 18 optimal UTR features, selected by training from a comprehensive collection of 113 site and 30 UTR features. Comprehensive evaluation of SVMicrO performance has been carried out on the training data, proteomics data, and immunoprecipitation (IP) pull-down data. Comparisons with some popular algorithms demonstrate consistent improvements in prediction specificity, sensitivity and precision in all tested cases. All the related materials including source code and genome-wide prediction of human targets are available at http://compgenomics.utsa.edu/svmicro.html. CONCLUSIONS: A 2-stage SVM based new miRNA target prediction algorithm called SVMicrO is developed. SVMicrO is shown to be able to achieve robust performance. It holds the promise to achieve continuing improvement whenever better training data that contain additional verified or high confidence positive targets and properly selected negative targets are available. BioMed Central 2010-09-22 /pmc/articles/PMC2955701/ /pubmed/20860840 http://dx.doi.org/10.1186/1471-2105-11-476 Text en Copyright ©2010 Liu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liu, Hui
Yue, Dong
Chen, Yidong
Gao, Shou-Jiang
Huang, Yufei
Improving performance of mammalian microRNA target prediction
title Improving performance of mammalian microRNA target prediction
title_full Improving performance of mammalian microRNA target prediction
title_fullStr Improving performance of mammalian microRNA target prediction
title_full_unstemmed Improving performance of mammalian microRNA target prediction
title_short Improving performance of mammalian microRNA target prediction
title_sort improving performance of mammalian microrna target prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2955701/
https://www.ncbi.nlm.nih.gov/pubmed/20860840
http://dx.doi.org/10.1186/1471-2105-11-476
work_keys_str_mv AT liuhui improvingperformanceofmammalianmicrornatargetprediction
AT yuedong improvingperformanceofmammalianmicrornatargetprediction
AT chenyidong improvingperformanceofmammalianmicrornatargetprediction
AT gaoshoujiang improvingperformanceofmammalianmicrornatargetprediction
AT huangyufei improvingperformanceofmammalianmicrornatargetprediction