Cargando…

Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features

The significant role of microRNAs (miRNAs) in various biological processes and diseases has been widely studied and reported in recent years. Several computational methods associated with mature miRNA identification suffer various limitations involving canonical biological features extraction, class...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ying, Ru, Jidong, Jiang, Yueqiu, Zhang, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6365589/
https://www.ncbi.nlm.nih.gov/pubmed/30728425
http://dx.doi.org/10.1038/s41598-018-38048-7
_version_ 1783393454519222272
author Wang, Ying
Ru, Jidong
Jiang, Yueqiu
Zhang, Jian
author_facet Wang, Ying
Ru, Jidong
Jiang, Yueqiu
Zhang, Jian
author_sort Wang, Ying
collection PubMed
description The significant role of microRNAs (miRNAs) in various biological processes and diseases has been widely studied and reported in recent years. Several computational methods associated with mature miRNA identification suffer various limitations involving canonical biological features extraction, class imbalance, and classifier performance. The proposed classifier, miRFinder, is an accurate alternative for the identification of mature miRNAs. The structured-sequence features were proposed to precisely extract miRNA biological features, and three algorithms were selected to obtain the canonical features based on the classifier performance. Moreover, the center of mass near distance training based on K-means was provided to improve the class imbalance problem. In particular, the AdaBoost-SVM algorithm was used to construct the classifier. The classifier training process focuses on incorrectly classified samples, and the integrated results use the common decision strategies of the weak classifier with different weights. In addition, the all mature miRNA sites were predicted by different classifiers based on the features of different sites. Compared with other methods, the performance of the classifiers has a high degree of efficacy for the identification of mature miRNAs. MiRFinder is freely available at https://github.com/wangying0128/miRFinder.
format Online
Article
Text
id pubmed-6365589
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-63655892019-02-08 Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features Wang, Ying Ru, Jidong Jiang, Yueqiu Zhang, Jian Sci Rep Article The significant role of microRNAs (miRNAs) in various biological processes and diseases has been widely studied and reported in recent years. Several computational methods associated with mature miRNA identification suffer various limitations involving canonical biological features extraction, class imbalance, and classifier performance. The proposed classifier, miRFinder, is an accurate alternative for the identification of mature miRNAs. The structured-sequence features were proposed to precisely extract miRNA biological features, and three algorithms were selected to obtain the canonical features based on the classifier performance. Moreover, the center of mass near distance training based on K-means was provided to improve the class imbalance problem. In particular, the AdaBoost-SVM algorithm was used to construct the classifier. The classifier training process focuses on incorrectly classified samples, and the integrated results use the common decision strategies of the weak classifier with different weights. In addition, the all mature miRNA sites were predicted by different classifiers based on the features of different sites. Compared with other methods, the performance of the classifiers has a high degree of efficacy for the identification of mature miRNAs. MiRFinder is freely available at https://github.com/wangying0128/miRFinder. Nature Publishing Group UK 2019-02-06 /pmc/articles/PMC6365589/ /pubmed/30728425 http://dx.doi.org/10.1038/s41598-018-38048-7 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wang, Ying
Ru, Jidong
Jiang, Yueqiu
Zhang, Jian
Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features
title Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features
title_full Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features
title_fullStr Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features
title_full_unstemmed Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features
title_short Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features
title_sort adaboost-svm-based probability algorithm for the prediction of all mature mirna sites based on structured-sequence features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6365589/
https://www.ncbi.nlm.nih.gov/pubmed/30728425
http://dx.doi.org/10.1038/s41598-018-38048-7
work_keys_str_mv AT wangying adaboostsvmbasedprobabilityalgorithmforthepredictionofallmaturemirnasitesbasedonstructuredsequencefeatures
AT rujidong adaboostsvmbasedprobabilityalgorithmforthepredictionofallmaturemirnasitesbasedonstructuredsequencefeatures
AT jiangyueqiu adaboostsvmbasedprobabilityalgorithmforthepredictionofallmaturemirnasitesbasedonstructuredsequencefeatures
AT zhangjian adaboostsvmbasedprobabilityalgorithmforthepredictionofallmaturemirnasitesbasedonstructuredsequencefeatures