Cargando…

ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations

BACKGROUND: A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional res...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xue-Jun, Hua, Xin-Yun, Jiang, Zhen-Ran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254275/
https://www.ncbi.nlm.nih.gov/pubmed/34215183
http://dx.doi.org/10.1186/s12859-021-04266-6
_version_ 1783717695685918720
author Chen, Xue-Jun
Hua, Xin-Yun
Jiang, Zhen-Ran
author_facet Chen, Xue-Jun
Hua, Xin-Yun
Jiang, Zhen-Ran
author_sort Chen, Xue-Jun
collection PubMed
description BACKGROUND: A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional research. RESULTS: Inspired by the work of predecessors, we discover that the noise hiding in the data can affect the prediction performance and then propose an anti-noise algorithm (ANMDA) to predict potential miRNA-disease associations. Firstly, we calculate the similarity in miRNAs and diseases to construct features and obtain positive samples according to the Human MicroRNA Disease Database version 2.0 (HMDD v2.0). Then, we apply k-means on the undetected miRNA-disease associations and sample the negative examples equally from the k-cluster. Further, we construct several data subsets through sampling with replacement to feed on the light gradient boosting machine (LightGBM) method. Finally, the voting method is applied to predict potential miRNA-disease relationships. As a result, ANMDA can achieve an area under the receiver operating characteristic curve (AUROC) of 0.9373 ± 0.0005 in five-fold cross-validation, which is superior to several published methods. In addition, we analyze the predicted miRNA-disease associations with high probability and compare them with the data in HMDD v3.0 in the case study. The results show ANMDA is a novel and practical algorithm that can be used to infer potential miRNA-disease associations. CONCLUSION: The results indicate the noise hiding in the data has an obvious impact on predicting potential miRNA-disease associations. We believe ANMDA can achieve better results from this task with more methods used in dealing with the data noise. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04266-6.
format Online
Article
Text
id pubmed-8254275
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82542752021-07-06 ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations Chen, Xue-Jun Hua, Xin-Yun Jiang, Zhen-Ran BMC Bioinformatics Research Article BACKGROUND: A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional research. RESULTS: Inspired by the work of predecessors, we discover that the noise hiding in the data can affect the prediction performance and then propose an anti-noise algorithm (ANMDA) to predict potential miRNA-disease associations. Firstly, we calculate the similarity in miRNAs and diseases to construct features and obtain positive samples according to the Human MicroRNA Disease Database version 2.0 (HMDD v2.0). Then, we apply k-means on the undetected miRNA-disease associations and sample the negative examples equally from the k-cluster. Further, we construct several data subsets through sampling with replacement to feed on the light gradient boosting machine (LightGBM) method. Finally, the voting method is applied to predict potential miRNA-disease relationships. As a result, ANMDA can achieve an area under the receiver operating characteristic curve (AUROC) of 0.9373 ± 0.0005 in five-fold cross-validation, which is superior to several published methods. In addition, we analyze the predicted miRNA-disease associations with high probability and compare them with the data in HMDD v3.0 in the case study. The results show ANMDA is a novel and practical algorithm that can be used to infer potential miRNA-disease associations. CONCLUSION: The results indicate the noise hiding in the data has an obvious impact on predicting potential miRNA-disease associations. We believe ANMDA can achieve better results from this task with more methods used in dealing with the data noise. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04266-6. BioMed Central 2021-07-02 /pmc/articles/PMC8254275/ /pubmed/34215183 http://dx.doi.org/10.1186/s12859-021-04266-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Chen, Xue-Jun
Hua, Xin-Yun
Jiang, Zhen-Ran
ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations
title ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations
title_full ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations
title_fullStr ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations
title_full_unstemmed ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations
title_short ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations
title_sort anmda: anti-noise based computational model for predicting potential mirna-disease associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254275/
https://www.ncbi.nlm.nih.gov/pubmed/34215183
http://dx.doi.org/10.1186/s12859-021-04266-6
work_keys_str_mv AT chenxuejun anmdaantinoisebasedcomputationalmodelforpredictingpotentialmirnadiseaseassociations
AT huaxinyun anmdaantinoisebasedcomputationalmodelforpredictingpotentialmirnadiseaseassociations
AT jiangzhenran anmdaantinoisebasedcomputationalmodelforpredictingpotentialmirnadiseaseassociations