Cargando…
ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations
BACKGROUND: A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional res...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254275/ https://www.ncbi.nlm.nih.gov/pubmed/34215183 http://dx.doi.org/10.1186/s12859-021-04266-6 |
_version_ | 1783717695685918720 |
---|---|
author | Chen, Xue-Jun Hua, Xin-Yun Jiang, Zhen-Ran |
author_facet | Chen, Xue-Jun Hua, Xin-Yun Jiang, Zhen-Ran |
author_sort | Chen, Xue-Jun |
collection | PubMed |
description | BACKGROUND: A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional research. RESULTS: Inspired by the work of predecessors, we discover that the noise hiding in the data can affect the prediction performance and then propose an anti-noise algorithm (ANMDA) to predict potential miRNA-disease associations. Firstly, we calculate the similarity in miRNAs and diseases to construct features and obtain positive samples according to the Human MicroRNA Disease Database version 2.0 (HMDD v2.0). Then, we apply k-means on the undetected miRNA-disease associations and sample the negative examples equally from the k-cluster. Further, we construct several data subsets through sampling with replacement to feed on the light gradient boosting machine (LightGBM) method. Finally, the voting method is applied to predict potential miRNA-disease relationships. As a result, ANMDA can achieve an area under the receiver operating characteristic curve (AUROC) of 0.9373 ± 0.0005 in five-fold cross-validation, which is superior to several published methods. In addition, we analyze the predicted miRNA-disease associations with high probability and compare them with the data in HMDD v3.0 in the case study. The results show ANMDA is a novel and practical algorithm that can be used to infer potential miRNA-disease associations. CONCLUSION: The results indicate the noise hiding in the data has an obvious impact on predicting potential miRNA-disease associations. We believe ANMDA can achieve better results from this task with more methods used in dealing with the data noise. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04266-6. |
format | Online Article Text |
id | pubmed-8254275 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-82542752021-07-06 ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations Chen, Xue-Jun Hua, Xin-Yun Jiang, Zhen-Ran BMC Bioinformatics Research Article BACKGROUND: A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional research. RESULTS: Inspired by the work of predecessors, we discover that the noise hiding in the data can affect the prediction performance and then propose an anti-noise algorithm (ANMDA) to predict potential miRNA-disease associations. Firstly, we calculate the similarity in miRNAs and diseases to construct features and obtain positive samples according to the Human MicroRNA Disease Database version 2.0 (HMDD v2.0). Then, we apply k-means on the undetected miRNA-disease associations and sample the negative examples equally from the k-cluster. Further, we construct several data subsets through sampling with replacement to feed on the light gradient boosting machine (LightGBM) method. Finally, the voting method is applied to predict potential miRNA-disease relationships. As a result, ANMDA can achieve an area under the receiver operating characteristic curve (AUROC) of 0.9373 ± 0.0005 in five-fold cross-validation, which is superior to several published methods. In addition, we analyze the predicted miRNA-disease associations with high probability and compare them with the data in HMDD v3.0 in the case study. The results show ANMDA is a novel and practical algorithm that can be used to infer potential miRNA-disease associations. CONCLUSION: The results indicate the noise hiding in the data has an obvious impact on predicting potential miRNA-disease associations. We believe ANMDA can achieve better results from this task with more methods used in dealing with the data noise. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04266-6. BioMed Central 2021-07-02 /pmc/articles/PMC8254275/ /pubmed/34215183 http://dx.doi.org/10.1186/s12859-021-04266-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Chen, Xue-Jun Hua, Xin-Yun Jiang, Zhen-Ran ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations |
title | ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations |
title_full | ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations |
title_fullStr | ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations |
title_full_unstemmed | ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations |
title_short | ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations |
title_sort | anmda: anti-noise based computational model for predicting potential mirna-disease associations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254275/ https://www.ncbi.nlm.nih.gov/pubmed/34215183 http://dx.doi.org/10.1186/s12859-021-04266-6 |
work_keys_str_mv | AT chenxuejun anmdaantinoisebasedcomputationalmodelforpredictingpotentialmirnadiseaseassociations AT huaxinyun anmdaantinoisebasedcomputationalmodelforpredictingpotentialmirnadiseaseassociations AT jiangzhenran anmdaantinoisebasedcomputationalmodelforpredictingpotentialmirnadiseaseassociations |