Cargando…
Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta
INTRODUCTION: In gynecologic oncology, ovarian cancer is a great clinical challenge. Because of the lack of typical symptoms and effective biomarkers for noninvasive screening, most patients develop advanced-stage ovarian cancer by the time of diagnosis. MicroRNAs (miRNAs) are a type of non-coding R...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445490/ https://www.ncbi.nlm.nih.gov/pubmed/37621964 http://dx.doi.org/10.3389/fdgth.2023.1187578 |
_version_ | 1785094182372638720 |
---|---|
author | Hamidi, Farzaneh Gilani, Neda Arabi Belaghi, Reza Yaghoobi, Hanif Babaei, Esmaeil Sarbakhsh, Parvin Malakouti, Jamileh |
author_facet | Hamidi, Farzaneh Gilani, Neda Arabi Belaghi, Reza Yaghoobi, Hanif Babaei, Esmaeil Sarbakhsh, Parvin Malakouti, Jamileh |
author_sort | Hamidi, Farzaneh |
collection | PubMed |
description | INTRODUCTION: In gynecologic oncology, ovarian cancer is a great clinical challenge. Because of the lack of typical symptoms and effective biomarkers for noninvasive screening, most patients develop advanced-stage ovarian cancer by the time of diagnosis. MicroRNAs (miRNAs) are a type of non-coding RNA molecule that has been linked to human cancers. Specifying diagnostic biomarkers to determine non-cancer and cancer samples is difficult. METHODS: By using Boruta, a novel random forest-based feature selection in the machine-learning techniques, we aimed to identify biomarkers associated with ovarian cancer using cancerous and non-cancer samples from the Gene Expression Omnibus (GEO) database: GSE106817. In this study, we used two independent GEO data sets as external validation, including GSE113486 and GSE113740. We utilized five state-of-the-art machine-learning algorithms for classification: logistic regression, random forest, decision trees, artificial neural networks, and XGBoost. RESULTS: Four models discovered in GSE113486 had an AUC of 100%, three in GSE113740 with AUC of over 94%, and four in GSE113486 with AUC of over 94%. We identified 10 miRNAs to distinguish ovarian cancer cases from normal controls: hsa-miR-1290, hsa-miR-1233-5p, hsa-miR-1914-5p, hsa-miR-1469, hsa-miR-4675, hsa-miR-1228-5p, hsa-miR-3184-5p, hsa-miR-6784-5p, hsa-miR-6800-5p, and hsa-miR-5100. Our findings suggest that miRNAs could be used as possible biomarkers for ovarian cancer screening, for possible intervention. |
format | Online Article Text |
id | pubmed-10445490 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-104454902023-08-24 Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta Hamidi, Farzaneh Gilani, Neda Arabi Belaghi, Reza Yaghoobi, Hanif Babaei, Esmaeil Sarbakhsh, Parvin Malakouti, Jamileh Front Digit Health Digital Health INTRODUCTION: In gynecologic oncology, ovarian cancer is a great clinical challenge. Because of the lack of typical symptoms and effective biomarkers for noninvasive screening, most patients develop advanced-stage ovarian cancer by the time of diagnosis. MicroRNAs (miRNAs) are a type of non-coding RNA molecule that has been linked to human cancers. Specifying diagnostic biomarkers to determine non-cancer and cancer samples is difficult. METHODS: By using Boruta, a novel random forest-based feature selection in the machine-learning techniques, we aimed to identify biomarkers associated with ovarian cancer using cancerous and non-cancer samples from the Gene Expression Omnibus (GEO) database: GSE106817. In this study, we used two independent GEO data sets as external validation, including GSE113486 and GSE113740. We utilized five state-of-the-art machine-learning algorithms for classification: logistic regression, random forest, decision trees, artificial neural networks, and XGBoost. RESULTS: Four models discovered in GSE113486 had an AUC of 100%, three in GSE113740 with AUC of over 94%, and four in GSE113486 with AUC of over 94%. We identified 10 miRNAs to distinguish ovarian cancer cases from normal controls: hsa-miR-1290, hsa-miR-1233-5p, hsa-miR-1914-5p, hsa-miR-1469, hsa-miR-4675, hsa-miR-1228-5p, hsa-miR-3184-5p, hsa-miR-6784-5p, hsa-miR-6800-5p, and hsa-miR-5100. Our findings suggest that miRNAs could be used as possible biomarkers for ovarian cancer screening, for possible intervention. Frontiers Media S.A. 2023-08-09 /pmc/articles/PMC10445490/ /pubmed/37621964 http://dx.doi.org/10.3389/fdgth.2023.1187578 Text en © 2023 Hamidi, Gilani, Arabi Belaghi, Yaghoobi, Babaei, Sarbakhsh and Malakouti. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Digital Health Hamidi, Farzaneh Gilani, Neda Arabi Belaghi, Reza Yaghoobi, Hanif Babaei, Esmaeil Sarbakhsh, Parvin Malakouti, Jamileh Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta |
title | Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta |
title_full | Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta |
title_fullStr | Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta |
title_full_unstemmed | Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta |
title_short | Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta |
title_sort | identifying potential circulating mirna biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of boruta |
topic | Digital Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445490/ https://www.ncbi.nlm.nih.gov/pubmed/37621964 http://dx.doi.org/10.3389/fdgth.2023.1187578 |
work_keys_str_mv | AT hamidifarzaneh identifyingpotentialcirculatingmirnabiomarkersforthediagnosisandpredictionofovariancancerusingmachinelearningapproachapplicationofboruta AT gilanineda identifyingpotentialcirculatingmirnabiomarkersforthediagnosisandpredictionofovariancancerusingmachinelearningapproachapplicationofboruta AT arabibelaghireza identifyingpotentialcirculatingmirnabiomarkersforthediagnosisandpredictionofovariancancerusingmachinelearningapproachapplicationofboruta AT yaghoobihanif identifyingpotentialcirculatingmirnabiomarkersforthediagnosisandpredictionofovariancancerusingmachinelearningapproachapplicationofboruta AT babaeiesmaeil identifyingpotentialcirculatingmirnabiomarkersforthediagnosisandpredictionofovariancancerusingmachinelearningapproachapplicationofboruta AT sarbakhshparvin identifyingpotentialcirculatingmirnabiomarkersforthediagnosisandpredictionofovariancancerusingmachinelearningapproachapplicationofboruta AT malakoutijamileh identifyingpotentialcirculatingmirnabiomarkersforthediagnosisandpredictionofovariancancerusingmachinelearningapproachapplicationofboruta |