Cargando…
Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data
Accurately identifying classification biomarkers for distinguishing between normal and cancer samples is challenging. Additionally, the reproducibility of single-molecule biomarkers is limited by the existence of heterogeneous patient subgroups and differences in the sequencing techniques used to co...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024646/ https://www.ncbi.nlm.nih.gov/pubmed/33841512 http://dx.doi.org/10.3389/fgene.2021.656526 |
_version_ | 1783675352446402560 |
---|---|
author | Ning, Ziyu Yu, Shuang Zhao, Yanqiao Sun, Xiaoming Wu, Haibin Yu, Xiaoyang |
author_facet | Ning, Ziyu Yu, Shuang Zhao, Yanqiao Sun, Xiaoming Wu, Haibin Yu, Xiaoyang |
author_sort | Ning, Ziyu |
collection | PubMed |
description | Accurately identifying classification biomarkers for distinguishing between normal and cancer samples is challenging. Additionally, the reproducibility of single-molecule biomarkers is limited by the existence of heterogeneous patient subgroups and differences in the sequencing techniques used to collect patient data. In this study, we developed a method to identify robust biomarkers (i.e., miRNA-mediated subpathways) associated with prostate cancer based on normal prostate samples and cancer samples from a dataset from The Cancer Genome Atlas (TCGA; n = 546) and datasets from the Gene Expression Omnibus (GEO) database (n = 139 and n = 90, with the latter being a cell line dataset). We also obtained 10 other cancer datasets to evaluate the performance of the method. We propose a multi-omics data integration strategy for identifying classification biomarkers using a machine learning method that involves reassigning topological weights to the genes using a directed random walk (DRW)-based method. A global directed pathway network (GDPN) was constructed based on the significantly differentially expressed target genes of the significantly differentially expressed miRNAs, which allowed us to identify the robust biomarkers in the form of miRNA-mediated subpathways (miRNAs). The activity value of each miRNA-mediated subpathway was calculated by integrating multiple types of data, which included the expression of the miRNA and the miRNAs’ target genes and GDPN topological information. Finally, we identified the high-frequency miRNA-mediated subpathways involved in prostate cancer using a support vector machine (SVM) model. The results demonstrated that we obtained robust biomarkers of prostate cancer, which could classify prostate cancer and normal samples. Our method outperformed seven other methods, and many of the identified biomarkers were associated with known clinical treatments. |
format | Online Article Text |
id | pubmed-8024646 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80246462021-04-08 Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data Ning, Ziyu Yu, Shuang Zhao, Yanqiao Sun, Xiaoming Wu, Haibin Yu, Xiaoyang Front Genet Genetics Accurately identifying classification biomarkers for distinguishing between normal and cancer samples is challenging. Additionally, the reproducibility of single-molecule biomarkers is limited by the existence of heterogeneous patient subgroups and differences in the sequencing techniques used to collect patient data. In this study, we developed a method to identify robust biomarkers (i.e., miRNA-mediated subpathways) associated with prostate cancer based on normal prostate samples and cancer samples from a dataset from The Cancer Genome Atlas (TCGA; n = 546) and datasets from the Gene Expression Omnibus (GEO) database (n = 139 and n = 90, with the latter being a cell line dataset). We also obtained 10 other cancer datasets to evaluate the performance of the method. We propose a multi-omics data integration strategy for identifying classification biomarkers using a machine learning method that involves reassigning topological weights to the genes using a directed random walk (DRW)-based method. A global directed pathway network (GDPN) was constructed based on the significantly differentially expressed target genes of the significantly differentially expressed miRNAs, which allowed us to identify the robust biomarkers in the form of miRNA-mediated subpathways (miRNAs). The activity value of each miRNA-mediated subpathway was calculated by integrating multiple types of data, which included the expression of the miRNA and the miRNAs’ target genes and GDPN topological information. Finally, we identified the high-frequency miRNA-mediated subpathways involved in prostate cancer using a support vector machine (SVM) model. The results demonstrated that we obtained robust biomarkers of prostate cancer, which could classify prostate cancer and normal samples. Our method outperformed seven other methods, and many of the identified biomarkers were associated with known clinical treatments. Frontiers Media S.A. 2021-03-24 /pmc/articles/PMC8024646/ /pubmed/33841512 http://dx.doi.org/10.3389/fgene.2021.656526 Text en Copyright © 2021 Ning, Yu, Zhao, Sun, Wu and Yu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Ning, Ziyu Yu, Shuang Zhao, Yanqiao Sun, Xiaoming Wu, Haibin Yu, Xiaoyang Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data |
title | Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data |
title_full | Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data |
title_fullStr | Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data |
title_full_unstemmed | Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data |
title_short | Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data |
title_sort | identification of mirna-mediated subpathways as prostate cancer biomarkers based on topological inference in a machine learning process using integrated gene and mirna expression data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024646/ https://www.ncbi.nlm.nih.gov/pubmed/33841512 http://dx.doi.org/10.3389/fgene.2021.656526 |
work_keys_str_mv | AT ningziyu identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata AT yushuang identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata AT zhaoyanqiao identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata AT sunxiaoming identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata AT wuhaibin identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata AT yuxiaoyang identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata |