Cargando…

Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data

Accurately identifying classification biomarkers for distinguishing between normal and cancer samples is challenging. Additionally, the reproducibility of single-molecule biomarkers is limited by the existence of heterogeneous patient subgroups and differences in the sequencing techniques used to co...

Descripción completa

Detalles Bibliográficos
Autores principales: Ning, Ziyu, Yu, Shuang, Zhao, Yanqiao, Sun, Xiaoming, Wu, Haibin, Yu, Xiaoyang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024646/
https://www.ncbi.nlm.nih.gov/pubmed/33841512
http://dx.doi.org/10.3389/fgene.2021.656526
_version_ 1783675352446402560
author Ning, Ziyu
Yu, Shuang
Zhao, Yanqiao
Sun, Xiaoming
Wu, Haibin
Yu, Xiaoyang
author_facet Ning, Ziyu
Yu, Shuang
Zhao, Yanqiao
Sun, Xiaoming
Wu, Haibin
Yu, Xiaoyang
author_sort Ning, Ziyu
collection PubMed
description Accurately identifying classification biomarkers for distinguishing between normal and cancer samples is challenging. Additionally, the reproducibility of single-molecule biomarkers is limited by the existence of heterogeneous patient subgroups and differences in the sequencing techniques used to collect patient data. In this study, we developed a method to identify robust biomarkers (i.e., miRNA-mediated subpathways) associated with prostate cancer based on normal prostate samples and cancer samples from a dataset from The Cancer Genome Atlas (TCGA; n = 546) and datasets from the Gene Expression Omnibus (GEO) database (n = 139 and n = 90, with the latter being a cell line dataset). We also obtained 10 other cancer datasets to evaluate the performance of the method. We propose a multi-omics data integration strategy for identifying classification biomarkers using a machine learning method that involves reassigning topological weights to the genes using a directed random walk (DRW)-based method. A global directed pathway network (GDPN) was constructed based on the significantly differentially expressed target genes of the significantly differentially expressed miRNAs, which allowed us to identify the robust biomarkers in the form of miRNA-mediated subpathways (miRNAs). The activity value of each miRNA-mediated subpathway was calculated by integrating multiple types of data, which included the expression of the miRNA and the miRNAs’ target genes and GDPN topological information. Finally, we identified the high-frequency miRNA-mediated subpathways involved in prostate cancer using a support vector machine (SVM) model. The results demonstrated that we obtained robust biomarkers of prostate cancer, which could classify prostate cancer and normal samples. Our method outperformed seven other methods, and many of the identified biomarkers were associated with known clinical treatments.
format Online
Article
Text
id pubmed-8024646
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80246462021-04-08 Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data Ning, Ziyu Yu, Shuang Zhao, Yanqiao Sun, Xiaoming Wu, Haibin Yu, Xiaoyang Front Genet Genetics Accurately identifying classification biomarkers for distinguishing between normal and cancer samples is challenging. Additionally, the reproducibility of single-molecule biomarkers is limited by the existence of heterogeneous patient subgroups and differences in the sequencing techniques used to collect patient data. In this study, we developed a method to identify robust biomarkers (i.e., miRNA-mediated subpathways) associated with prostate cancer based on normal prostate samples and cancer samples from a dataset from The Cancer Genome Atlas (TCGA; n = 546) and datasets from the Gene Expression Omnibus (GEO) database (n = 139 and n = 90, with the latter being a cell line dataset). We also obtained 10 other cancer datasets to evaluate the performance of the method. We propose a multi-omics data integration strategy for identifying classification biomarkers using a machine learning method that involves reassigning topological weights to the genes using a directed random walk (DRW)-based method. A global directed pathway network (GDPN) was constructed based on the significantly differentially expressed target genes of the significantly differentially expressed miRNAs, which allowed us to identify the robust biomarkers in the form of miRNA-mediated subpathways (miRNAs). The activity value of each miRNA-mediated subpathway was calculated by integrating multiple types of data, which included the expression of the miRNA and the miRNAs’ target genes and GDPN topological information. Finally, we identified the high-frequency miRNA-mediated subpathways involved in prostate cancer using a support vector machine (SVM) model. The results demonstrated that we obtained robust biomarkers of prostate cancer, which could classify prostate cancer and normal samples. Our method outperformed seven other methods, and many of the identified biomarkers were associated with known clinical treatments. Frontiers Media S.A. 2021-03-24 /pmc/articles/PMC8024646/ /pubmed/33841512 http://dx.doi.org/10.3389/fgene.2021.656526 Text en Copyright © 2021 Ning, Yu, Zhao, Sun, Wu and Yu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ning, Ziyu
Yu, Shuang
Zhao, Yanqiao
Sun, Xiaoming
Wu, Haibin
Yu, Xiaoyang
Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data
title Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data
title_full Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data
title_fullStr Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data
title_full_unstemmed Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data
title_short Identification of miRNA-Mediated Subpathways as Prostate Cancer Biomarkers Based on Topological Inference in a Machine Learning Process Using Integrated Gene and miRNA Expression Data
title_sort identification of mirna-mediated subpathways as prostate cancer biomarkers based on topological inference in a machine learning process using integrated gene and mirna expression data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024646/
https://www.ncbi.nlm.nih.gov/pubmed/33841512
http://dx.doi.org/10.3389/fgene.2021.656526
work_keys_str_mv AT ningziyu identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata
AT yushuang identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata
AT zhaoyanqiao identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata
AT sunxiaoming identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata
AT wuhaibin identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata
AT yuxiaoyang identificationofmirnamediatedsubpathwaysasprostatecancerbiomarkersbasedontopologicalinferenceinamachinelearningprocessusingintegratedgeneandmirnaexpressiondata