Cargando…

Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data

Artificial intelligence in healthcare can potentially identify the probability of contracting a particular disease more accurately. There are five common molecular subtypes of breast cancer: luminal A, luminal B, basal, ERBB2, and normal-like. Previous investigations showed that pathway-based microa...

Descripción completa

Detalles Bibliográficos
Autores principales: Nies, Hui Wen, Mohamad, Mohd Saberi, Zakaria, Zalmiyah, Chan, Weng Howe, Remli, Muhammad Akmal, Nies, Yong Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8472068/
https://www.ncbi.nlm.nih.gov/pubmed/34573857
http://dx.doi.org/10.3390/e23091232
_version_ 1784574633279750144
author Nies, Hui Wen
Mohamad, Mohd Saberi
Zakaria, Zalmiyah
Chan, Weng Howe
Remli, Muhammad Akmal
Nies, Yong Hui
author_facet Nies, Hui Wen
Mohamad, Mohd Saberi
Zakaria, Zalmiyah
Chan, Weng Howe
Remli, Muhammad Akmal
Nies, Yong Hui
author_sort Nies, Hui Wen
collection PubMed
description Artificial intelligence in healthcare can potentially identify the probability of contracting a particular disease more accurately. There are five common molecular subtypes of breast cancer: luminal A, luminal B, basal, ERBB2, and normal-like. Previous investigations showed that pathway-based microarray analysis could help in the identification of prognostic markers from gene expressions. For example, directed random walk (DRW) can infer a greater reproducibility power of the pathway activity between two classes of samples with a higher classification accuracy. However, most of the existing methods (including DRW) ignored the characteristics of different cancer subtypes and considered all of the pathways to contribute equally to the analysis. Therefore, an enhanced DRW (eDRW+) is proposed to identify breast cancer prognostic markers from multiclass expression data. An improved weight strategy using one-way ANOVA (F-test) and pathway selection based on the greatest reproducibility power is proposed in eDRW+. The experimental results show that the eDRW+ exceeds other methods in terms of AUC. Besides this, the eDRW+ identifies 294 gene markers and 45 pathway markers from the breast cancer datasets with better AUC. Therefore, the prognostic markers (pathway markers and gene markers) can identify drug targets and look for cancer subtypes with clinically distinct outcomes.
format Online
Article
Text
id pubmed-8472068
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84720682021-09-28 Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data Nies, Hui Wen Mohamad, Mohd Saberi Zakaria, Zalmiyah Chan, Weng Howe Remli, Muhammad Akmal Nies, Yong Hui Entropy (Basel) Article Artificial intelligence in healthcare can potentially identify the probability of contracting a particular disease more accurately. There are five common molecular subtypes of breast cancer: luminal A, luminal B, basal, ERBB2, and normal-like. Previous investigations showed that pathway-based microarray analysis could help in the identification of prognostic markers from gene expressions. For example, directed random walk (DRW) can infer a greater reproducibility power of the pathway activity between two classes of samples with a higher classification accuracy. However, most of the existing methods (including DRW) ignored the characteristics of different cancer subtypes and considered all of the pathways to contribute equally to the analysis. Therefore, an enhanced DRW (eDRW+) is proposed to identify breast cancer prognostic markers from multiclass expression data. An improved weight strategy using one-way ANOVA (F-test) and pathway selection based on the greatest reproducibility power is proposed in eDRW+. The experimental results show that the eDRW+ exceeds other methods in terms of AUC. Besides this, the eDRW+ identifies 294 gene markers and 45 pathway markers from the breast cancer datasets with better AUC. Therefore, the prognostic markers (pathway markers and gene markers) can identify drug targets and look for cancer subtypes with clinically distinct outcomes. MDPI 2021-09-20 /pmc/articles/PMC8472068/ /pubmed/34573857 http://dx.doi.org/10.3390/e23091232 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nies, Hui Wen
Mohamad, Mohd Saberi
Zakaria, Zalmiyah
Chan, Weng Howe
Remli, Muhammad Akmal
Nies, Yong Hui
Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data
title Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data
title_full Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data
title_fullStr Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data
title_full_unstemmed Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data
title_short Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data
title_sort enhanced directed random walk for the identification of breast cancer prognostic markers from multiclass expression data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8472068/
https://www.ncbi.nlm.nih.gov/pubmed/34573857
http://dx.doi.org/10.3390/e23091232
work_keys_str_mv AT nieshuiwen enhanceddirectedrandomwalkfortheidentificationofbreastcancerprognosticmarkersfrommulticlassexpressiondata
AT mohamadmohdsaberi enhanceddirectedrandomwalkfortheidentificationofbreastcancerprognosticmarkersfrommulticlassexpressiondata
AT zakariazalmiyah enhanceddirectedrandomwalkfortheidentificationofbreastcancerprognosticmarkersfrommulticlassexpressiondata
AT chanwenghowe enhanceddirectedrandomwalkfortheidentificationofbreastcancerprognosticmarkersfrommulticlassexpressiondata
AT remlimuhammadakmal enhanceddirectedrandomwalkfortheidentificationofbreastcancerprognosticmarkersfrommulticlassexpressiondata
AT niesyonghui enhanceddirectedrandomwalkfortheidentificationofbreastcancerprognosticmarkersfrommulticlassexpressiondata