Cargando…

Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature

BACKGROUND: Determining the prognosis of hormone receptor positive (HR(+)) breast cancer (BC), which accounts for 80% of all BCs, is critical in improving survival outcomes. Stratifying individuals at high risk of BC-related mortality and improving prognosis has been the focus of research for over a...

Descripción completa

Detalles Bibliográficos
Autores principales: Takeshita, Takashi, Iwase, Hirotaka, Wu, Rongrong, Ziazadeh, Danya, Yan, Li, Takabe, Kazuaki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elmer Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588506/
https://www.ncbi.nlm.nih.gov/pubmed/37869243
http://dx.doi.org/10.14740/wjon1700
_version_ 1785123598349893632
author Takeshita, Takashi
Iwase, Hirotaka
Wu, Rongrong
Ziazadeh, Danya
Yan, Li
Takabe, Kazuaki
author_facet Takeshita, Takashi
Iwase, Hirotaka
Wu, Rongrong
Ziazadeh, Danya
Yan, Li
Takabe, Kazuaki
author_sort Takeshita, Takashi
collection PubMed
description BACKGROUND: Determining the prognosis of hormone receptor positive (HR(+)) breast cancer (BC), which accounts for 80% of all BCs, is critical in improving survival outcomes. Stratifying individuals at high risk of BC-related mortality and improving prognosis has been the focus of research for over a decade. However, these tools are not universal as they are limited to clinical factors. We hypothesized that a new framework for predicting prognosis in HR(+) BC patients can develop using artificial intelligence. METHODS: A total of 2,338 HR(+) human epidermal growth factor receptor 2 negative (HER2(-)) BC cases were analyzed from Molecular Taxonomy of Breast Cancer International Consortium (METABRIC), The Cancer Genome Atlas (TCGA), and Gene Expression Omnibus (GEO) cohorts. Groups were then divided into high- and low-risk categories utilizing a recurrence prediction model (RPM). An RPM was created by extracting nine prognosis-related genes from over 18,000 genes using a logistic progression model. RESULTS: Risk classification by RPM was significantly stratified in both the discovery cohort and validation cohort. In the time-dependent area under the curve analysis, there was some variation depending on the cohort, but accuracy was found to decline significantly after about 10 years. Cell cycle related gene sets, MYC, and PI3K-AKT-mTOR signaling were enriched in high-risk tumors by the Gene Set Enrichment Analysis. High-risk tumors were associated with high levels of immune cells from the lymphoid and myeloid lineage and immune cytolytic activity, as well as low levels of stem cells and stromal cells. High-risk tumors were also associated with poor therapeutic effects of chemotherapy and endocrine therapy. CONCLUSIONS: This model was able to stratify prognosis in multiple cohorts. This is because the model reflects major BC therapeutic target pathways and tumor immune microenvironment and, further is supported by the therapeutic effect of chemotherapy and endocrine therapy.
format Online
Article
Text
id pubmed-10588506
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elmer Press
record_format MEDLINE/PubMed
spelling pubmed-105885062023-10-21 Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature Takeshita, Takashi Iwase, Hirotaka Wu, Rongrong Ziazadeh, Danya Yan, Li Takabe, Kazuaki World J Oncol Original Article BACKGROUND: Determining the prognosis of hormone receptor positive (HR(+)) breast cancer (BC), which accounts for 80% of all BCs, is critical in improving survival outcomes. Stratifying individuals at high risk of BC-related mortality and improving prognosis has been the focus of research for over a decade. However, these tools are not universal as they are limited to clinical factors. We hypothesized that a new framework for predicting prognosis in HR(+) BC patients can develop using artificial intelligence. METHODS: A total of 2,338 HR(+) human epidermal growth factor receptor 2 negative (HER2(-)) BC cases were analyzed from Molecular Taxonomy of Breast Cancer International Consortium (METABRIC), The Cancer Genome Atlas (TCGA), and Gene Expression Omnibus (GEO) cohorts. Groups were then divided into high- and low-risk categories utilizing a recurrence prediction model (RPM). An RPM was created by extracting nine prognosis-related genes from over 18,000 genes using a logistic progression model. RESULTS: Risk classification by RPM was significantly stratified in both the discovery cohort and validation cohort. In the time-dependent area under the curve analysis, there was some variation depending on the cohort, but accuracy was found to decline significantly after about 10 years. Cell cycle related gene sets, MYC, and PI3K-AKT-mTOR signaling were enriched in high-risk tumors by the Gene Set Enrichment Analysis. High-risk tumors were associated with high levels of immune cells from the lymphoid and myeloid lineage and immune cytolytic activity, as well as low levels of stem cells and stromal cells. High-risk tumors were also associated with poor therapeutic effects of chemotherapy and endocrine therapy. CONCLUSIONS: This model was able to stratify prognosis in multiple cohorts. This is because the model reflects major BC therapeutic target pathways and tumor immune microenvironment and, further is supported by the therapeutic effect of chemotherapy and endocrine therapy. Elmer Press 2023-10 2023-09-20 /pmc/articles/PMC10588506/ /pubmed/37869243 http://dx.doi.org/10.14740/wjon1700 Text en Copyright 2023, Takeshita et al. https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution Non-Commercial 4.0 International License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Takeshita, Takashi
Iwase, Hirotaka
Wu, Rongrong
Ziazadeh, Danya
Yan, Li
Takabe, Kazuaki
Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature
title Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature
title_full Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature
title_fullStr Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature
title_full_unstemmed Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature
title_short Development of a Machine Learning-Based Prognostic Model for Hormone Receptor-Positive Breast Cancer Using Nine-Gene Expression Signature
title_sort development of a machine learning-based prognostic model for hormone receptor-positive breast cancer using nine-gene expression signature
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588506/
https://www.ncbi.nlm.nih.gov/pubmed/37869243
http://dx.doi.org/10.14740/wjon1700
work_keys_str_mv AT takeshitatakashi developmentofamachinelearningbasedprognosticmodelforhormonereceptorpositivebreastcancerusingninegeneexpressionsignature
AT iwasehirotaka developmentofamachinelearningbasedprognosticmodelforhormonereceptorpositivebreastcancerusingninegeneexpressionsignature
AT wurongrong developmentofamachinelearningbasedprognosticmodelforhormonereceptorpositivebreastcancerusingninegeneexpressionsignature
AT ziazadehdanya developmentofamachinelearningbasedprognosticmodelforhormonereceptorpositivebreastcancerusingninegeneexpressionsignature
AT yanli developmentofamachinelearningbasedprognosticmodelforhormonereceptorpositivebreastcancerusingninegeneexpressionsignature
AT takabekazuaki developmentofamachinelearningbasedprognosticmodelforhormonereceptorpositivebreastcancerusingninegeneexpressionsignature