Cargando…

XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer

BACKGROUND: For a long time, breast cancer has been a leading cancer diagnosed in women worldwide, and approximately 90% of cancer-related deaths are caused by metastasis. For this reason, finding new biomarkers related to metastasis is an urgent task to predict the metastatic status of breast cance...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Qingqing, Yang, Hui, Wang, Peipei, Liu, Xiaocen, Lv, Kun, Ye, Mingquan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014628/
https://www.ncbi.nlm.nih.gov/pubmed/35436939
http://dx.doi.org/10.1186/s12967-022-03369-9
_version_ 1784688223379783680
author Li, Qingqing
Yang, Hui
Wang, Peipei
Liu, Xiaocen
Lv, Kun
Ye, Mingquan
author_facet Li, Qingqing
Yang, Hui
Wang, Peipei
Liu, Xiaocen
Lv, Kun
Ye, Mingquan
author_sort Li, Qingqing
collection PubMed
description BACKGROUND: For a long time, breast cancer has been a leading cancer diagnosed in women worldwide, and approximately 90% of cancer-related deaths are caused by metastasis. For this reason, finding new biomarkers related to metastasis is an urgent task to predict the metastatic status of breast cancer and provide new therapeutic targets. METHODS: In this research, an efficient model of eXtreme Gradient Boosting (XGBoost) optimized by a grid search algorithm is established to realize auxiliary identification of metastatic breast tumors based on gene expression. Estimated by ten-fold cross-validation, the optimized XGBoost classifier can achieve an overall higher mean AUC of 0.82 compared to other classifiers such as DT, SVM, KNN, LR, and RF. RESULTS: A novel 6-gene signature (SQSTM1, GDF9, LINC01125, PTGS2, GVINP1, and TMEM64) was selected by feature importance ranking and a series of in vitro experiments were conducted to verify the potential role of each biomarker. In general, the effects of SQSTM in tumor cells are assigned as a risk factor, while the effects of the other 5 genes (GDF9, LINC01125, PTGS2, GVINP1, and TMEM64) in immune cells are assigned as protective factors. CONCLUSIONS: Our findings will allow for a more accurate prediction of the metastatic status of breast cancer and will benefit the mining of breast cancer metastasis-related biomarkers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-022-03369-9.
format Online
Article
Text
id pubmed-9014628
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90146282022-04-19 XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer Li, Qingqing Yang, Hui Wang, Peipei Liu, Xiaocen Lv, Kun Ye, Mingquan J Transl Med Research BACKGROUND: For a long time, breast cancer has been a leading cancer diagnosed in women worldwide, and approximately 90% of cancer-related deaths are caused by metastasis. For this reason, finding new biomarkers related to metastasis is an urgent task to predict the metastatic status of breast cancer and provide new therapeutic targets. METHODS: In this research, an efficient model of eXtreme Gradient Boosting (XGBoost) optimized by a grid search algorithm is established to realize auxiliary identification of metastatic breast tumors based on gene expression. Estimated by ten-fold cross-validation, the optimized XGBoost classifier can achieve an overall higher mean AUC of 0.82 compared to other classifiers such as DT, SVM, KNN, LR, and RF. RESULTS: A novel 6-gene signature (SQSTM1, GDF9, LINC01125, PTGS2, GVINP1, and TMEM64) was selected by feature importance ranking and a series of in vitro experiments were conducted to verify the potential role of each biomarker. In general, the effects of SQSTM in tumor cells are assigned as a risk factor, while the effects of the other 5 genes (GDF9, LINC01125, PTGS2, GVINP1, and TMEM64) in immune cells are assigned as protective factors. CONCLUSIONS: Our findings will allow for a more accurate prediction of the metastatic status of breast cancer and will benefit the mining of breast cancer metastasis-related biomarkers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-022-03369-9. BioMed Central 2022-04-18 /pmc/articles/PMC9014628/ /pubmed/35436939 http://dx.doi.org/10.1186/s12967-022-03369-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Li, Qingqing
Yang, Hui
Wang, Peipei
Liu, Xiaocen
Lv, Kun
Ye, Mingquan
XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer
title XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer
title_full XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer
title_fullStr XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer
title_full_unstemmed XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer
title_short XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer
title_sort xgboost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014628/
https://www.ncbi.nlm.nih.gov/pubmed/35436939
http://dx.doi.org/10.1186/s12967-022-03369-9
work_keys_str_mv AT liqingqing xgboostbasedandtumorimmunecharacterizedgenesignatureforthepredictionofmetastaticstatusinbreastcancer
AT yanghui xgboostbasedandtumorimmunecharacterizedgenesignatureforthepredictionofmetastaticstatusinbreastcancer
AT wangpeipei xgboostbasedandtumorimmunecharacterizedgenesignatureforthepredictionofmetastaticstatusinbreastcancer
AT liuxiaocen xgboostbasedandtumorimmunecharacterizedgenesignatureforthepredictionofmetastaticstatusinbreastcancer
AT lvkun xgboostbasedandtumorimmunecharacterizedgenesignatureforthepredictionofmetastaticstatusinbreastcancer
AT yemingquan xgboostbasedandtumorimmunecharacterizedgenesignatureforthepredictionofmetastaticstatusinbreastcancer