Application of statistical machine learning in biomarker selection
In the recent JAVELIN Bladder 100 phase 3 trial, avelumab plus best supportive care significantly prolonged overall survival relative to best supportive care alone as first-line maintenance therapy following first-line platinum-based chemotherapy in patients with advanced urothelial cancer (aUC). Di...
Main authors: | Vashistha, Ritwik; Noor, Zubdahe; Dasgupta, Shibasish; Pu, Jie; Deng, Shibing |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Nature Publishing Group UK, 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10603146/ https://www.ncbi.nlm.nih.gov/pubmed/37884606 http://dx.doi.org/10.1038/s41598-023-45323-9 |
_version_ | 1785126542087553024 |
---|---|
author | Vashistha, Ritwik; Noor, Zubdahe; Dasgupta, Shibasish; Pu, Jie; Deng, Shibing |
author_facet | Vashistha, Ritwik; Noor, Zubdahe; Dasgupta, Shibasish; Pu, Jie; Deng, Shibing |
author_sort | Vashistha, Ritwik |
collection | PubMed |
description | In the recent JAVELIN Bladder 100 phase 3 trial, avelumab plus best supportive care significantly prolonged overall survival relative to best supportive care alone as first-line maintenance therapy following first-line platinum-based chemotherapy in patients with advanced urothelial cancer (aUC). Discovering biomarkers using genomic profiling to understand potential patient heterogeneity is essential to help improve patient care with precision medicine. For the JAVELIN Bladder 100 trial, it is unclear which variable selection methods can most reliably identify biomarkers to inform patient care because the dataset is characterized by high collinearity and low signal. The aim of this paper was to evaluate available selection methods and their ability to discover prognostic and predictive biomarkers in patients with aUC receiving first-line maintenance therapy. A simulation study evaluated the performance of popular variable selection approaches for high-dimensional data, including penalized regression models, random survival forests, and Bayesian variable selection methods. For Bayesian variable selection methods, a modified Bayesian Information Criterion (BIC) thresholding rule was proposed in addition to the traditional BIC thresholding rule. These methods were applied to the JAVELIN Bladder 100 dataset to investigate potential biomarkers associated with survival benefit. Results from the simulations demonstrated the strengths and limitations of the different methods. The variable selection methods demonstrated low false discovery rates under different conditions. However, their performance declined in the presence of high collinearity. Using the JAVELIN Bladder 100 data, we identified some potentially significant biomarkers across multiple models. Several lasso-related methods were able to identify potentially biologically meaningful variables in the trial. Some variable selection methods (such as stochastic search variable selection and random survival forest) may not be well suited to this type of data due to the presence of extreme collinearity and low signal. Future research should explore novel variable selection methods that may be more suitable for identifying prognostic and predictive biomarkers in this population. Trial registration: ClinicalTrials.gov Identifier: NCT02603432. |
format | Online Article Text |
id | pubmed-10603146 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-10603146 2023-10-28 Application of statistical machine learning in biomarker selection Vashistha, Ritwik Noor, Zubdahe Dasgupta, Shibasish Pu, Jie Deng, Shibing Sci Rep Article Nature Publishing Group UK 2023-10-26 /pmc/articles/PMC10603146/ /pubmed/37884606 http://dx.doi.org/10.1038/s41598-023-45323-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Vashistha, Ritwik Noor, Zubdahe Dasgupta, Shibasish Pu, Jie Deng, Shibing Application of statistical machine learning in biomarker selection |
title | Application of statistical machine learning in biomarker selection |
title_full | Application of statistical machine learning in biomarker selection |
title_fullStr | Application of statistical machine learning in biomarker selection |
title_full_unstemmed | Application of statistical machine learning in biomarker selection |
title_short | Application of statistical machine learning in biomarker selection |
title_sort | application of statistical machine learning in biomarker selection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10603146/ https://www.ncbi.nlm.nih.gov/pubmed/37884606 http://dx.doi.org/10.1038/s41598-023-45323-9 |
work_keys_str_mv | AT vashistharitwik applicationofstatisticalmachinelearninginbiomarkerselection AT noorzubdahe applicationofstatisticalmachinelearninginbiomarkerselection AT dasguptashibasish applicationofstatisticalmachinelearninginbiomarkerselection AT pujie applicationofstatisticalmachinelearninginbiomarkerselection AT dengshibing applicationofstatisticalmachinelearninginbiomarkerselection |