Cargando…

A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers

SIMPLE SUMMARY: Studies aimed at assessing the potential prognostic role of gene expression profiles in cancer patients often employ running procedures to select optimal cut-offs for the identification of groups with a poorer outcome. The corresponding hazard ratio (HR) is the most frequently used m...

Descripción completa

Detalles Bibliográficos
Autores principales: Ognibene, Marzia, Pezzolo, Annalisa, Cavanna, Roberto, Cangelosi, Davide, Sorrentino, Stefania, Parodi, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953998/
https://www.ncbi.nlm.nih.gov/pubmed/36831529
http://dx.doi.org/10.3390/cancers15041188
_version_ 1784894017421443072
author Ognibene, Marzia
Pezzolo, Annalisa
Cavanna, Roberto
Cangelosi, Davide
Sorrentino, Stefania
Parodi, Stefano
author_facet Ognibene, Marzia
Pezzolo, Annalisa
Cavanna, Roberto
Cangelosi, Davide
Sorrentino, Stefania
Parodi, Stefano
author_sort Ognibene, Marzia
collection PubMed
description SIMPLE SUMMARY: Studies aimed at assessing the potential prognostic role of gene expression profiles in cancer patients often employ running procedures to select optimal cut-offs for the identification of groups with a poorer outcome. The corresponding hazard ratio (HR) is the most frequently used measure of association between gene expression and patient survival, but it is prone to an overestimation bias. If rare diseases are investigated in the absence of an external cohort, it is difficult to control this lack of accuracy. We propose a simple, test-based method to obtain the HR, adjusted for the overestimation bias. Validation using both simulated data and gene expression profiles from two publicly available data sets is provided. Furthermore, we show that the proposed method is able to identify a new gene with potential oncogenic activity in the reanalysis of a data set including 134 patients affected by Stage 4S neuroblastoma. ABSTRACT: The early evaluation of prognostic tumour markers is commonly performed by comparing the survival of two groups of patients identified on the basis of a cut-off value. The corresponding hazard ratio (HR) is usually estimated, representing a measure of the relative risk between patients with marker values above and below the cut-off. A posteriori methods identifying an optimal cut-off are appropriate when the functional form of the relation between the marker distribution and patient survival is unknown, but they are prone to an overestimation bias. In the presence of a small sample size, which is typical of rare diseases, the external validation sets are hardly available and internal cross-validation could be unfeasible. We describe a new method to obtain an unbiased estimate of the HR at an optimal cut-off, exploiting the simple relation between the HR and the associated p-value estimated by a random permutation analysis. We validate the method on both simulated data and set of gene expression profiles from two large, publicly available data sets. Furthermore, a reanalysis of a previously published study, which included 134 Stage 4S neuroblastoma patients, allowed for the identification of E2F1 as a new gene with potential oncogenic activity. This finding was confirmed by an immunofluorescence analysis on an independent cohort.
format Online
Article
Text
id pubmed-9953998
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99539982023-02-25 A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers Ognibene, Marzia Pezzolo, Annalisa Cavanna, Roberto Cangelosi, Davide Sorrentino, Stefania Parodi, Stefano Cancers (Basel) Article SIMPLE SUMMARY: Studies aimed at assessing the potential prognostic role of gene expression profiles in cancer patients often employ running procedures to select optimal cut-offs for the identification of groups with a poorer outcome. The corresponding hazard ratio (HR) is the most frequently used measure of association between gene expression and patient survival, but it is prone to an overestimation bias. If rare diseases are investigated in the absence of an external cohort, it is difficult to control this lack of accuracy. We propose a simple, test-based method to obtain the HR, adjusted for the overestimation bias. Validation using both simulated data and gene expression profiles from two publicly available data sets is provided. Furthermore, we show that the proposed method is able to identify a new gene with potential oncogenic activity in the reanalysis of a data set including 134 patients affected by Stage 4S neuroblastoma. ABSTRACT: The early evaluation of prognostic tumour markers is commonly performed by comparing the survival of two groups of patients identified on the basis of a cut-off value. The corresponding hazard ratio (HR) is usually estimated, representing a measure of the relative risk between patients with marker values above and below the cut-off. A posteriori methods identifying an optimal cut-off are appropriate when the functional form of the relation between the marker distribution and patient survival is unknown, but they are prone to an overestimation bias. In the presence of a small sample size, which is typical of rare diseases, the external validation sets are hardly available and internal cross-validation could be unfeasible. We describe a new method to obtain an unbiased estimate of the HR at an optimal cut-off, exploiting the simple relation between the HR and the associated p-value estimated by a random permutation analysis. We validate the method on both simulated data and set of gene expression profiles from two large, publicly available data sets. Furthermore, a reanalysis of a previously published study, which included 134 Stage 4S neuroblastoma patients, allowed for the identification of E2F1 as a new gene with potential oncogenic activity. This finding was confirmed by an immunofluorescence analysis on an independent cohort. MDPI 2023-02-13 /pmc/articles/PMC9953998/ /pubmed/36831529 http://dx.doi.org/10.3390/cancers15041188 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ognibene, Marzia
Pezzolo, Annalisa
Cavanna, Roberto
Cangelosi, Davide
Sorrentino, Stefania
Parodi, Stefano
A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers
title A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers
title_full A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers
title_fullStr A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers
title_full_unstemmed A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers
title_short A Simple, Test-Based Method to Control the Overestimation Bias in the Analysis of Potential Prognostic Tumour Markers
title_sort simple, test-based method to control the overestimation bias in the analysis of potential prognostic tumour markers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953998/
https://www.ncbi.nlm.nih.gov/pubmed/36831529
http://dx.doi.org/10.3390/cancers15041188
work_keys_str_mv AT ognibenemarzia asimpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT pezzoloannalisa asimpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT cavannaroberto asimpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT cangelosidavide asimpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT sorrentinostefania asimpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT parodistefano asimpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT ognibenemarzia simpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT pezzoloannalisa simpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT cavannaroberto simpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT cangelosidavide simpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT sorrentinostefania simpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers
AT parodistefano simpletestbasedmethodtocontroltheoverestimationbiasintheanalysisofpotentialprognostictumourmarkers