Cargando…
A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates
BACKGROUND: Construction and validation of a prognostic model for survival data in the clinical domain is still an active field of research. Nevertheless there is no consensus on how to develop routine prognostic tests based on a combination of RT-qPCR biomarkers and clinical or demographic variable...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384357/ https://www.ncbi.nlm.nih.gov/pubmed/25880752 http://dx.doi.org/10.1186/s12859-015-0537-9 |
_version_ | 1782364890738982912 |
---|---|
author | Tournoud, Maud Larue, Audrey Cazalis, Marie-Angelique Venet, Fabienne Pachot, Alexandre Monneret, Guillaume Lepape, Alain Veyrieras, Jean-Baptiste |
author_facet | Tournoud, Maud Larue, Audrey Cazalis, Marie-Angelique Venet, Fabienne Pachot, Alexandre Monneret, Guillaume Lepape, Alain Veyrieras, Jean-Baptiste |
author_sort | Tournoud, Maud |
collection | PubMed |
description | BACKGROUND: Construction and validation of a prognostic model for survival data in the clinical domain is still an active field of research. Nevertheless there is no consensus on how to develop routine prognostic tests based on a combination of RT-qPCR biomarkers and clinical or demographic variables. In particular, the estimation of the model performance requires to properly account for the RT-qPCR experimental design. RESULTS: We present a strategy to build, select, and validate a prognostic model for survival data based on a combination of RT-qPCR biomarkers and clinical or demographic data and we provide an illustration on a real clinical dataset. First, we compare two cross-validation schemes: a classical outcome-stratified cross-validation scheme and an alternative one that accounts for the RT-qPCR plate design, especially when samples are processed by batches. The latter is intended to limit the performance discrepancies, also called the validation surprise, between the training and the test sets. Second, strategies for model building (covariate selection, functional relationship modeling, and statistical model) as well as performance indicators estimation are presented. Since in practice several prognostic models can exhibit similar performances, complementary criteria for model selection are discussed: the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performance. CONCLUSION: On the training dataset, appropriate resampling methods are expected to prevent from any upward biases due to unaccounted technical and biological variability that may arise from the experimental and intrinsic design of the RT-qPCR assay. Moreover, the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performances are pivotal indicators to select the optimal model to be validated on the test dataset. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0537-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4384357 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43843572015-04-04 A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates Tournoud, Maud Larue, Audrey Cazalis, Marie-Angelique Venet, Fabienne Pachot, Alexandre Monneret, Guillaume Lepape, Alain Veyrieras, Jean-Baptiste BMC Bioinformatics Methodology Article BACKGROUND: Construction and validation of a prognostic model for survival data in the clinical domain is still an active field of research. Nevertheless there is no consensus on how to develop routine prognostic tests based on a combination of RT-qPCR biomarkers and clinical or demographic variables. In particular, the estimation of the model performance requires to properly account for the RT-qPCR experimental design. RESULTS: We present a strategy to build, select, and validate a prognostic model for survival data based on a combination of RT-qPCR biomarkers and clinical or demographic data and we provide an illustration on a real clinical dataset. First, we compare two cross-validation schemes: a classical outcome-stratified cross-validation scheme and an alternative one that accounts for the RT-qPCR plate design, especially when samples are processed by batches. The latter is intended to limit the performance discrepancies, also called the validation surprise, between the training and the test sets. Second, strategies for model building (covariate selection, functional relationship modeling, and statistical model) as well as performance indicators estimation are presented. Since in practice several prognostic models can exhibit similar performances, complementary criteria for model selection are discussed: the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performance. CONCLUSION: On the training dataset, appropriate resampling methods are expected to prevent from any upward biases due to unaccounted technical and biological variability that may arise from the experimental and intrinsic design of the RT-qPCR assay. Moreover, the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performances are pivotal indicators to select the optimal model to be validated on the test dataset. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0537-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-28 /pmc/articles/PMC4384357/ /pubmed/25880752 http://dx.doi.org/10.1186/s12859-015-0537-9 Text en © Tournoud et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Tournoud, Maud Larue, Audrey Cazalis, Marie-Angelique Venet, Fabienne Pachot, Alexandre Monneret, Guillaume Lepape, Alain Veyrieras, Jean-Baptiste A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates |
title | A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates |
title_full | A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates |
title_fullStr | A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates |
title_full_unstemmed | A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates |
title_short | A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates |
title_sort | strategy to build and validate a prognostic biomarker model based on rt-qpcr gene expression and clinical covariates |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384357/ https://www.ncbi.nlm.nih.gov/pubmed/25880752 http://dx.doi.org/10.1186/s12859-015-0537-9 |
work_keys_str_mv | AT tournoudmaud astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT larueaudrey astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT cazalismarieangelique astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT venetfabienne astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT pachotalexandre astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT monneretguillaume astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT lepapealain astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT veyrierasjeanbaptiste astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT tournoudmaud strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT larueaudrey strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT cazalismarieangelique strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT venetfabienne strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT pachotalexandre strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT monneretguillaume strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT lepapealain strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates AT veyrierasjeanbaptiste strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates |