Cargando…

A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates

BACKGROUND: Construction and validation of a prognostic model for survival data in the clinical domain is still an active field of research. Nevertheless there is no consensus on how to develop routine prognostic tests based on a combination of RT-qPCR biomarkers and clinical or demographic variable...

Descripción completa

Detalles Bibliográficos
Autores principales: Tournoud, Maud, Larue, Audrey, Cazalis, Marie-Angelique, Venet, Fabienne, Pachot, Alexandre, Monneret, Guillaume, Lepape, Alain, Veyrieras, Jean-Baptiste
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384357/
https://www.ncbi.nlm.nih.gov/pubmed/25880752
http://dx.doi.org/10.1186/s12859-015-0537-9
_version_ 1782364890738982912
author Tournoud, Maud
Larue, Audrey
Cazalis, Marie-Angelique
Venet, Fabienne
Pachot, Alexandre
Monneret, Guillaume
Lepape, Alain
Veyrieras, Jean-Baptiste
author_facet Tournoud, Maud
Larue, Audrey
Cazalis, Marie-Angelique
Venet, Fabienne
Pachot, Alexandre
Monneret, Guillaume
Lepape, Alain
Veyrieras, Jean-Baptiste
author_sort Tournoud, Maud
collection PubMed
description BACKGROUND: Construction and validation of a prognostic model for survival data in the clinical domain is still an active field of research. Nevertheless there is no consensus on how to develop routine prognostic tests based on a combination of RT-qPCR biomarkers and clinical or demographic variables. In particular, the estimation of the model performance requires to properly account for the RT-qPCR experimental design. RESULTS: We present a strategy to build, select, and validate a prognostic model for survival data based on a combination of RT-qPCR biomarkers and clinical or demographic data and we provide an illustration on a real clinical dataset. First, we compare two cross-validation schemes: a classical outcome-stratified cross-validation scheme and an alternative one that accounts for the RT-qPCR plate design, especially when samples are processed by batches. The latter is intended to limit the performance discrepancies, also called the validation surprise, between the training and the test sets. Second, strategies for model building (covariate selection, functional relationship modeling, and statistical model) as well as performance indicators estimation are presented. Since in practice several prognostic models can exhibit similar performances, complementary criteria for model selection are discussed: the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performance. CONCLUSION: On the training dataset, appropriate resampling methods are expected to prevent from any upward biases due to unaccounted technical and biological variability that may arise from the experimental and intrinsic design of the RT-qPCR assay. Moreover, the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performances are pivotal indicators to select the optimal model to be validated on the test dataset. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0537-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4384357
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43843572015-04-04 A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates Tournoud, Maud Larue, Audrey Cazalis, Marie-Angelique Venet, Fabienne Pachot, Alexandre Monneret, Guillaume Lepape, Alain Veyrieras, Jean-Baptiste BMC Bioinformatics Methodology Article BACKGROUND: Construction and validation of a prognostic model for survival data in the clinical domain is still an active field of research. Nevertheless there is no consensus on how to develop routine prognostic tests based on a combination of RT-qPCR biomarkers and clinical or demographic variables. In particular, the estimation of the model performance requires to properly account for the RT-qPCR experimental design. RESULTS: We present a strategy to build, select, and validate a prognostic model for survival data based on a combination of RT-qPCR biomarkers and clinical or demographic data and we provide an illustration on a real clinical dataset. First, we compare two cross-validation schemes: a classical outcome-stratified cross-validation scheme and an alternative one that accounts for the RT-qPCR plate design, especially when samples are processed by batches. The latter is intended to limit the performance discrepancies, also called the validation surprise, between the training and the test sets. Second, strategies for model building (covariate selection, functional relationship modeling, and statistical model) as well as performance indicators estimation are presented. Since in practice several prognostic models can exhibit similar performances, complementary criteria for model selection are discussed: the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performance. CONCLUSION: On the training dataset, appropriate resampling methods are expected to prevent from any upward biases due to unaccounted technical and biological variability that may arise from the experimental and intrinsic design of the RT-qPCR assay. Moreover, the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performances are pivotal indicators to select the optimal model to be validated on the test dataset. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0537-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-28 /pmc/articles/PMC4384357/ /pubmed/25880752 http://dx.doi.org/10.1186/s12859-015-0537-9 Text en © Tournoud et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Tournoud, Maud
Larue, Audrey
Cazalis, Marie-Angelique
Venet, Fabienne
Pachot, Alexandre
Monneret, Guillaume
Lepape, Alain
Veyrieras, Jean-Baptiste
A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates
title A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates
title_full A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates
title_fullStr A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates
title_full_unstemmed A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates
title_short A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates
title_sort strategy to build and validate a prognostic biomarker model based on rt-qpcr gene expression and clinical covariates
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384357/
https://www.ncbi.nlm.nih.gov/pubmed/25880752
http://dx.doi.org/10.1186/s12859-015-0537-9
work_keys_str_mv AT tournoudmaud astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT larueaudrey astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT cazalismarieangelique astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT venetfabienne astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT pachotalexandre astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT monneretguillaume astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT lepapealain astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT veyrierasjeanbaptiste astrategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT tournoudmaud strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT larueaudrey strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT cazalismarieangelique strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT venetfabienne strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT pachotalexandre strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT monneretguillaume strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT lepapealain strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates
AT veyrierasjeanbaptiste strategytobuildandvalidateaprognosticbiomarkermodelbasedonrtqpcrgeneexpressionandclinicalcovariates