Cargando…

Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation

Missing data pose one of the greatest challenges in the rigorous evaluation of biomarkers. The limited availability of specimens with complete clinical annotation and quality biomaterial often leads to underpowered studies. Tissue microarray studies, for example, may be further handicapped by the lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Emerson, John W., Dolled-Filhart, Marisa, Harris, Lyndsay, Rimm, David L., Tuck, David P.
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2664700/
https://www.ncbi.nlm.nih.gov/pubmed/19352457
_version_ 1782165979768291328
author Emerson, John W.
Dolled-Filhart, Marisa
Harris, Lyndsay
Rimm, David L.
Tuck, David P.
author_facet Emerson, John W.
Dolled-Filhart, Marisa
Harris, Lyndsay
Rimm, David L.
Tuck, David P.
author_sort Emerson, John W.
collection PubMed
description Missing data pose one of the greatest challenges in the rigorous evaluation of biomarkers. The limited availability of specimens with complete clinical annotation and quality biomaterial often leads to underpowered studies. Tissue microarray studies, for example, may be further handicapped by the loss of data points because of unevaluable staining, core loss, or the lack of tumor in the histospot. This paper presents a novel approach to these common problems in the context of a tissue protein biomarker analysis in a cohort of patients with breast cancer. Our analysis develops techniques based on multiple imputation to address the missing value problem. We first select markers using a training cohort, identifying a small subset of protein expression levels that are most useful in predicting patient survival. The best model is obtained by including both protein markers (including COX6C, GATA3, NAT1, and ESR1) and lymph node status. The use of either lymph node status or the four protein expression levels provides similar improvements in goodness-of-fit, with both significantly better than a baseline clinical model. Using the same multiple imputation strategy, we then validate the results out-of-sample on a larger independent cohort. Our approach of integrating multiple imputation with each stage of the analysis serves as an example that may be replicated or adapted in future studies with missing values.
format Text
id pubmed-2664700
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26647002009-04-07 Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation Emerson, John W. Dolled-Filhart, Marisa Harris, Lyndsay Rimm, David L. Tuck, David P. Cancer Inform Original Research Missing data pose one of the greatest challenges in the rigorous evaluation of biomarkers. The limited availability of specimens with complete clinical annotation and quality biomaterial often leads to underpowered studies. Tissue microarray studies, for example, may be further handicapped by the loss of data points because of unevaluable staining, core loss, or the lack of tumor in the histospot. This paper presents a novel approach to these common problems in the context of a tissue protein biomarker analysis in a cohort of patients with breast cancer. Our analysis develops techniques based on multiple imputation to address the missing value problem. We first select markers using a training cohort, identifying a small subset of protein expression levels that are most useful in predicting patient survival. The best model is obtained by including both protein markers (including COX6C, GATA3, NAT1, and ESR1) and lymph node status. The use of either lymph node status or the four protein expression levels provides similar improvements in goodness-of-fit, with both significantly better than a baseline clinical model. Using the same multiple imputation strategy, we then validate the results out-of-sample on a larger independent cohort. Our approach of integrating multiple imputation with each stage of the analysis serves as an example that may be replicated or adapted in future studies with missing values. Libertas Academica 2008-12-23 /pmc/articles/PMC2664700/ /pubmed/19352457 Text en © 2009 The authors. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Emerson, John W.
Dolled-Filhart, Marisa
Harris, Lyndsay
Rimm, David L.
Tuck, David P.
Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation
title Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation
title_full Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation
title_fullStr Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation
title_full_unstemmed Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation
title_short Quantitative Assessment of Tissue Biomarkers and Construction of a Model to Predict Outcome in Breast Cancer Using Multiple Imputation
title_sort quantitative assessment of tissue biomarkers and construction of a model to predict outcome in breast cancer using multiple imputation
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2664700/
https://www.ncbi.nlm.nih.gov/pubmed/19352457
work_keys_str_mv AT emersonjohnw quantitativeassessmentoftissuebiomarkersandconstructionofamodeltopredictoutcomeinbreastcancerusingmultipleimputation
AT dolledfilhartmarisa quantitativeassessmentoftissuebiomarkersandconstructionofamodeltopredictoutcomeinbreastcancerusingmultipleimputation
AT harrislyndsay quantitativeassessmentoftissuebiomarkersandconstructionofamodeltopredictoutcomeinbreastcancerusingmultipleimputation
AT rimmdavidl quantitativeassessmentoftissuebiomarkersandconstructionofamodeltopredictoutcomeinbreastcancerusingmultipleimputation
AT tuckdavidp quantitativeassessmentoftissuebiomarkersandconstructionofamodeltopredictoutcomeinbreastcancerusingmultipleimputation