Cargando…

Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans

BACKGROUND: Multivariate assays (MVAs) for assisting clinical decisions are becoming commonly available, but due to complexity, are often considered a high-risk approach. A key concern is that uncertainty on the assay's final results is not well understood. This study focuses on developing a pr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ebbert, Mark TW, Bastien, Roy RL, Boucher, Kenneth M, Martín, Miguel, Carrasco, Eva, Caballero, Rosalía, Stijleman, Inge J, Bernard, Philip S, Facelli, Julio C
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Methodology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3275466/ https://www.ncbi.nlm.nih.gov/pubmed/22196354 http://dx.doi.org/10.1186/2043-9113-1-37

_version_	1782223214957559808
author	Ebbert, Mark TW Bastien, Roy RL Boucher, Kenneth M Martín, Miguel Carrasco, Eva Caballero, Rosalía Stijleman, Inge J Bernard, Philip S Facelli, Julio C
author_facet	Ebbert, Mark TW Bastien, Roy RL Boucher, Kenneth M Martín, Miguel Carrasco, Eva Caballero, Rosalía Stijleman, Inge J Bernard, Philip S Facelli, Julio C
author_sort	Ebbert, Mark TW
collection	PubMed
description	BACKGROUND: Multivariate assays (MVAs) for assisting clinical decisions are becoming commonly available, but due to complexity, are often considered a high-risk approach. A key concern is that uncertainty on the assay's final results is not well understood. This study focuses on developing a process to characterize error introduced in the MVA's results from the intrinsic error in the laboratory process: sample preparation and measurement of the contributing factors, such as gene expression. METHODS: Using the PAM50 Breast Cancer Intrinsic Classifier, we show how to characterize error within an MVA, and how these errors may affect results reported to clinicians. First we estimated the error distribution for measured factors within the PAM50 assay by performing repeated measures on four archetypal samples representative of the major breast cancer tumor subtypes. Then, using the error distributions and the original archetypal sample data, we used Monte Carlo simulations to generate a sufficient number of simulated samples. The effect of these errors on the PAM50 tumor subtype classification was estimated by measuring subtype reproducibility after classifying all simulated samples. Subtype reproducibility was measured as the percentage of simulated samples classified identically to the parent sample. The simulation was thereafter repeated on a large, independent data set of samples from the GEICAM 9906 clinical trial. Simulated samples from the GEICAM sample set were used to explore a more realistic scenario where, unlike archetypal samples, many samples are not easily classified. RESULTS: All simulated samples derived from the archetypal samples were classified identically to the parent sample. Subtypes for simulated samples from the GEICAM set were also highly reproducible, but there were a non-negligible number of samples that exhibit significant variability in their classification. CONCLUSIONS: We have developed a general methodology to estimate the effects of intrinsic errors within MVAs. We have applied the method to the PAM50 assay, showing that the PAM50 results are resilient to intrinsic errors within the assay, but also finding that in non-archetypal samples, experimental errors can lead to quite different classification of a tumor. Finally we propose a way to provide the uncertainty information in a usable way for clinicians.
format	Online Article Text
id	pubmed-3275466
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-32754662012-02-13 Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans Ebbert, Mark TW Bastien, Roy RL Boucher, Kenneth M Martín, Miguel Carrasco, Eva Caballero, Rosalía Stijleman, Inge J Bernard, Philip S Facelli, Julio C J Clin Bioinforma Methodology BACKGROUND: Multivariate assays (MVAs) for assisting clinical decisions are becoming commonly available, but due to complexity, are often considered a high-risk approach. A key concern is that uncertainty on the assay's final results is not well understood. This study focuses on developing a process to characterize error introduced in the MVA's results from the intrinsic error in the laboratory process: sample preparation and measurement of the contributing factors, such as gene expression. METHODS: Using the PAM50 Breast Cancer Intrinsic Classifier, we show how to characterize error within an MVA, and how these errors may affect results reported to clinicians. First we estimated the error distribution for measured factors within the PAM50 assay by performing repeated measures on four archetypal samples representative of the major breast cancer tumor subtypes. Then, using the error distributions and the original archetypal sample data, we used Monte Carlo simulations to generate a sufficient number of simulated samples. The effect of these errors on the PAM50 tumor subtype classification was estimated by measuring subtype reproducibility after classifying all simulated samples. Subtype reproducibility was measured as the percentage of simulated samples classified identically to the parent sample. The simulation was thereafter repeated on a large, independent data set of samples from the GEICAM 9906 clinical trial. Simulated samples from the GEICAM sample set were used to explore a more realistic scenario where, unlike archetypal samples, many samples are not easily classified. RESULTS: All simulated samples derived from the archetypal samples were classified identically to the parent sample. Subtypes for simulated samples from the GEICAM set were also highly reproducible, but there were a non-negligible number of samples that exhibit significant variability in their classification. CONCLUSIONS: We have developed a general methodology to estimate the effects of intrinsic errors within MVAs. We have applied the method to the PAM50 assay, showing that the PAM50 results are resilient to intrinsic errors within the assay, but also finding that in non-archetypal samples, experimental errors can lead to quite different classification of a tumor. Finally we propose a way to provide the uncertainty information in a usable way for clinicians. BioMed Central 2011-12-23 /pmc/articles/PMC3275466/ /pubmed/22196354 http://dx.doi.org/10.1186/2043-9113-1-37 Text en Copyright ©2011 Ebbert et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Ebbert, Mark TW Bastien, Roy RL Boucher, Kenneth M Martín, Miguel Carrasco, Eva Caballero, Rosalía Stijleman, Inge J Bernard, Philip S Facelli, Julio C Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans
title	Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans
title_full	Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans
title_fullStr	Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans
title_full_unstemmed	Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans
title_short	Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans
title_sort	characterization of uncertainty in the classification of multivariate assays: application to pam50 centroid-based genomic predictors for breast cancer treatment plans
topic	Methodology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3275466/ https://www.ncbi.nlm.nih.gov/pubmed/22196354 http://dx.doi.org/10.1186/2043-9113-1-37
work_keys_str_mv	AT ebbertmarktw characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT bastienroyrl characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT boucherkennethm characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT martinmiguel characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT carrascoeva characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT caballerorosalia characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT stijlemaningej characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT bernardphilips characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans AT facellijulioc characterizationofuncertaintyintheclassificationofmultivariateassaysapplicationtopam50centroidbasedgenomicpredictorsforbreastcancertreatmentplans

Characterization of uncertainty in the classification of multivariate assays: application to PAM50 centroid-based genomic predictors for breast cancer treatment plans

Ejemplares similares