Cargando…

Making External Validation Valid for Molecular Classifier Development

PURPOSE: Accurate assessment of a molecular classifier that guides patient care is of paramount importance in precision oncology. Recent years have seen an increasing use of external validation for such assessment. However, little is known about how it is affected by ubiquitous unwanted variations i...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Yilin, Huang, Huei-Chung, Qin, Li-Xuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer Health 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345919/
https://www.ncbi.nlm.nih.gov/pubmed/34377885
http://dx.doi.org/10.1200/PO.21.00103
_version_ 1783734744094081024
author Wu, Yilin
Huang, Huei-Chung
Qin, Li-Xuan
author_facet Wu, Yilin
Huang, Huei-Chung
Qin, Li-Xuan
author_sort Wu, Yilin
collection PubMed
description PURPOSE: Accurate assessment of a molecular classifier that guides patient care is of paramount importance in precision oncology. Recent years have seen an increasing use of external validation for such assessment. However, little is known about how it is affected by ubiquitous unwanted variations in test data because of disparate experimental handling and by the use of data normalization for alleviating such variations. METHODS: In this paper, we studied these issues using two microarray data sets for the same set of tumor samples and additional data simulated by resampling under various levels of signal-to-noise ratio and different designs for array-to-sample allocation. RESULTS: We showed that (1) unwanted variations can lead to biased classifier assessment and (2) data normalization mitigates the bias to varying extents depending on the specific method used. In particular, frozen normalization methods for test data outperform their conventional forms in terms of both reducing the bias in accuracy estimation and increasing robustness to handling effects. We make available our benchmarking tool as an R package on GitHub for performing such evaluation on additional methods for normalization and classification. CONCLUSION: Our findings thus highlight the importance of proper test-data normalization for valid assessment by external validation and call for caution on the choice of normalization method for molecular classifier development.
format Online
Article
Text
id pubmed-8345919
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Wolters Kluwer Health
record_format MEDLINE/PubMed
spelling pubmed-83459192021-08-09 Making External Validation Valid for Molecular Classifier Development Wu, Yilin Huang, Huei-Chung Qin, Li-Xuan JCO Precis Oncol ORIGINAL REPORTS PURPOSE: Accurate assessment of a molecular classifier that guides patient care is of paramount importance in precision oncology. Recent years have seen an increasing use of external validation for such assessment. However, little is known about how it is affected by ubiquitous unwanted variations in test data because of disparate experimental handling and by the use of data normalization for alleviating such variations. METHODS: In this paper, we studied these issues using two microarray data sets for the same set of tumor samples and additional data simulated by resampling under various levels of signal-to-noise ratio and different designs for array-to-sample allocation. RESULTS: We showed that (1) unwanted variations can lead to biased classifier assessment and (2) data normalization mitigates the bias to varying extents depending on the specific method used. In particular, frozen normalization methods for test data outperform their conventional forms in terms of both reducing the bias in accuracy estimation and increasing robustness to handling effects. We make available our benchmarking tool as an R package on GitHub for performing such evaluation on additional methods for normalization and classification. CONCLUSION: Our findings thus highlight the importance of proper test-data normalization for valid assessment by external validation and call for caution on the choice of normalization method for molecular classifier development. Wolters Kluwer Health 2021-08-05 /pmc/articles/PMC8345919/ /pubmed/34377885 http://dx.doi.org/10.1200/PO.21.00103 Text en © 2021 by American Society of Clinical Oncology https://creativecommons.org/licenses/by/4.0/Licensed under the Creative Commons Attribution 4.0 License: https://creativecommons.org/licenses/by/4.0/
spellingShingle ORIGINAL REPORTS
Wu, Yilin
Huang, Huei-Chung
Qin, Li-Xuan
Making External Validation Valid for Molecular Classifier Development
title Making External Validation Valid for Molecular Classifier Development
title_full Making External Validation Valid for Molecular Classifier Development
title_fullStr Making External Validation Valid for Molecular Classifier Development
title_full_unstemmed Making External Validation Valid for Molecular Classifier Development
title_short Making External Validation Valid for Molecular Classifier Development
title_sort making external validation valid for molecular classifier development
topic ORIGINAL REPORTS
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345919/
https://www.ncbi.nlm.nih.gov/pubmed/34377885
http://dx.doi.org/10.1200/PO.21.00103
work_keys_str_mv AT wuyilin makingexternalvalidationvalidformolecularclassifierdevelopment
AT huanghueichung makingexternalvalidationvalidformolecularclassifierdevelopment
AT qinlixuan makingexternalvalidationvalidformolecularclassifierdevelopment