Cargando…

Selection bias in the reported performances of AD classification pipelines

The last decade has seen a great proliferation of supervised learning pipelines for individual diagnosis and prognosis in Alzheimer's disease. As more pipelines are developed and evaluated in the search for greater performance, only those results that are relatively impressive will be selected...

Descripción completa

Detalles Bibliográficos
Autores principales: Mendelson, Alex F., Zuluaga, Maria A., Lorenzi, Marco, Hutton, Brian F., Ourselin, Sébastien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5322215/
https://www.ncbi.nlm.nih.gov/pubmed/28271040
http://dx.doi.org/10.1016/j.nicl.2016.12.018
_version_ 1782509808259170304
author Mendelson, Alex F.
Zuluaga, Maria A.
Lorenzi, Marco
Hutton, Brian F.
Ourselin, Sébastien
author_facet Mendelson, Alex F.
Zuluaga, Maria A.
Lorenzi, Marco
Hutton, Brian F.
Ourselin, Sébastien
author_sort Mendelson, Alex F.
collection PubMed
description The last decade has seen a great proliferation of supervised learning pipelines for individual diagnosis and prognosis in Alzheimer's disease. As more pipelines are developed and evaluated in the search for greater performance, only those results that are relatively impressive will be selected for publication. We present an empirical study to evaluate the potential for optimistic bias in classification performance results as a result of this selection. This is achieved using a novel, resampling-based experiment design that effectively simulates the optimisation of pipeline specifications by individuals or collectives of researchers using cross validation with limited data. Our findings indicate that bias can plausibly account for an appreciable fraction (often greater than half) of the apparent performance improvement associated with the pipeline optimisation, particularly in small samples. We discuss the consistency of our findings with patterns observed in the literature and consider strategies for bias reduction and mitigation.
format Online
Article
Text
id pubmed-5322215
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-53222152017-03-07 Selection bias in the reported performances of AD classification pipelines Mendelson, Alex F. Zuluaga, Maria A. Lorenzi, Marco Hutton, Brian F. Ourselin, Sébastien Neuroimage Clin Regular Article The last decade has seen a great proliferation of supervised learning pipelines for individual diagnosis and prognosis in Alzheimer's disease. As more pipelines are developed and evaluated in the search for greater performance, only those results that are relatively impressive will be selected for publication. We present an empirical study to evaluate the potential for optimistic bias in classification performance results as a result of this selection. This is achieved using a novel, resampling-based experiment design that effectively simulates the optimisation of pipeline specifications by individuals or collectives of researchers using cross validation with limited data. Our findings indicate that bias can plausibly account for an appreciable fraction (often greater than half) of the apparent performance improvement associated with the pipeline optimisation, particularly in small samples. We discuss the consistency of our findings with patterns observed in the literature and consider strategies for bias reduction and mitigation. Elsevier 2016-12-24 /pmc/articles/PMC5322215/ /pubmed/28271040 http://dx.doi.org/10.1016/j.nicl.2016.12.018 Text en © 2016 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Regular Article
Mendelson, Alex F.
Zuluaga, Maria A.
Lorenzi, Marco
Hutton, Brian F.
Ourselin, Sébastien
Selection bias in the reported performances of AD classification pipelines
title Selection bias in the reported performances of AD classification pipelines
title_full Selection bias in the reported performances of AD classification pipelines
title_fullStr Selection bias in the reported performances of AD classification pipelines
title_full_unstemmed Selection bias in the reported performances of AD classification pipelines
title_short Selection bias in the reported performances of AD classification pipelines
title_sort selection bias in the reported performances of ad classification pipelines
topic Regular Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5322215/
https://www.ncbi.nlm.nih.gov/pubmed/28271040
http://dx.doi.org/10.1016/j.nicl.2016.12.018
work_keys_str_mv AT mendelsonalexf selectionbiasinthereportedperformancesofadclassificationpipelines
AT zuluagamariaa selectionbiasinthereportedperformancesofadclassificationpipelines
AT lorenzimarco selectionbiasinthereportedperformancesofadclassificationpipelines
AT huttonbrianf selectionbiasinthereportedperformancesofadclassificationpipelines
AT ourselinsebastien selectionbiasinthereportedperformancesofadclassificationpipelines
AT selectionbiasinthereportedperformancesofadclassificationpipelines