Cargando…

Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance

Application of personalized medicine requires integration of different data to determine each patient's unique clinical constitution. The automated analysis of medical data is a growing field where different machine learning techniques are used to minimize the time-consuming task of manual anal...

Descripción completa

Detalles Bibliográficos
Autores principales:	Svensson, Carl-Magnus, Hübler, Ron, Figge, Marc Thilo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2015
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4609523/ https://www.ncbi.nlm.nih.gov/pubmed/26504857 http://dx.doi.org/10.1155/2015/573165

_version_	1782395830894854144
author	Svensson, Carl-Magnus Hübler, Ron Figge, Marc Thilo
author_facet	Svensson, Carl-Magnus Hübler, Ron Figge, Marc Thilo
author_sort	Svensson, Carl-Magnus
collection	PubMed
description	Application of personalized medicine requires integration of different data to determine each patient's unique clinical constitution. The automated analysis of medical data is a growing field where different machine learning techniques are used to minimize the time-consuming task of manual analysis. The evaluation, and often training, of automated classifiers requires manually labelled data as ground truth. In many cases such labelling is not perfect, either because of the data being ambiguous even for a trained expert or because of mistakes. Here we investigated the interobserver variability of image data comprising fluorescently stained circulating tumor cells and its effect on the performance of two automated classifiers, a random forest and a support vector machine. We found that uncertainty in annotation between observers limited the performance of the automated classifiers, especially when it was included in the test set on which classifier performance was measured. The random forest classifier turned out to be resilient to uncertainty in the training data while the support vector machine's performance is highly dependent on the amount of uncertainty in the training data. We finally introduced the consensus data set as a possible solution for evaluation of automated classifiers that minimizes the penalty of interobserver variability.
format	Online Article Text
id	pubmed-4609523
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-46095232015-10-26 Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance Svensson, Carl-Magnus Hübler, Ron Figge, Marc Thilo J Immunol Res Research Article Application of personalized medicine requires integration of different data to determine each patient's unique clinical constitution. The automated analysis of medical data is a growing field where different machine learning techniques are used to minimize the time-consuming task of manual analysis. The evaluation, and often training, of automated classifiers requires manually labelled data as ground truth. In many cases such labelling is not perfect, either because of the data being ambiguous even for a trained expert or because of mistakes. Here we investigated the interobserver variability of image data comprising fluorescently stained circulating tumor cells and its effect on the performance of two automated classifiers, a random forest and a support vector machine. We found that uncertainty in annotation between observers limited the performance of the automated classifiers, especially when it was included in the test set on which classifier performance was measured. The random forest classifier turned out to be resilient to uncertainty in the training data while the support vector machine's performance is highly dependent on the amount of uncertainty in the training data. We finally introduced the consensus data set as a possible solution for evaluation of automated classifiers that minimizes the penalty of interobserver variability. Hindawi Publishing Corporation 2015 2015-10-04 /pmc/articles/PMC4609523/ /pubmed/26504857 http://dx.doi.org/10.1155/2015/573165 Text en Copyright © 2015 Carl-Magnus Svensson et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Svensson, Carl-Magnus Hübler, Ron Figge, Marc Thilo Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance
title	Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance
title_full	Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance
title_fullStr	Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance
title_full_unstemmed	Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance
title_short	Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance
title_sort	automated classification of circulating tumor cells and the impact of interobsever variability on classifier training and performance
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4609523/ https://www.ncbi.nlm.nih.gov/pubmed/26504857 http://dx.doi.org/10.1155/2015/573165
work_keys_str_mv	AT svenssoncarlmagnus automatedclassificationofcirculatingtumorcellsandtheimpactofinterobsevervariabilityonclassifiertrainingandperformance AT hublerron automatedclassificationofcirculatingtumorcellsandtheimpactofinterobsevervariabilityonclassifiertrainingandperformance AT figgemarcthilo automatedclassificationofcirculatingtumorcellsandtheimpactofinterobsevervariabilityonclassifiertrainingandperformance

Automated Classification of Circulating Tumor Cells and the Impact of Interobsever Variability on Classifier Training and Performance

Ejemplares similares