Cargando…

Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images

BACKGROUND: Malaria is a life-threatening disease caused by Plasmodium parasites that infect the red blood cells (RBCs). Manual identification and counting of parasitized cells in microscopic thick/thin-film blood examination remains the common, but burdensome method for disease diagnosis. Its diagn...

Descripción completa

Detalles Bibliográficos
Autores principales: Rajaraman, Sivaramakrishnan, Jaeger, Stefan, Antani, Sameer K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6544011/
https://www.ncbi.nlm.nih.gov/pubmed/31179181
http://dx.doi.org/10.7717/peerj.6977
_version_ 1783423179238146048
author Rajaraman, Sivaramakrishnan
Jaeger, Stefan
Antani, Sameer K.
author_facet Rajaraman, Sivaramakrishnan
Jaeger, Stefan
Antani, Sameer K.
author_sort Rajaraman, Sivaramakrishnan
collection PubMed
description BACKGROUND: Malaria is a life-threatening disease caused by Plasmodium parasites that infect the red blood cells (RBCs). Manual identification and counting of parasitized cells in microscopic thick/thin-film blood examination remains the common, but burdensome method for disease diagnosis. Its diagnostic accuracy is adversely impacted by inter/intra-observer variability, particularly in large-scale screening under resource-constrained settings. INTRODUCTION: State-of-the-art computer-aided diagnostic tools based on data-driven deep learning algorithms like convolutional neural network (CNN) has become the architecture of choice for image recognition tasks. However, CNNs suffer from high variance and may overfit due to their sensitivity to training data fluctuations. OBJECTIVE: The primary aim of this study is to reduce model variance, improve robustness and generalization through constructing model ensembles toward detecting parasitized cells in thin-blood smear images. METHODS: We evaluate the performance of custom and pretrained CNNs and construct an optimal model ensemble toward the challenge of classifying parasitized and normal cells in thin-blood smear images. Cross-validation studies are performed at the patient level to ensure preventing data leakage into the validation and reduce generalization errors. The models are evaluated in terms of the following performance metrics: (a) Accuracy; (b) Area under the receiver operating characteristic (ROC) curve (AUC); (c) Mean squared error (MSE); (d) Precision; (e) F-score; and (f) Matthews Correlation Coefficient (MCC). RESULTS: It is observed that the ensemble model constructed with VGG-19 and SqueezeNet outperformed the state-of-the-art in several performance metrics toward classifying the parasitized and uninfected cells to aid in improved disease screening. CONCLUSIONS: Ensemble learning reduces the model variance by optimally combining the predictions of multiple models and decreases the sensitivity to the specifics of training data and selection of training algorithms. The performance of the model ensemble simulates real-world conditions with reduced variance, overfitting and leads to improved generalization.
format Online
Article
Text
id pubmed-6544011
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-65440112019-06-09 Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images Rajaraman, Sivaramakrishnan Jaeger, Stefan Antani, Sameer K. PeerJ Hematology BACKGROUND: Malaria is a life-threatening disease caused by Plasmodium parasites that infect the red blood cells (RBCs). Manual identification and counting of parasitized cells in microscopic thick/thin-film blood examination remains the common, but burdensome method for disease diagnosis. Its diagnostic accuracy is adversely impacted by inter/intra-observer variability, particularly in large-scale screening under resource-constrained settings. INTRODUCTION: State-of-the-art computer-aided diagnostic tools based on data-driven deep learning algorithms like convolutional neural network (CNN) has become the architecture of choice for image recognition tasks. However, CNNs suffer from high variance and may overfit due to their sensitivity to training data fluctuations. OBJECTIVE: The primary aim of this study is to reduce model variance, improve robustness and generalization through constructing model ensembles toward detecting parasitized cells in thin-blood smear images. METHODS: We evaluate the performance of custom and pretrained CNNs and construct an optimal model ensemble toward the challenge of classifying parasitized and normal cells in thin-blood smear images. Cross-validation studies are performed at the patient level to ensure preventing data leakage into the validation and reduce generalization errors. The models are evaluated in terms of the following performance metrics: (a) Accuracy; (b) Area under the receiver operating characteristic (ROC) curve (AUC); (c) Mean squared error (MSE); (d) Precision; (e) F-score; and (f) Matthews Correlation Coefficient (MCC). RESULTS: It is observed that the ensemble model constructed with VGG-19 and SqueezeNet outperformed the state-of-the-art in several performance metrics toward classifying the parasitized and uninfected cells to aid in improved disease screening. CONCLUSIONS: Ensemble learning reduces the model variance by optimally combining the predictions of multiple models and decreases the sensitivity to the specifics of training data and selection of training algorithms. The performance of the model ensemble simulates real-world conditions with reduced variance, overfitting and leads to improved generalization. PeerJ Inc. 2019-05-28 /pmc/articles/PMC6544011/ /pubmed/31179181 http://dx.doi.org/10.7717/peerj.6977 Text en http://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, made available under the Creative Commons Public Domain Dedication (http://creativecommons.org/publicdomain/zero/1.0/) . This work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Hematology
Rajaraman, Sivaramakrishnan
Jaeger, Stefan
Antani, Sameer K.
Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images
title Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images
title_full Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images
title_fullStr Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images
title_full_unstemmed Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images
title_short Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images
title_sort performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images
topic Hematology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6544011/
https://www.ncbi.nlm.nih.gov/pubmed/31179181
http://dx.doi.org/10.7717/peerj.6977
work_keys_str_mv AT rajaramansivaramakrishnan performanceevaluationofdeepneuralensemblestowardmalariaparasitedetectioninthinbloodsmearimages
AT jaegerstefan performanceevaluationofdeepneuralensemblestowardmalariaparasitedetectioninthinbloodsmearimages
AT antanisameerk performanceevaluationofdeepneuralensemblestowardmalariaparasitedetectioninthinbloodsmearimages