Cargando…

Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths

BACKGROUND: Verbal autopsies (VA) are increasingly used in low- and middle-income countries where most causes of death (COD) occur at home without medical attention, and home deaths differ substantially from hospital deaths. Hence, there is no plausible “standard” against which VAs for home deaths m...

Descripción completa

Detalles Bibliográficos
Autores principales: Miasnikof, Pierre, Giannakeas, Vasily, Gomes, Mireille, Aleksandrowicz, Lukasz, Shestopaloff, Alexander Y., Alam, Dewan, Tollman, Stephen, Samarikhalaj, Akram, Jha, Prabhat
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4660822/
https://www.ncbi.nlm.nih.gov/pubmed/26607695
http://dx.doi.org/10.1186/s12916-015-0521-2
_version_ 1782402881556578304
author Miasnikof, Pierre
Giannakeas, Vasily
Gomes, Mireille
Aleksandrowicz, Lukasz
Shestopaloff, Alexander Y.
Alam, Dewan
Tollman, Stephen
Samarikhalaj, Akram
Jha, Prabhat
author_facet Miasnikof, Pierre
Giannakeas, Vasily
Gomes, Mireille
Aleksandrowicz, Lukasz
Shestopaloff, Alexander Y.
Alam, Dewan
Tollman, Stephen
Samarikhalaj, Akram
Jha, Prabhat
author_sort Miasnikof, Pierre
collection PubMed
description BACKGROUND: Verbal autopsies (VA) are increasingly used in low- and middle-income countries where most causes of death (COD) occur at home without medical attention, and home deaths differ substantially from hospital deaths. Hence, there is no plausible “standard” against which VAs for home deaths may be validated. Previous studies have shown contradictory performance of automated methods compared to physician-based classification of CODs. We sought to compare the performance of the classic naive Bayes classifier (NBC) versus existing automated classifiers, using physician-based classification as the reference. METHODS: We compared the performance of NBC, an open-source Tariff Method (OTM), and InterVA-4 on three datasets covering about 21,000 child and adult deaths: the ongoing Million Death Study in India, and health and demographic surveillance sites in Agincourt, South Africa and Matlab, Bangladesh. We applied several training and testing splits of the data to quantify the sensitivity and specificity compared to physician coding for individual CODs and to test the cause-specific mortality fractions at the population level. RESULTS: The NBC achieved comparable sensitivity (median 0.51, range 0.48-0.58) to OTM (median 0.50, range 0.41-0.51), with InterVA-4 having lower sensitivity (median 0.43, range 0.36-0.47) in all three datasets, across all CODs. Consistency of CODs was comparable for NBC and InterVA-4 but lower for OTM. NBC and OTM achieved better performance when using a local rather than a non-local training dataset. At the population level, NBC scored the highest cause-specific mortality fraction accuracy across the datasets (median 0.88, range 0.87-0.93), followed by InterVA-4 (median 0.66, range 0.62-0.73) and OTM (median 0.57, range 0.42-0.58). CONCLUSIONS: NBC outperforms current similar COD classifiers at the population level. Nevertheless, no current automated classifier adequately replicates physician classification for individual CODs. There is a need for further research on automated classifiers using local training and test data in diverse settings prior to recommending any replacement of physician-based classification of verbal autopsies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12916-015-0521-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4660822
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46608222015-11-27 Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths Miasnikof, Pierre Giannakeas, Vasily Gomes, Mireille Aleksandrowicz, Lukasz Shestopaloff, Alexander Y. Alam, Dewan Tollman, Stephen Samarikhalaj, Akram Jha, Prabhat BMC Med Research Article BACKGROUND: Verbal autopsies (VA) are increasingly used in low- and middle-income countries where most causes of death (COD) occur at home without medical attention, and home deaths differ substantially from hospital deaths. Hence, there is no plausible “standard” against which VAs for home deaths may be validated. Previous studies have shown contradictory performance of automated methods compared to physician-based classification of CODs. We sought to compare the performance of the classic naive Bayes classifier (NBC) versus existing automated classifiers, using physician-based classification as the reference. METHODS: We compared the performance of NBC, an open-source Tariff Method (OTM), and InterVA-4 on three datasets covering about 21,000 child and adult deaths: the ongoing Million Death Study in India, and health and demographic surveillance sites in Agincourt, South Africa and Matlab, Bangladesh. We applied several training and testing splits of the data to quantify the sensitivity and specificity compared to physician coding for individual CODs and to test the cause-specific mortality fractions at the population level. RESULTS: The NBC achieved comparable sensitivity (median 0.51, range 0.48-0.58) to OTM (median 0.50, range 0.41-0.51), with InterVA-4 having lower sensitivity (median 0.43, range 0.36-0.47) in all three datasets, across all CODs. Consistency of CODs was comparable for NBC and InterVA-4 but lower for OTM. NBC and OTM achieved better performance when using a local rather than a non-local training dataset. At the population level, NBC scored the highest cause-specific mortality fraction accuracy across the datasets (median 0.88, range 0.87-0.93), followed by InterVA-4 (median 0.66, range 0.62-0.73) and OTM (median 0.57, range 0.42-0.58). CONCLUSIONS: NBC outperforms current similar COD classifiers at the population level. Nevertheless, no current automated classifier adequately replicates physician classification for individual CODs. There is a need for further research on automated classifiers using local training and test data in diverse settings prior to recommending any replacement of physician-based classification of verbal autopsies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12916-015-0521-2) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-25 /pmc/articles/PMC4660822/ /pubmed/26607695 http://dx.doi.org/10.1186/s12916-015-0521-2 Text en © Miasnikof et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Miasnikof, Pierre
Giannakeas, Vasily
Gomes, Mireille
Aleksandrowicz, Lukasz
Shestopaloff, Alexander Y.
Alam, Dewan
Tollman, Stephen
Samarikhalaj, Akram
Jha, Prabhat
Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths
title Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths
title_full Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths
title_fullStr Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths
title_full_unstemmed Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths
title_short Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths
title_sort naive bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4660822/
https://www.ncbi.nlm.nih.gov/pubmed/26607695
http://dx.doi.org/10.1186/s12916-015-0521-2
work_keys_str_mv AT miasnikofpierre naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT giannakeasvasily naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT gomesmireille naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT aleksandrowiczlukasz naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT shestopaloffalexandery naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT alamdewan naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT tollmanstephen naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT samarikhalajakram naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths
AT jhaprabhat naivebayesclassifiersforverbalautopsiescomparisontophysicianbasedclassificationfor21000childandadultdeaths