Cargando…

A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault

Although the backlog of untested sexual assault kits in the United States is starting to be addressed, many municipalities are opting for selective testing of samples within a kit, where only the most probative samples are tested. We use data from the San Francisco Police Department Criminalistics L...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhengli, MacMillan, Kevin, Powell, Mark, Wein, Lawrence M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7306798/
https://www.ncbi.nlm.nih.gov/pubmed/32482858
http://dx.doi.org/10.1073/pnas.2001103117
_version_ 1783548722760646656
author Wang, Zhengli
MacMillan, Kevin
Powell, Mark
Wein, Lawrence M.
author_facet Wang, Zhengli
MacMillan, Kevin
Powell, Mark
Wein, Lawrence M.
author_sort Wang, Zhengli
collection PubMed
description Although the backlog of untested sexual assault kits in the United States is starting to be addressed, many municipalities are opting for selective testing of samples within a kit, where only the most probative samples are tested. We use data from the San Francisco Police Department Criminalistics Laboratory, which tests all samples but also collects information on the samples flagged by sexual assault forensic examiners as most probative, to build a standard machine learning model that predicts (based on covariates gleaned from sexual assault kit questionnaires) which samples are most probative. This model is embedded within an optimization framework that selects which samples to test from each kit to maximize the Combined DNA Index System (CODIS) yield (i.e., the number of kits that generate at least one DNA profile for the criminal DNA database) subject to a budget constraint. Our analysis predicts that, relative to a policy that tests only the samples deemed probative by the sexual assault forensic examiners, the proposed policy increases the CODIS yield by 45.4% without increasing the cost. Full testing of all samples has a slightly lower cost-effectiveness than the selective policy based on forensic examiners, but more than doubles the yield. In over half of the sexual assaults, a sample was not collected during the forensic medical exam from the body location deemed most probative by the machine learning model. Our results suggest that electronic forensic records coupled with machine learning and optimization models could enhance the effectiveness of criminal investigations of sexual assaults.
format Online
Article
Text
id pubmed-7306798
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-73067982020-06-25 A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault Wang, Zhengli MacMillan, Kevin Powell, Mark Wein, Lawrence M. Proc Natl Acad Sci U S A Social Sciences Although the backlog of untested sexual assault kits in the United States is starting to be addressed, many municipalities are opting for selective testing of samples within a kit, where only the most probative samples are tested. We use data from the San Francisco Police Department Criminalistics Laboratory, which tests all samples but also collects information on the samples flagged by sexual assault forensic examiners as most probative, to build a standard machine learning model that predicts (based on covariates gleaned from sexual assault kit questionnaires) which samples are most probative. This model is embedded within an optimization framework that selects which samples to test from each kit to maximize the Combined DNA Index System (CODIS) yield (i.e., the number of kits that generate at least one DNA profile for the criminal DNA database) subject to a budget constraint. Our analysis predicts that, relative to a policy that tests only the samples deemed probative by the sexual assault forensic examiners, the proposed policy increases the CODIS yield by 45.4% without increasing the cost. Full testing of all samples has a slightly lower cost-effectiveness than the selective policy based on forensic examiners, but more than doubles the yield. In over half of the sexual assaults, a sample was not collected during the forensic medical exam from the body location deemed most probative by the machine learning model. Our results suggest that electronic forensic records coupled with machine learning and optimization models could enhance the effectiveness of criminal investigations of sexual assaults. National Academy of Sciences 2020-06-16 2020-06-01 /pmc/articles/PMC7306798/ /pubmed/32482858 http://dx.doi.org/10.1073/pnas.2001103117 Text en Copyright © 2020 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Social Sciences
Wang, Zhengli
MacMillan, Kevin
Powell, Mark
Wein, Lawrence M.
A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault
title A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault
title_full A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault
title_fullStr A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault
title_full_unstemmed A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault
title_short A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault
title_sort cost-effectiveness analysis of the number of samples to collect and test from a sexual assault
topic Social Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7306798/
https://www.ncbi.nlm.nih.gov/pubmed/32482858
http://dx.doi.org/10.1073/pnas.2001103117
work_keys_str_mv AT wangzhengli acosteffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault
AT macmillankevin acosteffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault
AT powellmark acosteffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault
AT weinlawrencem acosteffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault
AT wangzhengli costeffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault
AT macmillankevin costeffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault
AT powellmark costeffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault
AT weinlawrencem costeffectivenessanalysisofthenumberofsamplestocollectandtestfromasexualassault