Cargando…

Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information

BACKGROUND: DNA mixtures of two or more people are a common type of forensic crime scene evidence. A match statistic that connects the evidence to a criminal defendant is usually needed for court. Jurors rely on this strength of match to help decide guilt or innocence. However, the reliability of un...

Descripción completa

Detalles Bibliográficos
Autor principal: Perlin, Mark William
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Medknow Publications & Media Pvt Ltd 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4639950/
https://www.ncbi.nlm.nih.gov/pubmed/26605124
http://dx.doi.org/10.4103/2153-3539.168525
_version_ 1782400010385620992
author Perlin, Mark William
author_facet Perlin, Mark William
author_sort Perlin, Mark William
collection PubMed
description BACKGROUND: DNA mixtures of two or more people are a common type of forensic crime scene evidence. A match statistic that connects the evidence to a criminal defendant is usually needed for court. Jurors rely on this strength of match to help decide guilt or innocence. However, the reliability of unsophisticated match statistics for DNA mixtures has been questioned. MATERIALS AND METHODS: The most prevalent match statistic for DNA mixtures is the combined probability of inclusion (CPI), used by crime labs for over 15 years. When testing 13 short tandem repeat (STR) genetic loci, the CPI(-1) value is typically around a million, regardless of DNA mixture composition. However, actual identification information, as measured by a likelihood ratio (LR), spans a much broader range. This study examined probability of inclusion (PI) mixture statistics for 517 locus experiments drawn from 16 reported cases and compared them with LR locus information calculated independently on the same data. The log(PI(-1)) values were examined and compared with corresponding log(LR) values. RESULTS: The LR and CPI methods were compared in case examples of false inclusion, false exclusion, a homicide, and criminal justice outcomes. Statistical analysis of crime laboratory STR data shows that inclusion match statistics exhibit a truncated normal distribution having zero center, with little correlation to actual identification information. By the law of large numbers (LLN), CPI(-1) increases with the number of tested genetic loci, regardless of DNA mixture composition or match information. These statistical findings explain why CPI is relatively constant, with implications for DNA policy, criminal justice, cost of crime, and crime prevention. CONCLUSIONS: Forensic crime laboratories have generated CPI statistics on hundreds of thousands of DNA mixture evidence items. However, this commonly used match statistic behaves like a random generator of inclusionary values, following the LLN rather than measuring identification information. A quantitative CPI number adds little meaningful information beyond the analyst's initial qualitative assessment that a person's DNA is included in a mixture. Statistical methods for reporting on DNA mixture evidence should be scientifically validated before they are relied upon by criminal justice.
format Online
Article
Text
id pubmed-4639950
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Medknow Publications & Media Pvt Ltd
record_format MEDLINE/PubMed
spelling pubmed-46399502015-11-24 Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information Perlin, Mark William J Pathol Inform Research Article BACKGROUND: DNA mixtures of two or more people are a common type of forensic crime scene evidence. A match statistic that connects the evidence to a criminal defendant is usually needed for court. Jurors rely on this strength of match to help decide guilt or innocence. However, the reliability of unsophisticated match statistics for DNA mixtures has been questioned. MATERIALS AND METHODS: The most prevalent match statistic for DNA mixtures is the combined probability of inclusion (CPI), used by crime labs for over 15 years. When testing 13 short tandem repeat (STR) genetic loci, the CPI(-1) value is typically around a million, regardless of DNA mixture composition. However, actual identification information, as measured by a likelihood ratio (LR), spans a much broader range. This study examined probability of inclusion (PI) mixture statistics for 517 locus experiments drawn from 16 reported cases and compared them with LR locus information calculated independently on the same data. The log(PI(-1)) values were examined and compared with corresponding log(LR) values. RESULTS: The LR and CPI methods were compared in case examples of false inclusion, false exclusion, a homicide, and criminal justice outcomes. Statistical analysis of crime laboratory STR data shows that inclusion match statistics exhibit a truncated normal distribution having zero center, with little correlation to actual identification information. By the law of large numbers (LLN), CPI(-1) increases with the number of tested genetic loci, regardless of DNA mixture composition or match information. These statistical findings explain why CPI is relatively constant, with implications for DNA policy, criminal justice, cost of crime, and crime prevention. CONCLUSIONS: Forensic crime laboratories have generated CPI statistics on hundreds of thousands of DNA mixture evidence items. However, this commonly used match statistic behaves like a random generator of inclusionary values, following the LLN rather than measuring identification information. A quantitative CPI number adds little meaningful information beyond the analyst's initial qualitative assessment that a person's DNA is included in a mixture. Statistical methods for reporting on DNA mixture evidence should be scientifically validated before they are relied upon by criminal justice. Medknow Publications & Media Pvt Ltd 2015-10-28 /pmc/articles/PMC4639950/ /pubmed/26605124 http://dx.doi.org/10.4103/2153-3539.168525 Text en Copyright: © 2015 Journal of Pathology Informatics http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.
spellingShingle Research Article
Perlin, Mark William
Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information
title Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information
title_full Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information
title_fullStr Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information
title_full_unstemmed Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information
title_short Inclusion probability for DNA mixtures is a subjective one-sided match statistic unrelated to identification information
title_sort inclusion probability for dna mixtures is a subjective one-sided match statistic unrelated to identification information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4639950/
https://www.ncbi.nlm.nih.gov/pubmed/26605124
http://dx.doi.org/10.4103/2153-3539.168525
work_keys_str_mv AT perlinmarkwilliam inclusionprobabilityfordnamixturesisasubjectiveonesidedmatchstatisticunrelatedtoidentificationinformation