Cargando…

Efficient construction of match strength distributions for uncertain multi-locus genotypes

Natural variation in biological evidence leads to uncertain genotypes. Forensic comparison of a probabilistic genotype with a person's reference gives a numerical strength of DNA association. The distribution of match strength for all possible references usefully represents a genotype's po...

Descripción completa

Detalles Bibliográficos
Autor principal: Perlin, Mark W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6176789/
https://www.ncbi.nlm.nih.gov/pubmed/30310872
http://dx.doi.org/10.1016/j.heliyon.2018.e00824
_version_ 1783361756673867776
author Perlin, Mark W.
author_facet Perlin, Mark W.
author_sort Perlin, Mark W.
collection PubMed
description Natural variation in biological evidence leads to uncertain genotypes. Forensic comparison of a probabilistic genotype with a person's reference gives a numerical strength of DNA association. The distribution of match strength for all possible references usefully represents a genotype's potential information. But testing more genetic loci exponentially increases the number of multi-locus possibilities, making direct computation infeasible. At each locus, Bayesian probability can quickly assemble a match strength random variable. Multi-locus match strength is the sum of these independent variables. A multi-locus genotype's match strength distribution is efficiently constructed by convolving together the separate locus distributions. This convolution construction can accurately collate all trillion trillion reference outcomes in a fraction of a second. This paper shows how to rapidly construct multi-locus match strength distributions by convolution. Function convergence demonstrates that distribution accuracy increases with numerical resolution. Convolution construction has quadratic computational complexity, relative to the exponential number of reference genotypes. A suitably defined random variable reduces high-dimensional computational cost to fast real-line arithmetic. Match strength distributions are used in forensic validation studies. They provide error rates for match results. The convolution construction applies to discrete or continuous variables in the forensic, natural and social sciences. Computer-derived match strength distributions elicit the information inherent in DNA evidence, often overlooked by human analysis.
format Online
Article
Text
id pubmed-6176789
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-61767892018-10-11 Efficient construction of match strength distributions for uncertain multi-locus genotypes Perlin, Mark W. Heliyon Article Natural variation in biological evidence leads to uncertain genotypes. Forensic comparison of a probabilistic genotype with a person's reference gives a numerical strength of DNA association. The distribution of match strength for all possible references usefully represents a genotype's potential information. But testing more genetic loci exponentially increases the number of multi-locus possibilities, making direct computation infeasible. At each locus, Bayesian probability can quickly assemble a match strength random variable. Multi-locus match strength is the sum of these independent variables. A multi-locus genotype's match strength distribution is efficiently constructed by convolving together the separate locus distributions. This convolution construction can accurately collate all trillion trillion reference outcomes in a fraction of a second. This paper shows how to rapidly construct multi-locus match strength distributions by convolution. Function convergence demonstrates that distribution accuracy increases with numerical resolution. Convolution construction has quadratic computational complexity, relative to the exponential number of reference genotypes. A suitably defined random variable reduces high-dimensional computational cost to fast real-line arithmetic. Match strength distributions are used in forensic validation studies. They provide error rates for match results. The convolution construction applies to discrete or continuous variables in the forensic, natural and social sciences. Computer-derived match strength distributions elicit the information inherent in DNA evidence, often overlooked by human analysis. Elsevier 2018-10-08 /pmc/articles/PMC6176789/ /pubmed/30310872 http://dx.doi.org/10.1016/j.heliyon.2018.e00824 Text en © 2018 The Author http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Perlin, Mark W.
Efficient construction of match strength distributions for uncertain multi-locus genotypes
title Efficient construction of match strength distributions for uncertain multi-locus genotypes
title_full Efficient construction of match strength distributions for uncertain multi-locus genotypes
title_fullStr Efficient construction of match strength distributions for uncertain multi-locus genotypes
title_full_unstemmed Efficient construction of match strength distributions for uncertain multi-locus genotypes
title_short Efficient construction of match strength distributions for uncertain multi-locus genotypes
title_sort efficient construction of match strength distributions for uncertain multi-locus genotypes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6176789/
https://www.ncbi.nlm.nih.gov/pubmed/30310872
http://dx.doi.org/10.1016/j.heliyon.2018.e00824
work_keys_str_mv AT perlinmarkw efficientconstructionofmatchstrengthdistributionsforuncertainmultilocusgenotypes