Cargando…

Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data

In forensic science, trace evidence found at a crime scene and on suspect has to be evaluated from the measurements performed on them, usually in the form of multivariate data (for example, several chemical compound or physical characteristics). In order to assess the strength of that evidence, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Franco-Pedroso, Javier, Ramos, Daniel, Gonzalez-Rodriguez, Joaquin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762660/
https://www.ncbi.nlm.nih.gov/pubmed/26901680
http://dx.doi.org/10.1371/journal.pone.0149958
_version_ 1782417138426839040
author Franco-Pedroso, Javier
Ramos, Daniel
Gonzalez-Rodriguez, Joaquin
author_facet Franco-Pedroso, Javier
Ramos, Daniel
Gonzalez-Rodriguez, Joaquin
author_sort Franco-Pedroso, Javier
collection PubMed
description In forensic science, trace evidence found at a crime scene and on suspect has to be evaluated from the measurements performed on them, usually in the form of multivariate data (for example, several chemical compound or physical characteristics). In order to assess the strength of that evidence, the likelihood ratio framework is being increasingly adopted. Several methods have been derived in order to obtain likelihood ratios directly from univariate or multivariate data by modelling both the variation appearing between observations (or features) coming from the same source (within-source variation) and that appearing between observations coming from different sources (between-source variation). In the widely used multivariate kernel likelihood-ratio, the within-source distribution is assumed to be normally distributed and constant among different sources and the between-source variation is modelled through a kernel density function (KDF). In order to better fit the observed distribution of the between-source variation, this paper presents a different approach in which a Gaussian mixture model (GMM) is used instead of a KDF. As it will be shown, this approach provides better-calibrated likelihood ratios as measured by the log-likelihood ratio cost (C(llr)) in experiments performed on freely available forensic datasets involving different trace evidences: inks, glass fragments and car paints.
format Online
Article
Text
id pubmed-4762660
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47626602016-03-07 Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data Franco-Pedroso, Javier Ramos, Daniel Gonzalez-Rodriguez, Joaquin PLoS One Research Article In forensic science, trace evidence found at a crime scene and on suspect has to be evaluated from the measurements performed on them, usually in the form of multivariate data (for example, several chemical compound or physical characteristics). In order to assess the strength of that evidence, the likelihood ratio framework is being increasingly adopted. Several methods have been derived in order to obtain likelihood ratios directly from univariate or multivariate data by modelling both the variation appearing between observations (or features) coming from the same source (within-source variation) and that appearing between observations coming from different sources (between-source variation). In the widely used multivariate kernel likelihood-ratio, the within-source distribution is assumed to be normally distributed and constant among different sources and the between-source variation is modelled through a kernel density function (KDF). In order to better fit the observed distribution of the between-source variation, this paper presents a different approach in which a Gaussian mixture model (GMM) is used instead of a KDF. As it will be shown, this approach provides better-calibrated likelihood ratios as measured by the log-likelihood ratio cost (C(llr)) in experiments performed on freely available forensic datasets involving different trace evidences: inks, glass fragments and car paints. Public Library of Science 2016-02-22 /pmc/articles/PMC4762660/ /pubmed/26901680 http://dx.doi.org/10.1371/journal.pone.0149958 Text en © 2016 Franco-Pedroso et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Franco-Pedroso, Javier
Ramos, Daniel
Gonzalez-Rodriguez, Joaquin
Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data
title Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data
title_full Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data
title_fullStr Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data
title_full_unstemmed Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data
title_short Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data
title_sort gaussian mixture models of between-source variation for likelihood ratio computation from multivariate data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762660/
https://www.ncbi.nlm.nih.gov/pubmed/26901680
http://dx.doi.org/10.1371/journal.pone.0149958
work_keys_str_mv AT francopedrosojavier gaussianmixturemodelsofbetweensourcevariationforlikelihoodratiocomputationfrommultivariatedata
AT ramosdaniel gaussianmixturemodelsofbetweensourcevariationforlikelihoodratiocomputationfrommultivariatedata
AT gonzalezrodriguezjoaquin gaussianmixturemodelsofbetweensourcevariationforlikelihoodratiocomputationfrommultivariatedata