Cargando…

A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers

Single nucleotide polymorphisms (SNPs) support robust analysis on degraded DNA samples. However, the development of a systematic method to interpret the profiles derived from the mixtures is less studied, and it remains a challenge due to the bi-allelic nature of SNP markers. To improve the discrimi...

Descripción completa

Detalles Bibliográficos
Autores principales: Yin, Yu, Zhang, Peng, Xing, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141285/
https://www.ncbi.nlm.nih.gov/pubmed/35627269
http://dx.doi.org/10.3390/genes13050884
_version_ 1784715308606423040
author Yin, Yu
Zhang, Peng
Xing, Yu
author_facet Yin, Yu
Zhang, Peng
Xing, Yu
author_sort Yin, Yu
collection PubMed
description Single nucleotide polymorphisms (SNPs) support robust analysis on degraded DNA samples. However, the development of a systematic method to interpret the profiles derived from the mixtures is less studied, and it remains a challenge due to the bi-allelic nature of SNP markers. To improve the discriminating power of SNPs, this study explored bioinformatic strategies to analyze mixtures. Then, computer-generated mixtures were produced using real-world massively parallel sequencing (MPS) data from the single samples processed with the Precision ID Identity Panel. Moreover, the values of the frequency of major allele reads (F(MAR)) were calculated and applied as key parameters to deconvolve the two-person mixtures and estimate mixture ratios. Four custom R language scripts (three for autosomes and one for Y chromosome) were designed with the K-means clustering method as a core algorithm. Finally, the method was validated with real-world mixtures. The results indicated that the deconvolution accuracy for evenly balanced mixtures was 100% or close to 100%, which was the same as the deconvolution accuracy of inferring the genotypes of the major contributor of unevenly balanced mixtures. Meanwhile, the accuracy of inferring the genotypes of the minor contributor decreased as its proportion in the mixture decreased. Moreover, the estimated mixture ratio was almost equal to the actual ratio between 1:1 and 1:6. The method proposed in this study provides a new paradigm for mixture interpretation, especially for inferring contributor profiles of evenly balanced mixtures and the major contributor profile of unevenly balanced mixtures.
format Online
Article
Text
id pubmed-9141285
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91412852022-05-28 A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers Yin, Yu Zhang, Peng Xing, Yu Genes (Basel) Article Single nucleotide polymorphisms (SNPs) support robust analysis on degraded DNA samples. However, the development of a systematic method to interpret the profiles derived from the mixtures is less studied, and it remains a challenge due to the bi-allelic nature of SNP markers. To improve the discriminating power of SNPs, this study explored bioinformatic strategies to analyze mixtures. Then, computer-generated mixtures were produced using real-world massively parallel sequencing (MPS) data from the single samples processed with the Precision ID Identity Panel. Moreover, the values of the frequency of major allele reads (F(MAR)) were calculated and applied as key parameters to deconvolve the two-person mixtures and estimate mixture ratios. Four custom R language scripts (three for autosomes and one for Y chromosome) were designed with the K-means clustering method as a core algorithm. Finally, the method was validated with real-world mixtures. The results indicated that the deconvolution accuracy for evenly balanced mixtures was 100% or close to 100%, which was the same as the deconvolution accuracy of inferring the genotypes of the major contributor of unevenly balanced mixtures. Meanwhile, the accuracy of inferring the genotypes of the minor contributor decreased as its proportion in the mixture decreased. Moreover, the estimated mixture ratio was almost equal to the actual ratio between 1:1 and 1:6. The method proposed in this study provides a new paradigm for mixture interpretation, especially for inferring contributor profiles of evenly balanced mixtures and the major contributor profile of unevenly balanced mixtures. MDPI 2022-05-15 /pmc/articles/PMC9141285/ /pubmed/35627269 http://dx.doi.org/10.3390/genes13050884 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yin, Yu
Zhang, Peng
Xing, Yu
A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers
title A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers
title_full A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers
title_fullStr A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers
title_full_unstemmed A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers
title_short A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers
title_sort new computational deconvolution algorithm for the analysis of forensic dna mixtures with snp markers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141285/
https://www.ncbi.nlm.nih.gov/pubmed/35627269
http://dx.doi.org/10.3390/genes13050884
work_keys_str_mv AT yinyu anewcomputationaldeconvolutionalgorithmfortheanalysisofforensicdnamixtureswithsnpmarkers
AT zhangpeng anewcomputationaldeconvolutionalgorithmfortheanalysisofforensicdnamixtureswithsnpmarkers
AT xingyu anewcomputationaldeconvolutionalgorithmfortheanalysisofforensicdnamixtureswithsnpmarkers
AT yinyu newcomputationaldeconvolutionalgorithmfortheanalysisofforensicdnamixtureswithsnpmarkers
AT zhangpeng newcomputationaldeconvolutionalgorithmfortheanalysisofforensicdnamixtureswithsnpmarkers
AT xingyu newcomputationaldeconvolutionalgorithmfortheanalysisofforensicdnamixtureswithsnpmarkers