Cargando…

US Population Data for 94 Identity-Informative SNP Loci

The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact...

Descripción completa

Detalles Bibliográficos
Autores principales: Kiesler, Kevin M., Borsuk, Lisa A., Steffen, Carolyn R., Vallone, Peter M., Gettings, Katherine B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217833/
https://www.ncbi.nlm.nih.gov/pubmed/37239431
http://dx.doi.org/10.3390/genes14051071
_version_ 1785048630730686464
author Kiesler, Kevin M.
Borsuk, Lisa A.
Steffen, Carolyn R.
Vallone, Peter M.
Gettings, Katherine B.
author_facet Kiesler, Kevin M.
Borsuk, Lisa A.
Steffen, Carolyn R.
Vallone, Peter M.
Gettings, Katherine B.
author_sort Kiesler, Kevin M.
collection PubMed
description The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact size of iiSNP amplicons compared to short tandem repeat (STR) markers increases the likelihood of successful amplification with degraded DNA samples. Allele frequencies and relevant forensic statistics were calculated for each population group as well as the aggregate population sample. Examination of sequence data in the regions flanking the targeted SNPs identified additional variants, which can be combined with the target SNPs to form microhaplotypes (multiple phased SNPs within a short-read sequence). Comparison of iiSNP performance with and without flanking SNP variation identified four amplicons containing microhaplotypes with observed heterozygosity increases of greater than 15% over the targeted SNP alone. For this set of 1036 samples, comparison of average match probabilities from iiSNPs with the 20 CODIS core STR markers yielded an estimate of 1.7 × 10(−38) for iiSNPs (assuming independence between all 94 SNPs), which was four orders of magnitude lower (more discriminating) than STRs where internal sequence variation was considered, and 10 orders of magnitude lower than STRs using established capillary electrophoresis length-based genotypes.
format Online
Article
Text
id pubmed-10217833
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102178332023-05-27 US Population Data for 94 Identity-Informative SNP Loci Kiesler, Kevin M. Borsuk, Lisa A. Steffen, Carolyn R. Vallone, Peter M. Gettings, Katherine B. Genes (Basel) Article The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact size of iiSNP amplicons compared to short tandem repeat (STR) markers increases the likelihood of successful amplification with degraded DNA samples. Allele frequencies and relevant forensic statistics were calculated for each population group as well as the aggregate population sample. Examination of sequence data in the regions flanking the targeted SNPs identified additional variants, which can be combined with the target SNPs to form microhaplotypes (multiple phased SNPs within a short-read sequence). Comparison of iiSNP performance with and without flanking SNP variation identified four amplicons containing microhaplotypes with observed heterozygosity increases of greater than 15% over the targeted SNP alone. For this set of 1036 samples, comparison of average match probabilities from iiSNPs with the 20 CODIS core STR markers yielded an estimate of 1.7 × 10(−38) for iiSNPs (assuming independence between all 94 SNPs), which was four orders of magnitude lower (more discriminating) than STRs where internal sequence variation was considered, and 10 orders of magnitude lower than STRs using established capillary electrophoresis length-based genotypes. MDPI 2023-05-12 /pmc/articles/PMC10217833/ /pubmed/37239431 http://dx.doi.org/10.3390/genes14051071 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kiesler, Kevin M.
Borsuk, Lisa A.
Steffen, Carolyn R.
Vallone, Peter M.
Gettings, Katherine B.
US Population Data for 94 Identity-Informative SNP Loci
title US Population Data for 94 Identity-Informative SNP Loci
title_full US Population Data for 94 Identity-Informative SNP Loci
title_fullStr US Population Data for 94 Identity-Informative SNP Loci
title_full_unstemmed US Population Data for 94 Identity-Informative SNP Loci
title_short US Population Data for 94 Identity-Informative SNP Loci
title_sort us population data for 94 identity-informative snp loci
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217833/
https://www.ncbi.nlm.nih.gov/pubmed/37239431
http://dx.doi.org/10.3390/genes14051071
work_keys_str_mv AT kieslerkevinm uspopulationdatafor94identityinformativesnploci
AT borsuklisaa uspopulationdatafor94identityinformativesnploci
AT steffencarolynr uspopulationdatafor94identityinformativesnploci
AT vallonepeterm uspopulationdatafor94identityinformativesnploci
AT gettingskatherineb uspopulationdatafor94identityinformativesnploci