Cargando…
US Population Data for 94 Identity-Informative SNP Loci
The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217833/ https://www.ncbi.nlm.nih.gov/pubmed/37239431 http://dx.doi.org/10.3390/genes14051071 |
_version_ | 1785048630730686464 |
---|---|
author | Kiesler, Kevin M. Borsuk, Lisa A. Steffen, Carolyn R. Vallone, Peter M. Gettings, Katherine B. |
author_facet | Kiesler, Kevin M. Borsuk, Lisa A. Steffen, Carolyn R. Vallone, Peter M. Gettings, Katherine B. |
author_sort | Kiesler, Kevin M. |
collection | PubMed |
description | The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact size of iiSNP amplicons compared to short tandem repeat (STR) markers increases the likelihood of successful amplification with degraded DNA samples. Allele frequencies and relevant forensic statistics were calculated for each population group as well as the aggregate population sample. Examination of sequence data in the regions flanking the targeted SNPs identified additional variants, which can be combined with the target SNPs to form microhaplotypes (multiple phased SNPs within a short-read sequence). Comparison of iiSNP performance with and without flanking SNP variation identified four amplicons containing microhaplotypes with observed heterozygosity increases of greater than 15% over the targeted SNP alone. For this set of 1036 samples, comparison of average match probabilities from iiSNPs with the 20 CODIS core STR markers yielded an estimate of 1.7 × 10(−38) for iiSNPs (assuming independence between all 94 SNPs), which was four orders of magnitude lower (more discriminating) than STRs where internal sequence variation was considered, and 10 orders of magnitude lower than STRs using established capillary electrophoresis length-based genotypes. |
format | Online Article Text |
id | pubmed-10217833 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102178332023-05-27 US Population Data for 94 Identity-Informative SNP Loci Kiesler, Kevin M. Borsuk, Lisa A. Steffen, Carolyn R. Vallone, Peter M. Gettings, Katherine B. Genes (Basel) Article The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact size of iiSNP amplicons compared to short tandem repeat (STR) markers increases the likelihood of successful amplification with degraded DNA samples. Allele frequencies and relevant forensic statistics were calculated for each population group as well as the aggregate population sample. Examination of sequence data in the regions flanking the targeted SNPs identified additional variants, which can be combined with the target SNPs to form microhaplotypes (multiple phased SNPs within a short-read sequence). Comparison of iiSNP performance with and without flanking SNP variation identified four amplicons containing microhaplotypes with observed heterozygosity increases of greater than 15% over the targeted SNP alone. For this set of 1036 samples, comparison of average match probabilities from iiSNPs with the 20 CODIS core STR markers yielded an estimate of 1.7 × 10(−38) for iiSNPs (assuming independence between all 94 SNPs), which was four orders of magnitude lower (more discriminating) than STRs where internal sequence variation was considered, and 10 orders of magnitude lower than STRs using established capillary electrophoresis length-based genotypes. MDPI 2023-05-12 /pmc/articles/PMC10217833/ /pubmed/37239431 http://dx.doi.org/10.3390/genes14051071 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kiesler, Kevin M. Borsuk, Lisa A. Steffen, Carolyn R. Vallone, Peter M. Gettings, Katherine B. US Population Data for 94 Identity-Informative SNP Loci |
title | US Population Data for 94 Identity-Informative SNP Loci |
title_full | US Population Data for 94 Identity-Informative SNP Loci |
title_fullStr | US Population Data for 94 Identity-Informative SNP Loci |
title_full_unstemmed | US Population Data for 94 Identity-Informative SNP Loci |
title_short | US Population Data for 94 Identity-Informative SNP Loci |
title_sort | us population data for 94 identity-informative snp loci |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217833/ https://www.ncbi.nlm.nih.gov/pubmed/37239431 http://dx.doi.org/10.3390/genes14051071 |
work_keys_str_mv | AT kieslerkevinm uspopulationdatafor94identityinformativesnploci AT borsuklisaa uspopulationdatafor94identityinformativesnploci AT steffencarolynr uspopulationdatafor94identityinformativesnploci AT vallonepeterm uspopulationdatafor94identityinformativesnploci AT gettingskatherineb uspopulationdatafor94identityinformativesnploci |