Cargando…

A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes

This manuscript reports Y-chromosomal short tandem repeat (Y-STR) haplotypes for 1032 male U.S. population samples across 30 Y-STR loci characterized by three capillary electrophoresis (CE) length-based kits (PowerPlex Y23 System, Yfiler Plus PCR Amplification Kit, and Investigator Argus Y-28 QS Kit...

Descripción completa

Detalles Bibliográficos
Autores principales: Steffen, Carolyn R., Huszar, Tunde I., Borsuk, Lisa A., Vallone, Peter M., Gettings, Katherine B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9901497/
https://www.ncbi.nlm.nih.gov/pubmed/35007854
http://dx.doi.org/10.1016/j.fsigen.2021.102655
_version_ 1784883039107547136
author Steffen, Carolyn R.
Huszar, Tunde I.
Borsuk, Lisa A.
Vallone, Peter M.
Gettings, Katherine B.
author_facet Steffen, Carolyn R.
Huszar, Tunde I.
Borsuk, Lisa A.
Vallone, Peter M.
Gettings, Katherine B.
author_sort Steffen, Carolyn R.
collection PubMed
description This manuscript reports Y-chromosomal short tandem repeat (Y-STR) haplotypes for 1032 male U.S. population samples across 30 Y-STR loci characterized by three capillary electrophoresis (CE) length-based kits (PowerPlex Y23 System, Yfiler Plus PCR Amplification Kit, and Investigator Argus Y-28 QS Kit) and one sequence-based kit (ForenSeq DNA Signature Prep Kit): DYF387S1, DYS19, DYS385 a/b, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS449, DYS456, DYS458, DYS460, DYS481, DYS505, DYS518, DYS522, DYS533, DYS549, DYS570, DYS576, DYS612, DYS627, DYS635, DYS643, and Y-GATA-H4. The length-based Y-STR haplotypes include six loci that are not reported in the sequence-based kit (DYS393, DYS449, DYS456, DYS458, DYS518, and DYS627), whereas three loci included in the sequence-based kit are not present in length-based kits (DYS505, DYS522, and DYS612). For the latter, a custom multiplex was used to generate CE length-based data, allowing 1032 samples to be evaluated for concordance across the 30 Y-STR loci included in these four commercial Y-STR typing kits. Discordances between typing methods were analyzed further to assess underlying causes such as primer binding site mutations and flanking region insertions/deletions. Allele-level frequency and statistical information is provided for sequenced loci, excluding the multi-copy loci DYF387S1 and DYS385 a/b, for which locus-specific haplotype-level frequencies are provided instead. The resulting data reveals the degree of information gained through sequencing: 88% of sequenced Y-STR loci contain additional sequence-based alleles compared to length-based data, with the DYS389II locus containing the most additional alleles (51) observed by sequencing. Despite these allelic increases, only minimal improvement was observed in haplotype resolution by sequence, with all four commercial kits providing a similar ability to differentiate length-based haplotypes in this sample set. Finally, a subset of 369 male samples were compared to their corresponding additionally sequenced father samples, revealing the sequence basis for the 50 length-based changes observed, and no additional sequence-based mutations. GenBank accession numbers are reported for each unique sequence, and associated records are available in the STRSeq Y-Chromosomal STR Loci National Center for Biotechnology Information (NCBI) BioProject, accession PRJNA380347. Haplotype data is updated in the Y-STR Haplotype Reference Database (YHRD) for the ‘NIST 1032’ data set to now achieve the level of maximal haplotype of YHRD. All supplementary files including revisions to previously published Y-STR data are available in the NIST Public Data Repository: U.S. population data for human identification markers, DOI 10.18434/t4/1500024.
format Online
Article
Text
id pubmed-9901497
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-99014972023-02-06 A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes Steffen, Carolyn R. Huszar, Tunde I. Borsuk, Lisa A. Vallone, Peter M. Gettings, Katherine B. Forensic Sci Int Genet Article This manuscript reports Y-chromosomal short tandem repeat (Y-STR) haplotypes for 1032 male U.S. population samples across 30 Y-STR loci characterized by three capillary electrophoresis (CE) length-based kits (PowerPlex Y23 System, Yfiler Plus PCR Amplification Kit, and Investigator Argus Y-28 QS Kit) and one sequence-based kit (ForenSeq DNA Signature Prep Kit): DYF387S1, DYS19, DYS385 a/b, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS449, DYS456, DYS458, DYS460, DYS481, DYS505, DYS518, DYS522, DYS533, DYS549, DYS570, DYS576, DYS612, DYS627, DYS635, DYS643, and Y-GATA-H4. The length-based Y-STR haplotypes include six loci that are not reported in the sequence-based kit (DYS393, DYS449, DYS456, DYS458, DYS518, and DYS627), whereas three loci included in the sequence-based kit are not present in length-based kits (DYS505, DYS522, and DYS612). For the latter, a custom multiplex was used to generate CE length-based data, allowing 1032 samples to be evaluated for concordance across the 30 Y-STR loci included in these four commercial Y-STR typing kits. Discordances between typing methods were analyzed further to assess underlying causes such as primer binding site mutations and flanking region insertions/deletions. Allele-level frequency and statistical information is provided for sequenced loci, excluding the multi-copy loci DYF387S1 and DYS385 a/b, for which locus-specific haplotype-level frequencies are provided instead. The resulting data reveals the degree of information gained through sequencing: 88% of sequenced Y-STR loci contain additional sequence-based alleles compared to length-based data, with the DYS389II locus containing the most additional alleles (51) observed by sequencing. Despite these allelic increases, only minimal improvement was observed in haplotype resolution by sequence, with all four commercial kits providing a similar ability to differentiate length-based haplotypes in this sample set. Finally, a subset of 369 male samples were compared to their corresponding additionally sequenced father samples, revealing the sequence basis for the 50 length-based changes observed, and no additional sequence-based mutations. GenBank accession numbers are reported for each unique sequence, and associated records are available in the STRSeq Y-Chromosomal STR Loci National Center for Biotechnology Information (NCBI) BioProject, accession PRJNA380347. Haplotype data is updated in the Y-STR Haplotype Reference Database (YHRD) for the ‘NIST 1032’ data set to now achieve the level of maximal haplotype of YHRD. All supplementary files including revisions to previously published Y-STR data are available in the NIST Public Data Repository: U.S. population data for human identification markers, DOI 10.18434/t4/1500024. 2022-03 2021-12-28 /pmc/articles/PMC9901497/ /pubmed/35007854 http://dx.doi.org/10.1016/j.fsigen.2021.102655 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ).
spellingShingle Article
Steffen, Carolyn R.
Huszar, Tunde I.
Borsuk, Lisa A.
Vallone, Peter M.
Gettings, Katherine B.
A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes
title A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes
title_full A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes
title_fullStr A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes
title_full_unstemmed A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes
title_short A multi-dimensional evaluation of the ‘NIST 1032’ sample set across four forensic Y-STR multiplexes
title_sort multi-dimensional evaluation of the ‘nist 1032’ sample set across four forensic y-str multiplexes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9901497/
https://www.ncbi.nlm.nih.gov/pubmed/35007854
http://dx.doi.org/10.1016/j.fsigen.2021.102655
work_keys_str_mv AT steffencarolynr amultidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT huszartundei amultidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT borsuklisaa amultidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT vallonepeterm amultidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT gettingskatherineb amultidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT steffencarolynr multidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT huszartundei multidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT borsuklisaa multidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT vallonepeterm multidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes
AT gettingskatherineb multidimensionalevaluationofthenist1032samplesetacrossfourforensicystrmultiplexes