Cargando…

A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing

Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massiv...

Descripción completa

Detalles Bibliográficos
Autores principales: Huszar, Tunde I., Jobling, Mark A., Wetton, Jon H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6010625/
https://www.ncbi.nlm.nih.gov/pubmed/29679929
http://dx.doi.org/10.1016/j.fsigen.2018.03.012
_version_ 1783333620602109952
author Huszar, Tunde I.
Jobling, Mark A.
Wetton, Jon H.
author_facet Huszar, Tunde I.
Jobling, Mark A.
Wetton, Jon H.
author_sort Huszar, Tunde I.
collection PubMed
description Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega’s prototype PowerSeq™ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants in their phylogenetic context.
format Online
Article
Text
id pubmed-6010625
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-60106252018-07-01 A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing Huszar, Tunde I. Jobling, Mark A. Wetton, Jon H. Forensic Sci Int Genet Article Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega’s prototype PowerSeq™ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants in their phylogenetic context. Elsevier 2018-07 /pmc/articles/PMC6010625/ /pubmed/29679929 http://dx.doi.org/10.1016/j.fsigen.2018.03.012 Text en © 2018 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Huszar, Tunde I.
Jobling, Mark A.
Wetton, Jon H.
A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing
title A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing
title_full A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing
title_fullStr A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing
title_full_unstemmed A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing
title_short A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing
title_sort phylogenetic framework facilitates y-str variant discovery and classification via massively parallel sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6010625/
https://www.ncbi.nlm.nih.gov/pubmed/29679929
http://dx.doi.org/10.1016/j.fsigen.2018.03.012
work_keys_str_mv AT huszartundei aphylogeneticframeworkfacilitatesystrvariantdiscoveryandclassificationviamassivelyparallelsequencing
AT joblingmarka aphylogeneticframeworkfacilitatesystrvariantdiscoveryandclassificationviamassivelyparallelsequencing
AT wettonjonh aphylogeneticframeworkfacilitatesystrvariantdiscoveryandclassificationviamassivelyparallelsequencing
AT huszartundei phylogeneticframeworkfacilitatesystrvariantdiscoveryandclassificationviamassivelyparallelsequencing
AT joblingmarka phylogeneticframeworkfacilitatesystrvariantdiscoveryandclassificationviamassivelyparallelsequencing
AT wettonjonh phylogeneticframeworkfacilitatesystrvariantdiscoveryandclassificationviamassivelyparallelsequencing