Cargando…

Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)

Capillary electrophoresis-based analysis does not reflect the exact allele number variation at the STR loci due to the non-availability of the data on sequence variation in the repeat region and the SNPs in flanking regions. Herein, this study reports the length-based and sequence-based allelic data...

Descripción completa

Detalles Bibliográficos
Autores principales: Dash, Hirak Ranjan, Kaitholia, Kamlesh, Kumawat, R. K., Singh, Anil Kumar, Shrivastava, Pankaj, Chaubey, Gyaneshwer, Das, Surajit
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636586/
https://www.ncbi.nlm.nih.gov/pubmed/34853383
http://dx.doi.org/10.1038/s41598-021-02690-5
_version_ 1784608555478810624
author Dash, Hirak Ranjan
Kaitholia, Kamlesh
Kumawat, R. K.
Singh, Anil Kumar
Shrivastava, Pankaj
Chaubey, Gyaneshwer
Das, Surajit
author_facet Dash, Hirak Ranjan
Kaitholia, Kamlesh
Kumawat, R. K.
Singh, Anil Kumar
Shrivastava, Pankaj
Chaubey, Gyaneshwer
Das, Surajit
author_sort Dash, Hirak Ranjan
collection PubMed
description Capillary electrophoresis-based analysis does not reflect the exact allele number variation at the STR loci due to the non-availability of the data on sequence variation in the repeat region and the SNPs in flanking regions. Herein, this study reports the length-based and sequence-based allelic data of 138 central Indian individuals at 31 autosomal STR loci by NGS. The sequence data at each allele was compared to the reference hg19 sequence. The length-based allelic results were found in concordance with the CE-based results. 20 out of 31 autosomal STR loci showed an increase in the number of alleles by the presence of sequence variation and/or SNPs in the flanking regions. The highest gain in the heterozygosity and allele numbers was observed in D5S2800, D1S1656, D16S539, D5S818, and vWA. rs25768 (A/G) at D5S818 was found to be the most frequent SNP in the studied population. Allele no. 15 of D3S1358, allele no. 19 of D2S1338, and allele no. 22 of D12S391 showed 5 isoalleles each with the same size and with different intervening sequences. Length-based determination of the alleles showed Penta E to be the most useful marker in the central Indian population among 31 STRs studied; however, sequence-based analysis advocated D2S1338 to be the most useful marker in terms of various forensic parameters. Population genetics analysis showed a shared genetic ancestry of the studied population with other Indian populations. This first-ever study to the best of our knowledge on sequence-based STR analysis in the central Indian population is expected to prove the use of NGS in forensic case-work and in forensic DNA laboratories.
format Online
Article
Text
id pubmed-8636586
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86365862021-12-03 Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) Dash, Hirak Ranjan Kaitholia, Kamlesh Kumawat, R. K. Singh, Anil Kumar Shrivastava, Pankaj Chaubey, Gyaneshwer Das, Surajit Sci Rep Article Capillary electrophoresis-based analysis does not reflect the exact allele number variation at the STR loci due to the non-availability of the data on sequence variation in the repeat region and the SNPs in flanking regions. Herein, this study reports the length-based and sequence-based allelic data of 138 central Indian individuals at 31 autosomal STR loci by NGS. The sequence data at each allele was compared to the reference hg19 sequence. The length-based allelic results were found in concordance with the CE-based results. 20 out of 31 autosomal STR loci showed an increase in the number of alleles by the presence of sequence variation and/or SNPs in the flanking regions. The highest gain in the heterozygosity and allele numbers was observed in D5S2800, D1S1656, D16S539, D5S818, and vWA. rs25768 (A/G) at D5S818 was found to be the most frequent SNP in the studied population. Allele no. 15 of D3S1358, allele no. 19 of D2S1338, and allele no. 22 of D12S391 showed 5 isoalleles each with the same size and with different intervening sequences. Length-based determination of the alleles showed Penta E to be the most useful marker in the central Indian population among 31 STRs studied; however, sequence-based analysis advocated D2S1338 to be the most useful marker in terms of various forensic parameters. Population genetics analysis showed a shared genetic ancestry of the studied population with other Indian populations. This first-ever study to the best of our knowledge on sequence-based STR analysis in the central Indian population is expected to prove the use of NGS in forensic case-work and in forensic DNA laboratories. Nature Publishing Group UK 2021-12-01 /pmc/articles/PMC8636586/ /pubmed/34853383 http://dx.doi.org/10.1038/s41598-021-02690-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Dash, Hirak Ranjan
Kaitholia, Kamlesh
Kumawat, R. K.
Singh, Anil Kumar
Shrivastava, Pankaj
Chaubey, Gyaneshwer
Das, Surajit
Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
title Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
title_full Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
title_fullStr Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
title_full_unstemmed Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
title_short Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
title_sort sequence variations, flanking region mutations, and allele frequency at 31 autosomal strs in the central indian population by next generation sequencing (ngs)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636586/
https://www.ncbi.nlm.nih.gov/pubmed/34853383
http://dx.doi.org/10.1038/s41598-021-02690-5
work_keys_str_mv AT dashhirakranjan sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs
AT kaitholiakamlesh sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs
AT kumawatrk sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs
AT singhanilkumar sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs
AT shrivastavapankaj sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs
AT chaubeygyaneshwer sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs
AT dassurajit sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs