Cargando…
Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
Capillary electrophoresis-based analysis does not reflect the exact allele number variation at the STR loci due to the non-availability of the data on sequence variation in the repeat region and the SNPs in flanking regions. Herein, this study reports the length-based and sequence-based allelic data...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636586/ https://www.ncbi.nlm.nih.gov/pubmed/34853383 http://dx.doi.org/10.1038/s41598-021-02690-5 |
_version_ | 1784608555478810624 |
---|---|
author | Dash, Hirak Ranjan Kaitholia, Kamlesh Kumawat, R. K. Singh, Anil Kumar Shrivastava, Pankaj Chaubey, Gyaneshwer Das, Surajit |
author_facet | Dash, Hirak Ranjan Kaitholia, Kamlesh Kumawat, R. K. Singh, Anil Kumar Shrivastava, Pankaj Chaubey, Gyaneshwer Das, Surajit |
author_sort | Dash, Hirak Ranjan |
collection | PubMed |
description | Capillary electrophoresis-based analysis does not reflect the exact allele number variation at the STR loci due to the non-availability of the data on sequence variation in the repeat region and the SNPs in flanking regions. Herein, this study reports the length-based and sequence-based allelic data of 138 central Indian individuals at 31 autosomal STR loci by NGS. The sequence data at each allele was compared to the reference hg19 sequence. The length-based allelic results were found in concordance with the CE-based results. 20 out of 31 autosomal STR loci showed an increase in the number of alleles by the presence of sequence variation and/or SNPs in the flanking regions. The highest gain in the heterozygosity and allele numbers was observed in D5S2800, D1S1656, D16S539, D5S818, and vWA. rs25768 (A/G) at D5S818 was found to be the most frequent SNP in the studied population. Allele no. 15 of D3S1358, allele no. 19 of D2S1338, and allele no. 22 of D12S391 showed 5 isoalleles each with the same size and with different intervening sequences. Length-based determination of the alleles showed Penta E to be the most useful marker in the central Indian population among 31 STRs studied; however, sequence-based analysis advocated D2S1338 to be the most useful marker in terms of various forensic parameters. Population genetics analysis showed a shared genetic ancestry of the studied population with other Indian populations. This first-ever study to the best of our knowledge on sequence-based STR analysis in the central Indian population is expected to prove the use of NGS in forensic case-work and in forensic DNA laboratories. |
format | Online Article Text |
id | pubmed-8636586 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86365862021-12-03 Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) Dash, Hirak Ranjan Kaitholia, Kamlesh Kumawat, R. K. Singh, Anil Kumar Shrivastava, Pankaj Chaubey, Gyaneshwer Das, Surajit Sci Rep Article Capillary electrophoresis-based analysis does not reflect the exact allele number variation at the STR loci due to the non-availability of the data on sequence variation in the repeat region and the SNPs in flanking regions. Herein, this study reports the length-based and sequence-based allelic data of 138 central Indian individuals at 31 autosomal STR loci by NGS. The sequence data at each allele was compared to the reference hg19 sequence. The length-based allelic results were found in concordance with the CE-based results. 20 out of 31 autosomal STR loci showed an increase in the number of alleles by the presence of sequence variation and/or SNPs in the flanking regions. The highest gain in the heterozygosity and allele numbers was observed in D5S2800, D1S1656, D16S539, D5S818, and vWA. rs25768 (A/G) at D5S818 was found to be the most frequent SNP in the studied population. Allele no. 15 of D3S1358, allele no. 19 of D2S1338, and allele no. 22 of D12S391 showed 5 isoalleles each with the same size and with different intervening sequences. Length-based determination of the alleles showed Penta E to be the most useful marker in the central Indian population among 31 STRs studied; however, sequence-based analysis advocated D2S1338 to be the most useful marker in terms of various forensic parameters. Population genetics analysis showed a shared genetic ancestry of the studied population with other Indian populations. This first-ever study to the best of our knowledge on sequence-based STR analysis in the central Indian population is expected to prove the use of NGS in forensic case-work and in forensic DNA laboratories. Nature Publishing Group UK 2021-12-01 /pmc/articles/PMC8636586/ /pubmed/34853383 http://dx.doi.org/10.1038/s41598-021-02690-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Dash, Hirak Ranjan Kaitholia, Kamlesh Kumawat, R. K. Singh, Anil Kumar Shrivastava, Pankaj Chaubey, Gyaneshwer Das, Surajit Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) |
title | Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) |
title_full | Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) |
title_fullStr | Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) |
title_full_unstemmed | Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) |
title_short | Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS) |
title_sort | sequence variations, flanking region mutations, and allele frequency at 31 autosomal strs in the central indian population by next generation sequencing (ngs) |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636586/ https://www.ncbi.nlm.nih.gov/pubmed/34853383 http://dx.doi.org/10.1038/s41598-021-02690-5 |
work_keys_str_mv | AT dashhirakranjan sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs AT kaitholiakamlesh sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs AT kumawatrk sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs AT singhanilkumar sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs AT shrivastavapankaj sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs AT chaubeygyaneshwer sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs AT dassurajit sequencevariationsflankingregionmutationsandallelefrequencyat31autosomalstrsinthecentralindianpopulationbynextgenerationsequencingngs |