An Alu insertion map of the Indian population: identification and analysis in 1021 genomes of the IndiGen project

Actively retrotransposing primate-specific Alu repeats display insertion-deletion (InDel) polymorphism through their insertion at new loci. In the global datasets, Indian populations remain under-represented and so do their Alu InDels. Here, we report the genomic landscape of Alu InDels from the rec...

Bibliographic Details
Main authors: Prakrithi, P, Singhal, Khushboo, Sharma, Disha, Jain, Abhinav, Bhoyar, Rahul C, Imran, Mohamed, Senthilvel, Vigneshwar, Divakar, Mohit Kumar, Mishra, Anushree, Scaria, Vinod, Sivasubbu, Sridhar, Mukerji, Mitali
Format: Online Article Text
Language: English
Published: Oxford University Press 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8846365/
https://www.ncbi.nlm.nih.gov/pubmed/35178516
http://dx.doi.org/10.1093/nargab/lqac009
author Prakrithi, P
Singhal, Khushboo
Sharma, Disha
Jain, Abhinav
Bhoyar, Rahul C
Imran, Mohamed
Senthilvel, Vigneshwar
Divakar, Mohit Kumar
Mishra, Anushree
Scaria, Vinod
Sivasubbu, Sridhar
Mukerji, Mitali
collection PubMed
description Actively retrotransposing primate-specific Alu repeats display insertion-deletion (InDel) polymorphism through their insertion at new loci. Indian populations remain under-represented in global datasets, and so do their Alu InDels. Here, we report the genomic landscape of Alu InDels from the recently released 1021 Indian Genomes (IndiGen) (available at https://clingen.igib.res.in/indigen). We identified 9239 polymorphic Alu insertions that include private (3831), rare (3974) and common (1434) insertions, with an average of 770 insertions per individual. We achieved 89% PCR validation of the predicted genotypes in the 94 samples tested. About 60% of the identified InDels are unique to IndiGen when compared to other global datasets; 23% of sites were shared with both SGDP and HGSVC; among these, 58% (1289 sites) were common polymorphisms in IndiGen. The insertions show a bias for genic regions, with a preference for introns, and the associated genes are enriched for processes such as cell morphogenesis and neurogenesis (P-value < 0.05). Approximately 60% of InDels mapped to genes present in the OMIM database. Finally, we show that 558 InDels can serve as ancestry informative markers to segregate global populations. This study provides a valuable resource of baseline Alu InDels that would be useful in population genomics.
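As a quick consistency check on the counts quoted in the description, the three frequency classes can be tallied against the reported total of 9239 insertions. This is only a minimal sketch: the dictionary and variable names are illustrative and do not come from the paper, and the frequency thresholds that define "private", "rare" and "common" are not stated in this record, so they are not modelled here.

```python
# Counts of polymorphic Alu insertions per frequency class, as quoted in the description.
# The names below are hypothetical; only the numbers come from the record.
alu_insertions = {"private": 3831, "rare": 3974, "common": 1434}
reported_total = 9239

# The three classes should add up to the reported total.
total = sum(alu_insertions.values())
assert total == reported_total, f"classes sum to {total}, expected {reported_total}"

# Share of each class among all polymorphic insertions.
shares = {cls: count / total for cls, count in alu_insertions.items()}
for cls, share in shares.items():
    print(f"{cls}: {alu_insertions[cls]} insertions ({share:.1%})")
```

The same pattern could be reused for any other set of category counts in a record like this one, as long as the categories are mutually exclusive.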
format Online
Article
Text
id pubmed-8846365
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-8846365 2022-02-16 NAR Genom Bioinform Standard Article Oxford University Press 2022-02-15 /pmc/articles/PMC8846365/ /pubmed/35178516 http://dx.doi.org/10.1093/nargab/lqac009 Text en The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.
https://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title An Alu insertion map of the Indian population: identification and analysis in 1021 genomes of the IndiGen project
topic Standard Article