Cargando…

Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India

India represents an intricate tapestry of population substructure shaped by geography, language, culture, and social stratification. Although geography closely correlates with genetic structure in other parts of the world, the strict endogamy imposed by the Indian caste system and the large number o...

Descripción completa

Detalles Bibliográficos
Autores principales: Bose, Aritra, Platt, Daniel E, Parida, Laxmi, Drineas, Petros, Paschou, Peristera
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8097304/
https://www.ncbi.nlm.nih.gov/pubmed/33481022
http://dx.doi.org/10.1093/molbev/msaa321
_version_ 1783688327569866752
author Bose, Aritra
Platt, Daniel E
Parida, Laxmi
Drineas, Petros
Paschou, Peristera
author_facet Bose, Aritra
Platt, Daniel E
Parida, Laxmi
Drineas, Petros
Paschou, Peristera
author_sort Bose, Aritra
collection PubMed
description India represents an intricate tapestry of population substructure shaped by geography, language, culture, and social stratification. Although geography closely correlates with genetic structure in other parts of the world, the strict endogamy imposed by the Indian caste system and the large number of spoken languages add further levels of complexity to understand Indian population structure. To date, no study has attempted to model and evaluate how these factors have interacted to shape the patterns of genetic diversity within India. We merged all publicly available data from the Indian subcontinent into a data set of 891 individuals from 90 well-defined groups. Bringing together geography, genetics, and demographic factors, we developed Correlation Optimization of Genetics and Geodemographics to build a model that explains the observed population genetic substructure. We show that shared language along with social structure have been the most powerful forces in creating paths of gene flow in the subcontinent. Furthermore, we discover the ethnic groups that best capture the diverse genetic substructure using a ridge leverage score statistic. Integrating data from India with a data set of additional 1,323 individuals from 50 Eurasian populations, we find that Indo-European and Dravidian speakers of India show shared genetic drift with Europeans, whereas the Tibeto-Burman speaking tribal groups have maximum shared genetic drift with East Asians.
format Online
Article
Text
id pubmed-8097304
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80973042021-05-10 Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India Bose, Aritra Platt, Daniel E Parida, Laxmi Drineas, Petros Paschou, Peristera Mol Biol Evol Discoveries India represents an intricate tapestry of population substructure shaped by geography, language, culture, and social stratification. Although geography closely correlates with genetic structure in other parts of the world, the strict endogamy imposed by the Indian caste system and the large number of spoken languages add further levels of complexity to understand Indian population structure. To date, no study has attempted to model and evaluate how these factors have interacted to shape the patterns of genetic diversity within India. We merged all publicly available data from the Indian subcontinent into a data set of 891 individuals from 90 well-defined groups. Bringing together geography, genetics, and demographic factors, we developed Correlation Optimization of Genetics and Geodemographics to build a model that explains the observed population genetic substructure. We show that shared language along with social structure have been the most powerful forces in creating paths of gene flow in the subcontinent. Furthermore, we discover the ethnic groups that best capture the diverse genetic substructure using a ridge leverage score statistic. Integrating data from India with a data set of additional 1,323 individuals from 50 Eurasian populations, we find that Indo-European and Dravidian speakers of India show shared genetic drift with Europeans, whereas the Tibeto-Burman speaking tribal groups have maximum shared genetic drift with East Asians. Oxford University Press 2021-01-22 /pmc/articles/PMC8097304/ /pubmed/33481022 http://dx.doi.org/10.1093/molbev/msaa321 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Discoveries
Bose, Aritra
Platt, Daniel E
Parida, Laxmi
Drineas, Petros
Paschou, Peristera
Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India
title Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India
title_full Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India
title_fullStr Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India
title_full_unstemmed Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India
title_short Integrating Linguistics, Social Structure, and Geography to Model Genetic Diversity within India
title_sort integrating linguistics, social structure, and geography to model genetic diversity within india
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8097304/
https://www.ncbi.nlm.nih.gov/pubmed/33481022
http://dx.doi.org/10.1093/molbev/msaa321
work_keys_str_mv AT bosearitra integratinglinguisticssocialstructureandgeographytomodelgeneticdiversitywithinindia
AT plattdaniele integratinglinguisticssocialstructureandgeographytomodelgeneticdiversitywithinindia
AT paridalaxmi integratinglinguisticssocialstructureandgeographytomodelgeneticdiversitywithinindia
AT drineaspetros integratinglinguisticssocialstructureandgeographytomodelgeneticdiversitywithinindia
AT paschouperistera integratinglinguisticssocialstructureandgeographytomodelgeneticdiversitywithinindia