Cargando…

Purifying selection shapes the coincident SNP distribution of primate coding sequences

Genome-wide analysis has observed an excess of coincident single nucleotide polymorphisms (coSNPs) at human-chimpanzee orthologous positions, and suggested that this is due to cryptic variation in the mutation rate. While this phenomenon primarily corresponds with non-coding coSNPs, the situation in...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Chia-Ying, Hung, Li-Yuan, Wu, Chan-Shuo, Chuang, Trees-Juen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4891680/
https://www.ncbi.nlm.nih.gov/pubmed/27255481
http://dx.doi.org/10.1038/srep27272
Descripción
Sumario:Genome-wide analysis has observed an excess of coincident single nucleotide polymorphisms (coSNPs) at human-chimpanzee orthologous positions, and suggested that this is due to cryptic variation in the mutation rate. While this phenomenon primarily corresponds with non-coding coSNPs, the situation in coding sequences remains unclear. Here we calculate the observed-to-expected ratio of coSNPs (coSNP(O/E)) to estimate the prevalence of human-chimpanzee coSNPs, and show that the excess of coSNPs is also present in coding regions. Intriguingly, coSNP(O/E) is much higher at zero-fold than at nonzero-fold degenerate sites; such a difference is due to an elevation of coSNP(O/E) at zero-fold degenerate sites, rather than a reduction at nonzero-fold degenerate ones. These trends are independent of chimpanzee subpopulation, population size, or sequencing techniques; and hold in broad generality across primates. We find that this discrepancy cannot fully explained by sequence contexts, shared ancestral polymorphisms, SNP density, and recombination rate, and that coSNP(O/E) in coding sequences is significantly influenced by purifying selection. We also show that selection and mutation rate affect coSNP(O/E) independently, and coSNPs tend to be less damaging and more correlated with human diseases than non-coSNPs. These suggest that coSNPs may represent a “signature” during primate protein evolution.