Cargando…

Purifying selection shapes the coincident SNP distribution of primate coding sequences

Genome-wide analysis has observed an excess of coincident single nucleotide polymorphisms (coSNPs) at human-chimpanzee orthologous positions, and suggested that this is due to cryptic variation in the mutation rate. While this phenomenon primarily corresponds with non-coding coSNPs, the situation in...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Chia-Ying, Hung, Li-Yuan, Wu, Chan-Shuo, Chuang, Trees-Juen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4891680/
https://www.ncbi.nlm.nih.gov/pubmed/27255481
http://dx.doi.org/10.1038/srep27272
_version_ 1782435312046178304
author Chen, Chia-Ying
Hung, Li-Yuan
Wu, Chan-Shuo
Chuang, Trees-Juen
author_facet Chen, Chia-Ying
Hung, Li-Yuan
Wu, Chan-Shuo
Chuang, Trees-Juen
author_sort Chen, Chia-Ying
collection PubMed
description Genome-wide analysis has observed an excess of coincident single nucleotide polymorphisms (coSNPs) at human-chimpanzee orthologous positions, and suggested that this is due to cryptic variation in the mutation rate. While this phenomenon primarily corresponds with non-coding coSNPs, the situation in coding sequences remains unclear. Here we calculate the observed-to-expected ratio of coSNPs (coSNP(O/E)) to estimate the prevalence of human-chimpanzee coSNPs, and show that the excess of coSNPs is also present in coding regions. Intriguingly, coSNP(O/E) is much higher at zero-fold than at nonzero-fold degenerate sites; such a difference is due to an elevation of coSNP(O/E) at zero-fold degenerate sites, rather than a reduction at nonzero-fold degenerate ones. These trends are independent of chimpanzee subpopulation, population size, or sequencing techniques; and hold in broad generality across primates. We find that this discrepancy cannot fully explained by sequence contexts, shared ancestral polymorphisms, SNP density, and recombination rate, and that coSNP(O/E) in coding sequences is significantly influenced by purifying selection. We also show that selection and mutation rate affect coSNP(O/E) independently, and coSNPs tend to be less damaging and more correlated with human diseases than non-coSNPs. These suggest that coSNPs may represent a “signature” during primate protein evolution.
format Online
Article
Text
id pubmed-4891680
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-48916802016-06-09 Purifying selection shapes the coincident SNP distribution of primate coding sequences Chen, Chia-Ying Hung, Li-Yuan Wu, Chan-Shuo Chuang, Trees-Juen Sci Rep Article Genome-wide analysis has observed an excess of coincident single nucleotide polymorphisms (coSNPs) at human-chimpanzee orthologous positions, and suggested that this is due to cryptic variation in the mutation rate. While this phenomenon primarily corresponds with non-coding coSNPs, the situation in coding sequences remains unclear. Here we calculate the observed-to-expected ratio of coSNPs (coSNP(O/E)) to estimate the prevalence of human-chimpanzee coSNPs, and show that the excess of coSNPs is also present in coding regions. Intriguingly, coSNP(O/E) is much higher at zero-fold than at nonzero-fold degenerate sites; such a difference is due to an elevation of coSNP(O/E) at zero-fold degenerate sites, rather than a reduction at nonzero-fold degenerate ones. These trends are independent of chimpanzee subpopulation, population size, or sequencing techniques; and hold in broad generality across primates. We find that this discrepancy cannot fully explained by sequence contexts, shared ancestral polymorphisms, SNP density, and recombination rate, and that coSNP(O/E) in coding sequences is significantly influenced by purifying selection. We also show that selection and mutation rate affect coSNP(O/E) independently, and coSNPs tend to be less damaging and more correlated with human diseases than non-coSNPs. These suggest that coSNPs may represent a “signature” during primate protein evolution. Nature Publishing Group 2016-06-03 /pmc/articles/PMC4891680/ /pubmed/27255481 http://dx.doi.org/10.1038/srep27272 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Chen, Chia-Ying
Hung, Li-Yuan
Wu, Chan-Shuo
Chuang, Trees-Juen
Purifying selection shapes the coincident SNP distribution of primate coding sequences
title Purifying selection shapes the coincident SNP distribution of primate coding sequences
title_full Purifying selection shapes the coincident SNP distribution of primate coding sequences
title_fullStr Purifying selection shapes the coincident SNP distribution of primate coding sequences
title_full_unstemmed Purifying selection shapes the coincident SNP distribution of primate coding sequences
title_short Purifying selection shapes the coincident SNP distribution of primate coding sequences
title_sort purifying selection shapes the coincident snp distribution of primate coding sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4891680/
https://www.ncbi.nlm.nih.gov/pubmed/27255481
http://dx.doi.org/10.1038/srep27272
work_keys_str_mv AT chenchiaying purifyingselectionshapesthecoincidentsnpdistributionofprimatecodingsequences
AT hungliyuan purifyingselectionshapesthecoincidentsnpdistributionofprimatecodingsequences
AT wuchanshuo purifyingselectionshapesthecoincidentsnpdistributionofprimatecodingsequences
AT chuangtreesjuen purifyingselectionshapesthecoincidentsnpdistributionofprimatecodingsequences