Cargando…
Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes
The CpG dinucleotide is disproportionately represented in human genetic variation due to the hypermutability of 5-methyl-cytosine (5mC). We exploit this hypermutability and a novel codon substitution model to identify candidate functionally important exonic nucleotides. Population genetic theory sug...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184784/ https://www.ncbi.nlm.nih.gov/pubmed/21398426 http://dx.doi.org/10.1093/gbe/evr021 |
_version_ | 1782213144069799936 |
---|---|
author | Ying, Hua Huttley, Gavin |
author_facet | Ying, Hua Huttley, Gavin |
author_sort | Ying, Hua |
collection | PubMed |
description | The CpG dinucleotide is disproportionately represented in human genetic variation due to the hypermutability of 5-methyl-cytosine (5mC). We exploit this hypermutability and a novel codon substitution model to identify candidate functionally important exonic nucleotides. Population genetic theory suggests that codon positions with high cross-species CpG frequency will derive from stronger purifying selection. Using the phylogeny-based maximum likelihood inference framework, we applied codon substitution models with context-dependent parameters to measure the mutagenic and selective processes affecting CpG dinucleotides within exonic sequence. The suitability of these models was validated on >2,000 protein coding genes from a naturally occurring biological control, four yeast species that do not methylate their DNA. As expected, our analyses of yeast revealed no evidence for an elevated CpG transition rate or for substitution suppression affecting CpG-containing codons. Our analyses of >12,000 protein-coding genes from four primate lineages confirm the systemic influence of 5mC hypermutability on the divergence of these genes. After adjusting for confounding influences of mutation and the properties of the encoded amino acids, we confirmed that CpG-containing codons are under greater purifying selection in primates. Genes with significant evidence of enhanced suppression of nonsynonymous CpG changes were also shown to be significantly enriched in Online Mendelian Inheritance in Man. We developed a method for ranking candidate phenotypically influential CpG positions in human genes. Application of this method indicates that of the ∼1 million exonic CpG dinucleotides within humans, ∼20% are strong candidates for both hypermutability and disease association. |
format | Online Article Text |
id | pubmed-3184784 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31847842011-10-03 Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes Ying, Hua Huttley, Gavin Genome Biol Evol Research Articles The CpG dinucleotide is disproportionately represented in human genetic variation due to the hypermutability of 5-methyl-cytosine (5mC). We exploit this hypermutability and a novel codon substitution model to identify candidate functionally important exonic nucleotides. Population genetic theory suggests that codon positions with high cross-species CpG frequency will derive from stronger purifying selection. Using the phylogeny-based maximum likelihood inference framework, we applied codon substitution models with context-dependent parameters to measure the mutagenic and selective processes affecting CpG dinucleotides within exonic sequence. The suitability of these models was validated on >2,000 protein coding genes from a naturally occurring biological control, four yeast species that do not methylate their DNA. As expected, our analyses of yeast revealed no evidence for an elevated CpG transition rate or for substitution suppression affecting CpG-containing codons. Our analyses of >12,000 protein-coding genes from four primate lineages confirm the systemic influence of 5mC hypermutability on the divergence of these genes. After adjusting for confounding influences of mutation and the properties of the encoded amino acids, we confirmed that CpG-containing codons are under greater purifying selection in primates. Genes with significant evidence of enhanced suppression of nonsynonymous CpG changes were also shown to be significantly enriched in Online Mendelian Inheritance in Man. We developed a method for ranking candidate phenotypically influential CpG positions in human genes. Application of this method indicates that of the ∼1 million exonic CpG dinucleotides within humans, ∼20% are strong candidates for both hypermutability and disease association. Oxford University Press 2011-03-11 /pmc/articles/PMC3184784/ /pubmed/21398426 http://dx.doi.org/10.1093/gbe/evr021 Text en © The Author(s) 2011. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Ying, Hua Huttley, Gavin Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes |
title | Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes |
title_full | Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes |
title_fullStr | Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes |
title_full_unstemmed | Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes |
title_short | Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes |
title_sort | exploiting cpg hypermutability to identify phenotypically significant variation within human protein-coding genes |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184784/ https://www.ncbi.nlm.nih.gov/pubmed/21398426 http://dx.doi.org/10.1093/gbe/evr021 |
work_keys_str_mv | AT yinghua exploitingcpghypermutabilitytoidentifyphenotypicallysignificantvariationwithinhumanproteincodinggenes AT huttleygavin exploitingcpghypermutabilitytoidentifyphenotypicallysignificantvariationwithinhumanproteincodinggenes |