Cargando…

Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes

The CpG dinucleotide is disproportionately represented in human genetic variation due to the hypermutability of 5-methyl-cytosine (5mC). We exploit this hypermutability and a novel codon substitution model to identify candidate functionally important exonic nucleotides. Population genetic theory sug...

Descripción completa

Detalles Bibliográficos
Autores principales: Ying, Hua, Huttley, Gavin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184784/
https://www.ncbi.nlm.nih.gov/pubmed/21398426
http://dx.doi.org/10.1093/gbe/evr021
_version_ 1782213144069799936
author Ying, Hua
Huttley, Gavin
author_facet Ying, Hua
Huttley, Gavin
author_sort Ying, Hua
collection PubMed
description The CpG dinucleotide is disproportionately represented in human genetic variation due to the hypermutability of 5-methyl-cytosine (5mC). We exploit this hypermutability and a novel codon substitution model to identify candidate functionally important exonic nucleotides. Population genetic theory suggests that codon positions with high cross-species CpG frequency will derive from stronger purifying selection. Using the phylogeny-based maximum likelihood inference framework, we applied codon substitution models with context-dependent parameters to measure the mutagenic and selective processes affecting CpG dinucleotides within exonic sequence. The suitability of these models was validated on >2,000 protein coding genes from a naturally occurring biological control, four yeast species that do not methylate their DNA. As expected, our analyses of yeast revealed no evidence for an elevated CpG transition rate or for substitution suppression affecting CpG-containing codons. Our analyses of >12,000 protein-coding genes from four primate lineages confirm the systemic influence of 5mC hypermutability on the divergence of these genes. After adjusting for confounding influences of mutation and the properties of the encoded amino acids, we confirmed that CpG-containing codons are under greater purifying selection in primates. Genes with significant evidence of enhanced suppression of nonsynonymous CpG changes were also shown to be significantly enriched in Online Mendelian Inheritance in Man. We developed a method for ranking candidate phenotypically influential CpG positions in human genes. Application of this method indicates that of the ∼1 million exonic CpG dinucleotides within humans, ∼20% are strong candidates for both hypermutability and disease association.
format Online
Article
Text
id pubmed-3184784
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31847842011-10-03 Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes Ying, Hua Huttley, Gavin Genome Biol Evol Research Articles The CpG dinucleotide is disproportionately represented in human genetic variation due to the hypermutability of 5-methyl-cytosine (5mC). We exploit this hypermutability and a novel codon substitution model to identify candidate functionally important exonic nucleotides. Population genetic theory suggests that codon positions with high cross-species CpG frequency will derive from stronger purifying selection. Using the phylogeny-based maximum likelihood inference framework, we applied codon substitution models with context-dependent parameters to measure the mutagenic and selective processes affecting CpG dinucleotides within exonic sequence. The suitability of these models was validated on >2,000 protein coding genes from a naturally occurring biological control, four yeast species that do not methylate their DNA. As expected, our analyses of yeast revealed no evidence for an elevated CpG transition rate or for substitution suppression affecting CpG-containing codons. Our analyses of >12,000 protein-coding genes from four primate lineages confirm the systemic influence of 5mC hypermutability on the divergence of these genes. After adjusting for confounding influences of mutation and the properties of the encoded amino acids, we confirmed that CpG-containing codons are under greater purifying selection in primates. Genes with significant evidence of enhanced suppression of nonsynonymous CpG changes were also shown to be significantly enriched in Online Mendelian Inheritance in Man. We developed a method for ranking candidate phenotypically influential CpG positions in human genes. Application of this method indicates that of the ∼1 million exonic CpG dinucleotides within humans, ∼20% are strong candidates for both hypermutability and disease association. Oxford University Press 2011-03-11 /pmc/articles/PMC3184784/ /pubmed/21398426 http://dx.doi.org/10.1093/gbe/evr021 Text en © The Author(s) 2011. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Ying, Hua
Huttley, Gavin
Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes
title Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes
title_full Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes
title_fullStr Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes
title_full_unstemmed Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes
title_short Exploiting CpG Hypermutability to Identify Phenotypically Significant Variation Within Human Protein-Coding Genes
title_sort exploiting cpg hypermutability to identify phenotypically significant variation within human protein-coding genes
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184784/
https://www.ncbi.nlm.nih.gov/pubmed/21398426
http://dx.doi.org/10.1093/gbe/evr021
work_keys_str_mv AT yinghua exploitingcpghypermutabilitytoidentifyphenotypicallysignificantvariationwithinhumanproteincodinggenes
AT huttleygavin exploitingcpghypermutabilitytoidentifyphenotypicallysignificantvariationwithinhumanproteincodinggenes