Cargando…
Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses
BACKGROUND: Loss of CpG dinucleotides in genomic DNA through methylation-induced mutation is characteristic of vertebrates and plants. However, these and other eukaryotic phyla show a range of other dinucleotide frequency biases with currently uncharacterized underlying mutational or selection mecha...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3829696/ https://www.ncbi.nlm.nih.gov/pubmed/24020411 http://dx.doi.org/10.1186/1471-2164-14-610 |
_version_ | 1782291371529338880 |
---|---|
author | Simmonds, Peter Xia, Wenjun Baillie, J Kenneth McKinnon, Ken |
author_facet | Simmonds, Peter Xia, Wenjun Baillie, J Kenneth McKinnon, Ken |
author_sort | Simmonds, Peter |
collection | PubMed |
description | BACKGROUND: Loss of CpG dinucleotides in genomic DNA through methylation-induced mutation is characteristic of vertebrates and plants. However, these and other eukaryotic phyla show a range of other dinucleotide frequency biases with currently uncharacterized underlying mutational or selection mechanisms. We developed a parameterized Markov process to identify what neighbour context-dependent mutations best accounted for patterns of dinucleotide frequency biases in genomic and cytoplasmically expressed mRNA sequences of different vertebrates, other eukaryotic groups and RNA viruses that infect them. RESULTS: Consistently, 11- to 14-fold greater frequencies of the methylation-associated mutation of C to T upstream of G (depicted as C→T,G) than other transitions best modelled dinucleotide frequencies in mammalian genomic DNA. However, further mutations such as G→T,T (5-fold greater than the default transversion rate) were required to account for the full spectrum of dinucleotide frequencies in mammalian sequence datasets. Consistent with modeling predictions for these two mutations, instability of both CpG and CpT dinucleotides was identified through SNP frequency analysis of human DNA sequences. Different sets of context-dependent mutations were modelled in other eukaryotes with non-methylated genomic DNA. In contrast to genomic DNA, best-fit models of dinucleotide frequencies in transcribed RNA sequences expressed in the cytoplasm from all organisms were dominated by mutations that eliminated UpA dinucleotides, observations consistent with cytoplasmically driven selection for mRNA stability. Surprisingly, mRNA sequences from organisms with methylated genomes showed evidence for additional selection against CpG through further context-dependent mutations (eg. C→A,G). Similar mutation or selection processes were identified among single-stranded mammalian RNA viruses; these potentially account for their previously described but unexplained under-representations of CpG and UpA dinucleotides. CONCLUSIONS: Methods we have developed identify mutational processes and selection pressures in organisms that provide new insights into nucleotide compositional constraints and a wealth of biochemical and evolutionarily testable predictions for the future. |
format | Online Article Text |
id | pubmed-3829696 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38296962013-11-20 Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses Simmonds, Peter Xia, Wenjun Baillie, J Kenneth McKinnon, Ken BMC Genomics Research Article BACKGROUND: Loss of CpG dinucleotides in genomic DNA through methylation-induced mutation is characteristic of vertebrates and plants. However, these and other eukaryotic phyla show a range of other dinucleotide frequency biases with currently uncharacterized underlying mutational or selection mechanisms. We developed a parameterized Markov process to identify what neighbour context-dependent mutations best accounted for patterns of dinucleotide frequency biases in genomic and cytoplasmically expressed mRNA sequences of different vertebrates, other eukaryotic groups and RNA viruses that infect them. RESULTS: Consistently, 11- to 14-fold greater frequencies of the methylation-associated mutation of C to T upstream of G (depicted as C→T,G) than other transitions best modelled dinucleotide frequencies in mammalian genomic DNA. However, further mutations such as G→T,T (5-fold greater than the default transversion rate) were required to account for the full spectrum of dinucleotide frequencies in mammalian sequence datasets. Consistent with modeling predictions for these two mutations, instability of both CpG and CpT dinucleotides was identified through SNP frequency analysis of human DNA sequences. Different sets of context-dependent mutations were modelled in other eukaryotes with non-methylated genomic DNA. In contrast to genomic DNA, best-fit models of dinucleotide frequencies in transcribed RNA sequences expressed in the cytoplasm from all organisms were dominated by mutations that eliminated UpA dinucleotides, observations consistent with cytoplasmically driven selection for mRNA stability. Surprisingly, mRNA sequences from organisms with methylated genomes showed evidence for additional selection against CpG through further context-dependent mutations (eg. C→A,G). Similar mutation or selection processes were identified among single-stranded mammalian RNA viruses; these potentially account for their previously described but unexplained under-representations of CpG and UpA dinucleotides. CONCLUSIONS: Methods we have developed identify mutational processes and selection pressures in organisms that provide new insights into nucleotide compositional constraints and a wealth of biochemical and evolutionarily testable predictions for the future. BioMed Central 2013-09-10 /pmc/articles/PMC3829696/ /pubmed/24020411 http://dx.doi.org/10.1186/1471-2164-14-610 Text en Copyright © 2013 Simmonds et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Simmonds, Peter Xia, Wenjun Baillie, J Kenneth McKinnon, Ken Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses |
title | Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses |
title_full | Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses |
title_fullStr | Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses |
title_full_unstemmed | Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses |
title_short | Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses |
title_sort | modelling mutational and selection pressures on dinucleotides in eukaryotic phyla –selection against cpg and upa in cytoplasmically expressed rna and in rna viruses |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3829696/ https://www.ncbi.nlm.nih.gov/pubmed/24020411 http://dx.doi.org/10.1186/1471-2164-14-610 |
work_keys_str_mv | AT simmondspeter modellingmutationalandselectionpressuresondinucleotidesineukaryoticphylaselectionagainstcpgandupaincytoplasmicallyexpressedrnaandinrnaviruses AT xiawenjun modellingmutationalandselectionpressuresondinucleotidesineukaryoticphylaselectionagainstcpgandupaincytoplasmicallyexpressedrnaandinrnaviruses AT bailliejkenneth modellingmutationalandselectionpressuresondinucleotidesineukaryoticphylaselectionagainstcpgandupaincytoplasmicallyexpressedrnaandinrnaviruses AT mckinnonken modellingmutationalandselectionpressuresondinucleotidesineukaryoticphylaselectionagainstcpgandupaincytoplasmicallyexpressedrnaandinrnaviruses |