Cargando…

Empirical codon substitution matrix

BACKGROUND: Codon substitution probabilities are used in many types of molecular evolution studies such as determining Ka/Ks ratios, creating ancestral DNA sequences or aligning coding DNA. Until the recent dramatic increase in genomic data enabled construction of empirical matrices, researchers rel...

Descripción completa

Detalles Bibliográficos
Autores principales:	Schneider, Adrian, Cannarozzi, Gina M, Gonnet, Gaston H
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2005
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1173088/ https://www.ncbi.nlm.nih.gov/pubmed/15927081 http://dx.doi.org/10.1186/1471-2105-6-134

_version_	1782124453787860992
author	Schneider, Adrian Cannarozzi, Gina M Gonnet, Gaston H
author_facet	Schneider, Adrian Cannarozzi, Gina M Gonnet, Gaston H
author_sort	Schneider, Adrian
collection	PubMed
description	BACKGROUND: Codon substitution probabilities are used in many types of molecular evolution studies such as determining Ka/Ks ratios, creating ancestral DNA sequences or aligning coding DNA. Until the recent dramatic increase in genomic data enabled construction of empirical matrices, researchers relied on parameterized models of codon evolution. Here we present the first empirical codon substitution matrix entirely built from alignments of coding sequences from vertebrate DNA and thus provide an alternative to parameterized models of codon evolution. RESULTS: A set of 17,502 alignments of orthologous sequences from five vertebrate genomes yielded 8.3 million aligned codons from which the number of substitutions between codons were counted. From this data, both a probability matrix and a matrix of similarity scores were computed. They are 64 × 64 matrices describing the substitutions between all codons. Substitutions from sense codons to stop codons are not considered, resulting in block diagonal matrices consisting of 61 × 61 entries for the sense codons and 3 × 3 entries for the stop codons. CONCLUSION: The amount of genomic data currently available allowed for the construction of an empirical codon substitution matrix. However, more sequence data is still needed to construct matrices from different subsets of DNA, specific to kingdoms, evolutionary distance or different amount of synonymous change. Codon mutation matrices have advantages for alignments up to medium evolutionary distances and for usages that require DNA such as ancestral reconstruction of DNA sequences and the calculation of Ka/Ks ratios.
format	Text
id	pubmed-1173088
institution	National Center for Biotechnology Information
language	English
publishDate	2005
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-11730882005-07-07 Empirical codon substitution matrix Schneider, Adrian Cannarozzi, Gina M Gonnet, Gaston H BMC Bioinformatics Research Article BACKGROUND: Codon substitution probabilities are used in many types of molecular evolution studies such as determining Ka/Ks ratios, creating ancestral DNA sequences or aligning coding DNA. Until the recent dramatic increase in genomic data enabled construction of empirical matrices, researchers relied on parameterized models of codon evolution. Here we present the first empirical codon substitution matrix entirely built from alignments of coding sequences from vertebrate DNA and thus provide an alternative to parameterized models of codon evolution. RESULTS: A set of 17,502 alignments of orthologous sequences from five vertebrate genomes yielded 8.3 million aligned codons from which the number of substitutions between codons were counted. From this data, both a probability matrix and a matrix of similarity scores were computed. They are 64 × 64 matrices describing the substitutions between all codons. Substitutions from sense codons to stop codons are not considered, resulting in block diagonal matrices consisting of 61 × 61 entries for the sense codons and 3 × 3 entries for the stop codons. CONCLUSION: The amount of genomic data currently available allowed for the construction of an empirical codon substitution matrix. However, more sequence data is still needed to construct matrices from different subsets of DNA, specific to kingdoms, evolutionary distance or different amount of synonymous change. Codon mutation matrices have advantages for alignments up to medium evolutionary distances and for usages that require DNA such as ancestral reconstruction of DNA sequences and the calculation of Ka/Ks ratios. BioMed Central 2005-06-01 /pmc/articles/PMC1173088/ /pubmed/15927081 http://dx.doi.org/10.1186/1471-2105-6-134 Text en Copyright © 2005 Schneider et al; licensee BioMed Central Ltd.
spellingShingle	Research Article Schneider, Adrian Cannarozzi, Gina M Gonnet, Gaston H Empirical codon substitution matrix
title	Empirical codon substitution matrix
title_full	Empirical codon substitution matrix
title_fullStr	Empirical codon substitution matrix
title_full_unstemmed	Empirical codon substitution matrix
title_short	Empirical codon substitution matrix
title_sort	empirical codon substitution matrix
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1173088/ https://www.ncbi.nlm.nih.gov/pubmed/15927081 http://dx.doi.org/10.1186/1471-2105-6-134
work_keys_str_mv	AT schneideradrian empiricalcodonsubstitutionmatrix AT cannarozziginam empiricalcodonsubstitutionmatrix AT gonnetgastonh empiricalcodonsubstitutionmatrix

Empirical codon substitution matrix

Ejemplares similares