Cargando…

Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome

The genomic era has seen a remarkable increase in the number of genomes being sequenced and annotated. Nonetheless, annotation remains a serious challenge for compositionally biased genomes. For the preliminary annotation, popular nucleotide and protein comparison methods such as BLAST are widely em...

Descripción completa

Detalles Bibliográficos
Autores principales: Paila, Umadevi, Kondam, Rohini, Ranjan, Akash
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2588515/
https://www.ncbi.nlm.nih.gov/pubmed/18948281
http://dx.doi.org/10.1093/nar/gkn635
_version_ 1782160942956544000
author Paila, Umadevi
Kondam, Rohini
Ranjan, Akash
author_facet Paila, Umadevi
Kondam, Rohini
Ranjan, Akash
author_sort Paila, Umadevi
collection PubMed
description The genomic era has seen a remarkable increase in the number of genomes being sequenced and annotated. Nonetheless, annotation remains a serious challenge for compositionally biased genomes. For the preliminary annotation, popular nucleotide and protein comparison methods such as BLAST are widely employed. These methods make use of matrices to score alignments such as the amino acid substitution matrices. Since a nucleotide bias leads to an overall bias in the amino acid composition of proteins, it is possible that a genome with nucleotide bias may have introduced atypical amino acid substitutions in its proteome. Consequently, standard matrices fail to perform well in sequence analysis of these genomes. To address this issue, we examined the amino acid substitution in the AT-rich genome of Plasmodium falciparum, chosen as a reference and reconstituted a substitution matrix in the genome's context. The matrix was used to generate protein sequence alignments for the parasite proteins that improved across the functional regions. We attribute this to the consistency that may have been achieved amid the target and background frequencies calculated exclusively in our study. This study has important implications on annotation of proteins that are of experimental interest but give poor sequence alignments with standard conventional matrices.
format Text
id pubmed-2588515
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-25885152009-03-04 Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome Paila, Umadevi Kondam, Rohini Ranjan, Akash Nucleic Acids Res Computational Biology The genomic era has seen a remarkable increase in the number of genomes being sequenced and annotated. Nonetheless, annotation remains a serious challenge for compositionally biased genomes. For the preliminary annotation, popular nucleotide and protein comparison methods such as BLAST are widely employed. These methods make use of matrices to score alignments such as the amino acid substitution matrices. Since a nucleotide bias leads to an overall bias in the amino acid composition of proteins, it is possible that a genome with nucleotide bias may have introduced atypical amino acid substitutions in its proteome. Consequently, standard matrices fail to perform well in sequence analysis of these genomes. To address this issue, we examined the amino acid substitution in the AT-rich genome of Plasmodium falciparum, chosen as a reference and reconstituted a substitution matrix in the genome's context. The matrix was used to generate protein sequence alignments for the parasite proteins that improved across the functional regions. We attribute this to the consistency that may have been achieved amid the target and background frequencies calculated exclusively in our study. This study has important implications on annotation of proteins that are of experimental interest but give poor sequence alignments with standard conventional matrices. Oxford University Press 2008-12 2008-10-23 /pmc/articles/PMC2588515/ /pubmed/18948281 http://dx.doi.org/10.1093/nar/gkn635 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Paila, Umadevi
Kondam, Rohini
Ranjan, Akash
Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
title Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
title_full Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
title_fullStr Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
title_full_unstemmed Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
title_short Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
title_sort genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an at-biased genome
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2588515/
https://www.ncbi.nlm.nih.gov/pubmed/18948281
http://dx.doi.org/10.1093/nar/gkn635
work_keys_str_mv AT pailaumadevi genomebiasinfluencesaminoacidchoicesanalysisofaminoacidsubstitutionandrecompilationofsubstitutionmatricesexclusivetoanatbiasedgenome
AT kondamrohini genomebiasinfluencesaminoacidchoicesanalysisofaminoacidsubstitutionandrecompilationofsubstitutionmatricesexclusivetoanatbiasedgenome
AT ranjanakash genomebiasinfluencesaminoacidchoicesanalysisofaminoacidsubstitutionandrecompilationofsubstitutionmatricesexclusivetoanatbiasedgenome