Cargando…

Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach

BACKGROUND: Selenocysteine and pyrrolysine are the 21st and 22nd amino acids, which are genetically encoded by stop codons. Since a number of microbial genomes have been completely sequenced to date, it is tempting to ask whether the 23rd amino acid is left undiscovered in these genomes. Recently, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Fujita, Masashi, Mihara, Hisaaki, Goto, Susumu, Esaki, Nobuyoshi, Kanehisa, Minoru
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1914089/
https://www.ncbi.nlm.nih.gov/pubmed/17597547
http://dx.doi.org/10.1186/1471-2105-8-225
_version_ 1782134104146313216
author Fujita, Masashi
Mihara, Hisaaki
Goto, Susumu
Esaki, Nobuyoshi
Kanehisa, Minoru
author_facet Fujita, Masashi
Mihara, Hisaaki
Goto, Susumu
Esaki, Nobuyoshi
Kanehisa, Minoru
author_sort Fujita, Masashi
collection PubMed
description BACKGROUND: Selenocysteine and pyrrolysine are the 21st and 22nd amino acids, which are genetically encoded by stop codons. Since a number of microbial genomes have been completely sequenced to date, it is tempting to ask whether the 23rd amino acid is left undiscovered in these genomes. Recently, a computational study addressed this question and reported that no tRNA gene for unknown amino acid was found in genome sequences available. However, performance of the tRNA prediction program on an unknown tRNA family, which may have atypical sequence and structure, is unclear, thereby rendering their result inconclusive. A protein-level study will provide independent insight into the novel amino acid. RESULTS: Assuming that the 23rd amino acid is also encoded by a stop codon, we systematically predicted proteins that contain stop-codon-encoded amino acids from 191 prokaryotic genomes. Since our prediction method relies only on the conservation patterns of primary sequences, it also provides an opportunity to search novel selenoproteins and other readthrough proteins. It successfully recovered many of currently known selenoproteins and pyrrolysine proteins. However, no promising candidate for the 23rd amino acid was detected, and only one novel selenoprotein was predicted. CONCLUSION: Our result suggests that the unknown amino acid encoded by stop codons does not exist, or its phylogenetic distribution is rather limited, which is in agreement with the previous study on tRNA. The method described here can be used in future studies to explore novel readthrough events from complete genomes, which are rapidly growing.
format Text
id pubmed-1914089
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19140892007-07-13 Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach Fujita, Masashi Mihara, Hisaaki Goto, Susumu Esaki, Nobuyoshi Kanehisa, Minoru BMC Bioinformatics Research Article BACKGROUND: Selenocysteine and pyrrolysine are the 21st and 22nd amino acids, which are genetically encoded by stop codons. Since a number of microbial genomes have been completely sequenced to date, it is tempting to ask whether the 23rd amino acid is left undiscovered in these genomes. Recently, a computational study addressed this question and reported that no tRNA gene for unknown amino acid was found in genome sequences available. However, performance of the tRNA prediction program on an unknown tRNA family, which may have atypical sequence and structure, is unclear, thereby rendering their result inconclusive. A protein-level study will provide independent insight into the novel amino acid. RESULTS: Assuming that the 23rd amino acid is also encoded by a stop codon, we systematically predicted proteins that contain stop-codon-encoded amino acids from 191 prokaryotic genomes. Since our prediction method relies only on the conservation patterns of primary sequences, it also provides an opportunity to search novel selenoproteins and other readthrough proteins. It successfully recovered many of currently known selenoproteins and pyrrolysine proteins. However, no promising candidate for the 23rd amino acid was detected, and only one novel selenoprotein was predicted. CONCLUSION: Our result suggests that the unknown amino acid encoded by stop codons does not exist, or its phylogenetic distribution is rather limited, which is in agreement with the previous study on tRNA. The method described here can be used in future studies to explore novel readthrough events from complete genomes, which are rapidly growing. BioMed Central 2007-06-28 /pmc/articles/PMC1914089/ /pubmed/17597547 http://dx.doi.org/10.1186/1471-2105-8-225 Text en Copyright © 2007 Fujita et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Fujita, Masashi
Mihara, Hisaaki
Goto, Susumu
Esaki, Nobuyoshi
Kanehisa, Minoru
Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
title Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
title_full Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
title_fullStr Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
title_full_unstemmed Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
title_short Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
title_sort mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1914089/
https://www.ncbi.nlm.nih.gov/pubmed/17597547
http://dx.doi.org/10.1186/1471-2105-8-225
work_keys_str_mv AT fujitamasashi miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach
AT miharahisaaki miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach
AT gotosusumu miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach
AT esakinobuyoshi miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach
AT kanehisaminoru miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach