Cargando…
Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach
BACKGROUND: Selenocysteine and pyrrolysine are the 21st and 22nd amino acids, which are genetically encoded by stop codons. Since a number of microbial genomes have been completely sequenced to date, it is tempting to ask whether the 23rd amino acid is left undiscovered in these genomes. Recently, a...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1914089/ https://www.ncbi.nlm.nih.gov/pubmed/17597547 http://dx.doi.org/10.1186/1471-2105-8-225 |
_version_ | 1782134104146313216 |
---|---|
author | Fujita, Masashi Mihara, Hisaaki Goto, Susumu Esaki, Nobuyoshi Kanehisa, Minoru |
author_facet | Fujita, Masashi Mihara, Hisaaki Goto, Susumu Esaki, Nobuyoshi Kanehisa, Minoru |
author_sort | Fujita, Masashi |
collection | PubMed |
description | BACKGROUND: Selenocysteine and pyrrolysine are the 21st and 22nd amino acids, which are genetically encoded by stop codons. Since a number of microbial genomes have been completely sequenced to date, it is tempting to ask whether the 23rd amino acid is left undiscovered in these genomes. Recently, a computational study addressed this question and reported that no tRNA gene for unknown amino acid was found in genome sequences available. However, performance of the tRNA prediction program on an unknown tRNA family, which may have atypical sequence and structure, is unclear, thereby rendering their result inconclusive. A protein-level study will provide independent insight into the novel amino acid. RESULTS: Assuming that the 23rd amino acid is also encoded by a stop codon, we systematically predicted proteins that contain stop-codon-encoded amino acids from 191 prokaryotic genomes. Since our prediction method relies only on the conservation patterns of primary sequences, it also provides an opportunity to search novel selenoproteins and other readthrough proteins. It successfully recovered many of currently known selenoproteins and pyrrolysine proteins. However, no promising candidate for the 23rd amino acid was detected, and only one novel selenoprotein was predicted. CONCLUSION: Our result suggests that the unknown amino acid encoded by stop codons does not exist, or its phylogenetic distribution is rather limited, which is in agreement with the previous study on tRNA. The method described here can be used in future studies to explore novel readthrough events from complete genomes, which are rapidly growing. |
format | Text |
id | pubmed-1914089 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19140892007-07-13 Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach Fujita, Masashi Mihara, Hisaaki Goto, Susumu Esaki, Nobuyoshi Kanehisa, Minoru BMC Bioinformatics Research Article BACKGROUND: Selenocysteine and pyrrolysine are the 21st and 22nd amino acids, which are genetically encoded by stop codons. Since a number of microbial genomes have been completely sequenced to date, it is tempting to ask whether the 23rd amino acid is left undiscovered in these genomes. Recently, a computational study addressed this question and reported that no tRNA gene for unknown amino acid was found in genome sequences available. However, performance of the tRNA prediction program on an unknown tRNA family, which may have atypical sequence and structure, is unclear, thereby rendering their result inconclusive. A protein-level study will provide independent insight into the novel amino acid. RESULTS: Assuming that the 23rd amino acid is also encoded by a stop codon, we systematically predicted proteins that contain stop-codon-encoded amino acids from 191 prokaryotic genomes. Since our prediction method relies only on the conservation patterns of primary sequences, it also provides an opportunity to search novel selenoproteins and other readthrough proteins. It successfully recovered many of currently known selenoproteins and pyrrolysine proteins. However, no promising candidate for the 23rd amino acid was detected, and only one novel selenoprotein was predicted. CONCLUSION: Our result suggests that the unknown amino acid encoded by stop codons does not exist, or its phylogenetic distribution is rather limited, which is in agreement with the previous study on tRNA. The method described here can be used in future studies to explore novel readthrough events from complete genomes, which are rapidly growing. BioMed Central 2007-06-28 /pmc/articles/PMC1914089/ /pubmed/17597547 http://dx.doi.org/10.1186/1471-2105-8-225 Text en Copyright © 2007 Fujita et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Fujita, Masashi Mihara, Hisaaki Goto, Susumu Esaki, Nobuyoshi Kanehisa, Minoru Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach |
title | Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach |
title_full | Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach |
title_fullStr | Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach |
title_full_unstemmed | Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach |
title_short | Mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach |
title_sort | mining prokaryotic genomes for unknown amino acids: a stop-codon-based approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1914089/ https://www.ncbi.nlm.nih.gov/pubmed/17597547 http://dx.doi.org/10.1186/1471-2105-8-225 |
work_keys_str_mv | AT fujitamasashi miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach AT miharahisaaki miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach AT gotosusumu miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach AT esakinobuyoshi miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach AT kanehisaminoru miningprokaryoticgenomesforunknownaminoacidsastopcodonbasedapproach |