A functional update of the Escherichia coli K-12 genome

BACKGROUND: Since the genome of Escherichia coli K-12 was initially annotated in 1997, additional functional information based on biological characterization and functions of sequence-similar proteins has become available. On the basis of this new information, an updated version of the annotated chr...

Bibliographic Details
Main authors: Serres, Margrethe H; Gopal, Shuba; Nahum, Laila A; Liang, Ping; Gaasterland, Terry; Riley, Monica
Format: Text
Language: English
Published: BioMed Central 2001
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC56896/
https://www.ncbi.nlm.nih.gov/pubmed/11574054
_version_ 1782120041311895552
author Serres, Margrethe H
Gopal, Shuba
Nahum, Laila A
Liang, Ping
Gaasterland, Terry
Riley, Monica
author_facet Serres, Margrethe H
Gopal, Shuba
Nahum, Laila A
Liang, Ping
Gaasterland, Terry
Riley, Monica
author_sort Serres, Margrethe H
collection PubMed
description BACKGROUND: Since the genome of Escherichia coli K-12 was initially annotated in 1997, additional functional information based on biological characterization and functions of sequence-similar proteins has become available. On the basis of this new information, an updated version of the annotated chromosome has been generated. RESULTS: The E. coli K-12 chromosome is currently represented by 4,401 genes encoding 116 RNAs and 4,285 proteins. The boundaries of the genes identified in the GenBank Accession U00096 were used. Some protein-coding sequences are compound and encode multimodular proteins. The coding sequences (CDSs) are represented by modules (protein elements of at least 100 amino acids with biological activity and independent evolutionary history). There are 4,616 identified modules in the 4,285 proteins. Of these, 48.9% have been characterized, 29.5% have an imputed function, 2.1% have a phenotype and 19.5% have no function assignment. Only 7% of the modules appear unique to E. coli, and this number is expected to be reduced as more genome data becomes available. The imputed functions were assigned on the basis of manual evaluation of functions predicted by BLAST and DARWIN analyses and by the MAGPIE genome annotation system. CONCLUSIONS: Much knowledge has been gained about functions encoded by the E. coli K-12 genome since the 1997 annotation was published. The data presented here should be useful for analysis of E. coli gene products as well as gene products encoded by other genomes.
format Text
id pubmed-56896
institution National Center for Biotechnology Information
language English
publishDate 2001
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-568962001-09-28 A functional update of the Escherichia coli K-12 genome Serres, Margrethe H Gopal, Shuba Nahum, Laila A Liang, Ping Gaasterland, Terry Riley, Monica Genome Biol Research BACKGROUND: Since the genome of Escherichia coli K-12 was initially annotated in 1997, additional functional information based on biological characterization and functions of sequence-similar proteins has become available. On the basis of this new information, an updated version of the annotated chromosome has been generated. RESULTS: The E. coli K-12 chromosome is currently represented by 4,401 genes encoding 116 RNAs and 4,285 proteins. The boundaries of the genes identified in the GenBank Accession U00096 were used. Some protein-coding sequences are compound and encode multimodular proteins. The coding sequences (CDSs) are represented by modules (protein elements of at least 100 amino acids with biological activity and independent evolutionary history). There are 4,616 identified modules in the 4,285 proteins. Of these, 48.9% have been characterized, 29.5% have an imputed function, 2.1% have a phenotype and 19.5% have no function assignment. Only 7% of the modules appear unique to E. coli, and this number is expected to be reduced as more genome data becomes available. The imputed functions were assigned on the basis of manual evaluation of functions predicted by BLAST and DARWIN analyses and by the MAGPIE genome annotation system. CONCLUSIONS: Much knowledge has been gained about functions encoded by the E. coli K-12 genome since the 1997 annotation was published. The data presented here should be useful for analysis of E. coli gene products as well as gene products encoded by other genomes. BioMed Central 2001 2001-08-20 /pmc/articles/PMC56896/ /pubmed/11574054 Text en Copyright © 2001 Serres et al., licensee BioMed Central Ltd
spellingShingle Research
Serres, Margrethe H
Gopal, Shuba
Nahum, Laila A
Liang, Ping
Gaasterland, Terry
Riley, Monica
A functional update of the Escherichia coli K-12 genome
title A functional update of the Escherichia coli K-12 genome
title_full A functional update of the Escherichia coli K-12 genome
title_fullStr A functional update of the Escherichia coli K-12 genome
title_full_unstemmed A functional update of the Escherichia coli K-12 genome
title_short A functional update of the Escherichia coli K-12 genome
title_sort functional update of the escherichia coli k-12 genome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC56896/
https://www.ncbi.nlm.nih.gov/pubmed/11574054
work_keys_str_mv AT serresmargretheh afunctionalupdateoftheescherichiacolik12genome
AT gopalshuba afunctionalupdateoftheescherichiacolik12genome
AT nahumlailaa afunctionalupdateoftheescherichiacolik12genome
AT liangping afunctionalupdateoftheescherichiacolik12genome
AT gaasterlandterry afunctionalupdateoftheescherichiacolik12genome
AT rileymonica afunctionalupdateoftheescherichiacolik12genome
AT serresmargretheh functionalupdateoftheescherichiacolik12genome
AT gopalshuba functionalupdateoftheescherichiacolik12genome
AT nahumlailaa functionalupdateoftheescherichiacolik12genome
AT liangping functionalupdateoftheescherichiacolik12genome
AT gaasterlandterry functionalupdateoftheescherichiacolik12genome
AT rileymonica functionalupdateoftheescherichiacolik12genome