Cargando…
The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes
The release of the 1000(th) complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved acc...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1251668/ https://www.ncbi.nlm.nih.gov/pubmed/16214803 http://dx.doi.org/10.1093/nar/gki866 |
_version_ | 1782125733958647808 |
---|---|
author | Overbeek, Ross Begley, Tadhg Butler, Ralph M. Choudhuri, Jomuna V. Chuang, Han-Yu Cohoon, Matthew de Crécy-Lagard, Valérie Diaz, Naryttza Disz, Terry Edwards, Robert Fonstein, Michael Frank, Ed D. Gerdes, Svetlana Glass, Elizabeth M. Goesmann, Alexander Hanson, Andrew Iwata-Reuyl, Dirk Jensen, Roy Jamshidi, Neema Krause, Lutz Kubal, Michael Larsen, Niels Linke, Burkhard McHardy, Alice C. Meyer, Folker Neuweger, Heiko Olsen, Gary Olson, Robert Osterman, Andrei Portnoy, Vasiliy Pusch, Gordon D. Rodionov, Dmitry A. Rückert, Christian Steiner, Jason Stevens, Rick Thiele, Ines Vassieva, Olga Ye, Yuzhen Zagnitko, Olga Vonstein, Veronika |
author_facet | Overbeek, Ross Begley, Tadhg Butler, Ralph M. Choudhuri, Jomuna V. Chuang, Han-Yu Cohoon, Matthew de Crécy-Lagard, Valérie Diaz, Naryttza Disz, Terry Edwards, Robert Fonstein, Michael Frank, Ed D. Gerdes, Svetlana Glass, Elizabeth M. Goesmann, Alexander Hanson, Andrew Iwata-Reuyl, Dirk Jensen, Roy Jamshidi, Neema Krause, Lutz Kubal, Michael Larsen, Niels Linke, Burkhard McHardy, Alice C. Meyer, Folker Neuweger, Heiko Olsen, Gary Olson, Robert Osterman, Andrei Portnoy, Vasiliy Pusch, Gordon D. Rodionov, Dmitry A. Rückert, Christian Steiner, Jason Stevens, Rick Thiele, Ines Vassieva, Olga Ye, Yuzhen Zagnitko, Olga Vonstein, Veronika |
author_sort | Overbeek, Ross |
collection | PubMed |
description | The release of the 1000(th) complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms. |
format | Text |
id | pubmed-1251668 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-12516682005-10-12 The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes Overbeek, Ross Begley, Tadhg Butler, Ralph M. Choudhuri, Jomuna V. Chuang, Han-Yu Cohoon, Matthew de Crécy-Lagard, Valérie Diaz, Naryttza Disz, Terry Edwards, Robert Fonstein, Michael Frank, Ed D. Gerdes, Svetlana Glass, Elizabeth M. Goesmann, Alexander Hanson, Andrew Iwata-Reuyl, Dirk Jensen, Roy Jamshidi, Neema Krause, Lutz Kubal, Michael Larsen, Niels Linke, Burkhard McHardy, Alice C. Meyer, Folker Neuweger, Heiko Olsen, Gary Olson, Robert Osterman, Andrei Portnoy, Vasiliy Pusch, Gordon D. Rodionov, Dmitry A. Rückert, Christian Steiner, Jason Stevens, Rick Thiele, Ines Vassieva, Olga Ye, Yuzhen Zagnitko, Olga Vonstein, Veronika Nucleic Acids Res Article The release of the 1000(th) complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms. Oxford University Press 2005 2005-10-07 /pmc/articles/PMC1251668/ /pubmed/16214803 http://dx.doi.org/10.1093/nar/gki866 Text en © The Author 2005. Published by Oxford University Press. All rights reserved |
spellingShingle | Article Overbeek, Ross Begley, Tadhg Butler, Ralph M. Choudhuri, Jomuna V. Chuang, Han-Yu Cohoon, Matthew de Crécy-Lagard, Valérie Diaz, Naryttza Disz, Terry Edwards, Robert Fonstein, Michael Frank, Ed D. Gerdes, Svetlana Glass, Elizabeth M. Goesmann, Alexander Hanson, Andrew Iwata-Reuyl, Dirk Jensen, Roy Jamshidi, Neema Krause, Lutz Kubal, Michael Larsen, Niels Linke, Burkhard McHardy, Alice C. Meyer, Folker Neuweger, Heiko Olsen, Gary Olson, Robert Osterman, Andrei Portnoy, Vasiliy Pusch, Gordon D. Rodionov, Dmitry A. Rückert, Christian Steiner, Jason Stevens, Rick Thiele, Ines Vassieva, Olga Ye, Yuzhen Zagnitko, Olga Vonstein, Veronika The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes |
title | The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes |
title_full | The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes |
title_fullStr | The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes |
title_full_unstemmed | The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes |
title_short | The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes |
title_sort | subsystems approach to genome annotation and its use in the project to annotate 1000 genomes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1251668/ https://www.ncbi.nlm.nih.gov/pubmed/16214803 http://dx.doi.org/10.1093/nar/gki866 |
work_keys_str_mv | AT overbeekross thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT begleytadhg thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT butlerralphm thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT choudhurijomunav thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT chuanghanyu thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT cohoonmatthew thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT decrecylagardvalerie thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT diaznaryttza thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT diszterry thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT edwardsrobert thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT fonsteinmichael thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT frankedd thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT gerdessvetlana thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT glasselizabethm thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT goesmannalexander thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT hansonandrew thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT iwatareuyldirk thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT jensenroy thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT jamshidineema thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT krauselutz thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT kubalmichael thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT larsenniels thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT linkeburkhard thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT mchardyalicec thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT meyerfolker thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT neuwegerheiko thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT olsengary thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT olsonrobert thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT ostermanandrei thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT portnoyvasiliy thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT puschgordond thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT rodionovdmitrya thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT ruckertchristian thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT steinerjason thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT stevensrick thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT thieleines thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT vassievaolga thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT yeyuzhen thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT zagnitkoolga thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT vonsteinveronika thesubsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT overbeekross subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT begleytadhg subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT butlerralphm subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT choudhurijomunav subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT chuanghanyu subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT cohoonmatthew subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT decrecylagardvalerie subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT diaznaryttza subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT diszterry subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT edwardsrobert subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT fonsteinmichael subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT frankedd subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT gerdessvetlana subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT glasselizabethm subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT goesmannalexander subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT hansonandrew subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT iwatareuyldirk subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT jensenroy subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT jamshidineema subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT krauselutz subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT kubalmichael subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT larsenniels subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT linkeburkhard subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT mchardyalicec subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT meyerfolker subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT neuwegerheiko subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT olsengary subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT olsonrobert subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT ostermanandrei subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT portnoyvasiliy subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT puschgordond subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT rodionovdmitrya subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT ruckertchristian subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT steinerjason subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT stevensrick subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT thieleines subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT vassievaolga subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT yeyuzhen subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT zagnitkoolga subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes AT vonsteinveronika subsystemsapproachtogenomeannotationanditsuseintheprojecttoannotate1000genomes |