RefSeq: an update on prokaryotic genome annotation and curation
The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Ann...
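Main authors: | Haft, Daniel H; DiCuccio, Michael; Badretdin, Azat; Brover, Vyacheslav; Chetvernin, Vyacheslav; O’Neill, Kathleen; Li, Wenjun; Chitsaz, Farideh; Derbyshire, Myra K; Gonzales, Noreen R; Gwadz, Marc; Lu, Fu; Marchler, Gabriele H; Song, James S; Thanki, Narmada; Yamashita, Roxanne A; Zheng, Chanjuan; Thibaud-Nissen, Françoise; Geer, Lewis Y; Marchler-Bauer, Aron; Pruitt, Kim D |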
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2018 |
Subjects: | Database Issue |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5753331/ https://www.ncbi.nlm.nih.gov/pubmed/29112715 http://dx.doi.org/10.1093/nar/gkx1068 |
_version_ | 1783290252106924032 |
---|---|
author | Haft, Daniel H DiCuccio, Michael Badretdin, Azat Brover, Vyacheslav Chetvernin, Vyacheslav O’Neill, Kathleen Li, Wenjun Chitsaz, Farideh Derbyshire, Myra K Gonzales, Noreen R Gwadz, Marc Lu, Fu Marchler, Gabriele H Song, James S Thanki, Narmada Yamashita, Roxanne A Zheng, Chanjuan Thibaud-Nissen, Françoise Geer, Lewis Y Marchler-Bauer, Aron Pruitt, Kim D |
author_facet | Haft, Daniel H DiCuccio, Michael Badretdin, Azat Brover, Vyacheslav Chetvernin, Vyacheslav O’Neill, Kathleen Li, Wenjun Chitsaz, Farideh Derbyshire, Myra K Gonzales, Noreen R Gwadz, Marc Lu, Fu Marchler, Gabriele H Song, James S Thanki, Narmada Yamashita, Roxanne A Zheng, Chanjuan Thibaud-Nissen, Françoise Geer, Lewis Y Marchler-Bauer, Aron Pruitt, Kim D |
author_sort | Haft, Daniel H |
collection | PubMed |
description | The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Annotation Pipeline (PGAP) to provide users with a resource that is as consistent and accurate as possible. Notable recent changes include the development of a hierarchical evidence scheme, a new focus on curating annotation evidence sources, the addition and curation of protein profile hidden Markov models (HMMs), release of an updated pipeline (PGAP-4), and comprehensive re-annotation of RefSeq prokaryotic genomes. Antimicrobial resistance proteins have been reannotated comprehensively, improved structural annotation of insertion sequence transposases and selenoproteins is provided, curated complex domain architectures have given upgraded names to millions of multidomain proteins, and we introduce a new kind of annotation rule—BlastRules. Continual curation of supporting evidence, and propagation of improved names onto RefSeq proteins ensures that the functional annotation of genomes is kept current. An increasing share of our annotation now derives from HMMs and other sets of annotation rules that are portable by nature, and available for download and for reuse by other investigators. RefSeq is found at https://www.ncbi.nlm.nih.gov/refseq/. |
format | Online Article Text |
id | pubmed-5753331 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-5753331 2018-01-05 RefSeq: an update on prokaryotic genome annotation and curation Haft, Daniel H DiCuccio, Michael Badretdin, Azat Brover, Vyacheslav Chetvernin, Vyacheslav O’Neill, Kathleen Li, Wenjun Chitsaz, Farideh Derbyshire, Myra K Gonzales, Noreen R Gwadz, Marc Lu, Fu Marchler, Gabriele H Song, James S Thanki, Narmada Yamashita, Roxanne A Zheng, Chanjuan Thibaud-Nissen, Françoise Geer, Lewis Y Marchler-Bauer, Aron Pruitt, Kim D Nucleic Acids Res Database Issue The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Annotation Pipeline (PGAP) to provide users with a resource that is as consistent and accurate as possible. Notable recent changes include the development of a hierarchical evidence scheme, a new focus on curating annotation evidence sources, the addition and curation of protein profile hidden Markov models (HMMs), release of an updated pipeline (PGAP-4), and comprehensive re-annotation of RefSeq prokaryotic genomes. Antimicrobial resistance proteins have been reannotated comprehensively, improved structural annotation of insertion sequence transposases and selenoproteins is provided, curated complex domain architectures have given upgraded names to millions of multidomain proteins, and we introduce a new kind of annotation rule—BlastRules. Continual curation of supporting evidence, and propagation of improved names onto RefSeq proteins ensures that the functional annotation of genomes is kept current. An increasing share of our annotation now derives from HMMs and other sets of annotation rules that are portable by nature, and available for download and for reuse by other investigators. RefSeq is found at https://www.ncbi.nlm.nih.gov/refseq/.
Oxford University Press 2018-01-04 2017-11-03 /pmc/articles/PMC5753331/ /pubmed/29112715 http://dx.doi.org/10.1093/nar/gkx1068 Text en Published by Oxford University Press on behalf of Nucleic Acids Research 2017. This work is written by (a) US Government employee(s) and is in the public domain in the US. |
spellingShingle | Database Issue Haft, Daniel H DiCuccio, Michael Badretdin, Azat Brover, Vyacheslav Chetvernin, Vyacheslav O’Neill, Kathleen Li, Wenjun Chitsaz, Farideh Derbyshire, Myra K Gonzales, Noreen R Gwadz, Marc Lu, Fu Marchler, Gabriele H Song, James S Thanki, Narmada Yamashita, Roxanne A Zheng, Chanjuan Thibaud-Nissen, Françoise Geer, Lewis Y Marchler-Bauer, Aron Pruitt, Kim D RefSeq: an update on prokaryotic genome annotation and curation |
title | RefSeq: an update on prokaryotic genome annotation and curation |
title_full | RefSeq: an update on prokaryotic genome annotation and curation |
title_fullStr | RefSeq: an update on prokaryotic genome annotation and curation |
title_full_unstemmed | RefSeq: an update on prokaryotic genome annotation and curation |
title_short | RefSeq: an update on prokaryotic genome annotation and curation |
title_sort | refseq: an update on prokaryotic genome annotation and curation |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5753331/ https://www.ncbi.nlm.nih.gov/pubmed/29112715 http://dx.doi.org/10.1093/nar/gkx1068 |