RefSeq: an update on prokaryotic genome annotation and curation

The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Ann...

Bibliographic Details
Main authors: Haft, Daniel H, DiCuccio, Michael, Badretdin, Azat, Brover, Vyacheslav, Chetvernin, Vyacheslav, O’Neill, Kathleen, Li, Wenjun, Chitsaz, Farideh, Derbyshire, Myra K, Gonzales, Noreen R, Gwadz, Marc, Lu, Fu, Marchler, Gabriele H, Song, James S, Thanki, Narmada, Yamashita, Roxanne A, Zheng, Chanjuan, Thibaud-Nissen, Françoise, Geer, Lewis Y, Marchler-Bauer, Aron, Pruitt, Kim D
Format: Online Article Text
Language: English
Published: Oxford University Press 2018
Subjects: Database Issue
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5753331/
https://www.ncbi.nlm.nih.gov/pubmed/29112715
http://dx.doi.org/10.1093/nar/gkx1068
_version_ 1783290252106924032
author Haft, Daniel H
DiCuccio, Michael
Badretdin, Azat
Brover, Vyacheslav
Chetvernin, Vyacheslav
O’Neill, Kathleen
Li, Wenjun
Chitsaz, Farideh
Derbyshire, Myra K
Gonzales, Noreen R
Gwadz, Marc
Lu, Fu
Marchler, Gabriele H
Song, James S
Thanki, Narmada
Yamashita, Roxanne A
Zheng, Chanjuan
Thibaud-Nissen, Françoise
Geer, Lewis Y
Marchler-Bauer, Aron
Pruitt, Kim D
author_sort Haft, Daniel H
collection PubMed
description The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Annotation Pipeline (PGAP) to provide users with a resource that is as consistent and accurate as possible. Notable recent changes include the development of a hierarchical evidence scheme, a new focus on curating annotation evidence sources, the addition and curation of protein profile hidden Markov models (HMMs), release of an updated pipeline (PGAP-4), and comprehensive re-annotation of RefSeq prokaryotic genomes. Antimicrobial resistance proteins have been reannotated comprehensively, improved structural annotation of insertion sequence transposases and selenoproteins is provided, curated complex domain architectures have given upgraded names to millions of multidomain proteins, and we introduce a new kind of annotation rule—BlastRules. Continual curation of supporting evidence, and propagation of improved names onto RefSeq proteins ensures that the functional annotation of genomes is kept current. An increasing share of our annotation now derives from HMMs and other sets of annotation rules that are portable by nature, and available for download and for reuse by other investigators. RefSeq is found at https://www.ncbi.nlm.nih.gov/refseq/.
format Online
Article
Text
id pubmed-5753331
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57533312018-01-05 RefSeq: an update on prokaryotic genome annotation and curation Haft, Daniel H DiCuccio, Michael Badretdin, Azat Brover, Vyacheslav Chetvernin, Vyacheslav O’Neill, Kathleen Li, Wenjun Chitsaz, Farideh Derbyshire, Myra K Gonzales, Noreen R Gwadz, Marc Lu, Fu Marchler, Gabriele H Song, James S Thanki, Narmada Yamashita, Roxanne A Zheng, Chanjuan Thibaud-Nissen, Françoise Geer, Lewis Y Marchler-Bauer, Aron Pruitt, Kim D Nucleic Acids Res Database Issue The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Annotation Pipeline (PGAP) to provide users with a resource that is as consistent and accurate as possible. Notable recent changes include the development of a hierarchical evidence scheme, a new focus on curating annotation evidence sources, the addition and curation of protein profile hidden Markov models (HMMs), release of an updated pipeline (PGAP-4), and comprehensive re-annotation of RefSeq prokaryotic genomes. Antimicrobial resistance proteins have been reannotated comprehensively, improved structural annotation of insertion sequence transposases and selenoproteins is provided, curated complex domain architectures have given upgraded names to millions of multidomain proteins, and we introduce a new kind of annotation rule—BlastRules. Continual curation of supporting evidence, and propagation of improved names onto RefSeq proteins ensures that the functional annotation of genomes is kept current. An increasing share of our annotation now derives from HMMs and other sets of annotation rules that are portable by nature, and available for download and for reuse by other investigators. RefSeq is found at https://www.ncbi.nlm.nih.gov/refseq/. 
Oxford University Press 2018-01-04 2017-11-03 /pmc/articles/PMC5753331/ /pubmed/29112715 http://dx.doi.org/10.1093/nar/gkx1068 Text en Published by Oxford University Press on behalf of Nucleic Acids Research 2017. This work is written by (a) US Government employee(s) and is in the public domain in the US.
spellingShingle Database Issue
Haft, Daniel H
DiCuccio, Michael
Badretdin, Azat
Brover, Vyacheslav
Chetvernin, Vyacheslav
O’Neill, Kathleen
Li, Wenjun
Chitsaz, Farideh
Derbyshire, Myra K
Gonzales, Noreen R
Gwadz, Marc
Lu, Fu
Marchler, Gabriele H
Song, James S
Thanki, Narmada
Yamashita, Roxanne A
Zheng, Chanjuan
Thibaud-Nissen, Françoise
Geer, Lewis Y
Marchler-Bauer, Aron
Pruitt, Kim D
RefSeq: an update on prokaryotic genome annotation and curation
title RefSeq: an update on prokaryotic genome annotation and curation
title_full RefSeq: an update on prokaryotic genome annotation and curation
title_fullStr RefSeq: an update on prokaryotic genome annotation and curation
title_full_unstemmed RefSeq: an update on prokaryotic genome annotation and curation
title_short RefSeq: an update on prokaryotic genome annotation and curation
title_sort refseq: an update on prokaryotic genome annotation and curation
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5753331/
https://www.ncbi.nlm.nih.gov/pubmed/29112715
http://dx.doi.org/10.1093/nar/gkx1068