Cargando…

proGenomes3: approaching one million accurately and consistently annotated high-quality prokaryotic genomes

The interpretation of genomic, transcriptomic and other microbial ‘omics data is highly dependent on the availability of well-annotated genomes. As the number of publicly available microbial genomes continues to increase exponentially, the need for quality control and consistent annotation is becomi...

Descripción completa

Detalles Bibliográficos
Autores principales: Fullam, Anthony, Letunic, Ivica, Schmidt, Thomas S B, Ducarmon, Quinten R, Karcher, Nicolai, Khedkar, Supriya, Kuhn, Michael, Larralde, Martin, Maistrenko, Oleksandr M, Malfertheiner, Lukas, Milanese, Alessio, Rodrigues, Joao Frederico Matias, Sanchis-López, Claudia, Schudoma, Christian, Szklarczyk, Damian, Sunagawa, Shinichi, Zeller, Georg, Huerta-Cepas, Jaime, von Mering, Christian, Bork, Peer, Mende, Daniel R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825469/
https://www.ncbi.nlm.nih.gov/pubmed/36408900
http://dx.doi.org/10.1093/nar/gkac1078
Descripción
Sumario:The interpretation of genomic, transcriptomic and other microbial ‘omics data is highly dependent on the availability of well-annotated genomes. As the number of publicly available microbial genomes continues to increase exponentially, the need for quality control and consistent annotation is becoming critical. We present proGenomes3, a database of 907 388 high-quality genomes containing 4 billion genes that passed stringent criteria and have been consistently annotated using multiple functional and taxonomic databases including mobile genetic elements and biosynthetic gene clusters. proGenomes3 encompasses 41 171 species-level clusters, defined based on universal single copy marker genes, for which pan-genomes and contextual habitat annotations are provided. The database is available at http://progenomes.embl.de/