Cargando…
ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation
The Alignable Tight Genomic Clusters (ATGCs) database is a collection of closely related bacterial and archaeal genomes that provides several tools to aid research into evolutionary processes in the microbial world. Each ATGC is a taxonomy-independent cluster of 2 or more completely sequenced genome...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210634/ https://www.ncbi.nlm.nih.gov/pubmed/28053163 http://dx.doi.org/10.1093/nar/gkw934 |
_version_ | 1782490924217008128 |
---|---|
author | Kristensen, David M. Wolf, Yuri I. Koonin, Eugene V. |
author_facet | Kristensen, David M. Wolf, Yuri I. Koonin, Eugene V. |
author_sort | Kristensen, David M. |
collection | PubMed |
description | The Alignable Tight Genomic Clusters (ATGCs) database is a collection of closely related bacterial and archaeal genomes that provides several tools to aid research into evolutionary processes in the microbial world. Each ATGC is a taxonomy-independent cluster of 2 or more completely sequenced genomes that meet the objective criteria of a high degree of local gene order (synteny) and a small number of synonymous substitutions in the protein-coding genes. As such, each ATGC is suited for analysis of microevolutionary variations within a cohesive group of organisms (e.g. species), whereas the entire collection of ATGCs is useful for macroevolutionary studies. The ATGC database includes many forms of pre-computed data, in particular ATGC-COGs (Clusters of Orthologous Genes), multiple sequence alignments, a set of ‘index’ orthologs representing the most well-conserved members of each ATGC-COG, the phylogenetic tree of the organisms within each ATGC, etc. Although the ATGC database contains several million proteins from thousands of genomes organized into hundreds of clusters (roughly a 4-fold increase since the last version of the ATGC database), it is now built with completely automated methods and will be regularly updated following new releases of the NCBI RefSeq database. The ATGC database is hosted jointly at the University of Iowa at dmk-brain.ecn.uiowa.edu/ATGC/ and the NCBI at ftp.ncbi.nlm.nih.gov/pub/kristensen/ATGC/atgc_home.html. |
format | Online Article Text |
id | pubmed-5210634 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-52106342017-01-05 ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation Kristensen, David M. Wolf, Yuri I. Koonin, Eugene V. Nucleic Acids Res Database Issue The Alignable Tight Genomic Clusters (ATGCs) database is a collection of closely related bacterial and archaeal genomes that provides several tools to aid research into evolutionary processes in the microbial world. Each ATGC is a taxonomy-independent cluster of 2 or more completely sequenced genomes that meet the objective criteria of a high degree of local gene order (synteny) and a small number of synonymous substitutions in the protein-coding genes. As such, each ATGC is suited for analysis of microevolutionary variations within a cohesive group of organisms (e.g. species), whereas the entire collection of ATGCs is useful for macroevolutionary studies. The ATGC database includes many forms of pre-computed data, in particular ATGC-COGs (Clusters of Orthologous Genes), multiple sequence alignments, a set of ‘index’ orthologs representing the most well-conserved members of each ATGC-COG, the phylogenetic tree of the organisms within each ATGC, etc. Although the ATGC database contains several million proteins from thousands of genomes organized into hundreds of clusters (roughly a 4-fold increase since the last version of the ATGC database), it is now built with completely automated methods and will be regularly updated following new releases of the NCBI RefSeq database. The ATGC database is hosted jointly at the University of Iowa at dmk-brain.ecn.uiowa.edu/ATGC/ and the NCBI at ftp.ncbi.nlm.nih.gov/pub/kristensen/ATGC/atgc_home.html. Oxford University Press 2017-01-04 2016-10-18 /pmc/articles/PMC5210634/ /pubmed/28053163 http://dx.doi.org/10.1093/nar/gkw934 Text en Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US. |
spellingShingle | Database Issue Kristensen, David M. Wolf, Yuri I. Koonin, Eugene V. ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation |
title | ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation |
title_full | ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation |
title_fullStr | ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation |
title_full_unstemmed | ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation |
title_short | ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation |
title_sort | atgc database and atgc-cogs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210634/ https://www.ncbi.nlm.nih.gov/pubmed/28053163 http://dx.doi.org/10.1093/nar/gkw934 |
work_keys_str_mv | AT kristensendavidm atgcdatabaseandatgccogsanupdatedresourceformicroandmacroevolutionarystudiesofprokaryoticgenomesandproteinfamilyannotation AT wolfyurii atgcdatabaseandatgccogsanupdatedresourceformicroandmacroevolutionarystudiesofprokaryoticgenomesandproteinfamilyannotation AT koonineugenev atgcdatabaseandatgccogsanupdatedresourceformicroandmacroevolutionarystudiesofprokaryoticgenomesandproteinfamilyannotation |