Cargando…
CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes
The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, are available in GenBank. The number of coronavirus species with complete genomes available has increased...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238867/ https://www.ncbi.nlm.nih.gov/pubmed/17913743 http://dx.doi.org/10.1093/nar/gkm754 |
_version_ | 1782150478591688704 |
---|---|
author | Huang, Yi Lau, Susanna K. P. Woo, Patrick C. Y. Yuen, Kwok-yung |
author_facet | Huang, Yi Lau, Susanna K. P. Woo, Patrick C. Y. Yuen, Kwok-yung |
author_sort | Huang, Yi |
collection | PubMed |
description | The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, are available in GenBank. The number of coronavirus species with complete genomes available has increased from 9 in 2003 to 25 in 2007, of which six, including coronavirus HKU1, bat SARS coronavirus, group 1 bat coronavirus HKU2, groups 2c and 2d coronaviruses, were sequenced by our laboratory. To overcome the problems we encountered in the existing databases during comparative sequence analysis, we built a comprehensive database, CoVDB (http://covdb.microbiology.hku.hk), of annotated coronavirus genes and genomes. CoVDB provides a convenient platform for rapid and accurate batch sequence retrieval, the cornerstone and bottleneck for comparative gene or genome analysis. Sequences can be directly downloaded from the website in FASTA format. CoVDB also provides detailed annotation of all coronavirus sequences using a standardized nomenclature system, and overcomes the problems of duplicated and identical sequences in other databases. For complete genomes, a single representative sequence for each species is available for comparative analysis such as phylogenetic studies. With the annotated sequences in CoVDB, more specific blast search results can be generated for efficient downstream analysis. |
format | Text |
id | pubmed-2238867 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-22388672008-02-12 CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes Huang, Yi Lau, Susanna K. P. Woo, Patrick C. Y. Yuen, Kwok-yung Nucleic Acids Res Articles The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, are available in GenBank. The number of coronavirus species with complete genomes available has increased from 9 in 2003 to 25 in 2007, of which six, including coronavirus HKU1, bat SARS coronavirus, group 1 bat coronavirus HKU2, groups 2c and 2d coronaviruses, were sequenced by our laboratory. To overcome the problems we encountered in the existing databases during comparative sequence analysis, we built a comprehensive database, CoVDB (http://covdb.microbiology.hku.hk), of annotated coronavirus genes and genomes. CoVDB provides a convenient platform for rapid and accurate batch sequence retrieval, the cornerstone and bottleneck for comparative gene or genome analysis. Sequences can be directly downloaded from the website in FASTA format. CoVDB also provides detailed annotation of all coronavirus sequences using a standardized nomenclature system, and overcomes the problems of duplicated and identical sequences in other databases. For complete genomes, a single representative sequence for each species is available for comparative analysis such as phylogenetic studies. With the annotated sequences in CoVDB, more specific blast search results can be generated for efficient downstream analysis. Oxford University Press 2008-01 2007-10-02 /pmc/articles/PMC2238867/ /pubmed/17913743 http://dx.doi.org/10.1093/nar/gkm754 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Huang, Yi Lau, Susanna K. P. Woo, Patrick C. Y. Yuen, Kwok-yung CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes |
title | CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes |
title_full | CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes |
title_fullStr | CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes |
title_full_unstemmed | CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes |
title_short | CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes |
title_sort | covdb: a comprehensive database for comparative analysis of coronavirus genes and genomes |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238867/ https://www.ncbi.nlm.nih.gov/pubmed/17913743 http://dx.doi.org/10.1093/nar/gkm754 |
work_keys_str_mv | AT huangyi covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes AT laususannakp covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes AT woopatrickcy covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes AT yuenkwokyung covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes |