Cargando…

CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes

The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, are available in GenBank. The number of coronavirus species with complete genomes available has increased...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yi, Lau, Susanna K. P., Woo, Patrick C. Y., Yuen, Kwok-yung
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238867/
https://www.ncbi.nlm.nih.gov/pubmed/17913743
http://dx.doi.org/10.1093/nar/gkm754
_version_ 1782150478591688704
author Huang, Yi
Lau, Susanna K. P.
Woo, Patrick C. Y.
Yuen, Kwok-yung
author_facet Huang, Yi
Lau, Susanna K. P.
Woo, Patrick C. Y.
Yuen, Kwok-yung
author_sort Huang, Yi
collection PubMed
description The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, are available in GenBank. The number of coronavirus species with complete genomes available has increased from 9 in 2003 to 25 in 2007, of which six, including coronavirus HKU1, bat SARS coronavirus, group 1 bat coronavirus HKU2, groups 2c and 2d coronaviruses, were sequenced by our laboratory. To overcome the problems we encountered in the existing databases during comparative sequence analysis, we built a comprehensive database, CoVDB (http://covdb.microbiology.hku.hk), of annotated coronavirus genes and genomes. CoVDB provides a convenient platform for rapid and accurate batch sequence retrieval, the cornerstone and bottleneck for comparative gene or genome analysis. Sequences can be directly downloaded from the website in FASTA format. CoVDB also provides detailed annotation of all coronavirus sequences using a standardized nomenclature system, and overcomes the problems of duplicated and identical sequences in other databases. For complete genomes, a single representative sequence for each species is available for comparative analysis such as phylogenetic studies. With the annotated sequences in CoVDB, more specific blast search results can be generated for efficient downstream analysis.
format Text
id pubmed-2238867
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-22388672008-02-12 CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes Huang, Yi Lau, Susanna K. P. Woo, Patrick C. Y. Yuen, Kwok-yung Nucleic Acids Res Articles The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, are available in GenBank. The number of coronavirus species with complete genomes available has increased from 9 in 2003 to 25 in 2007, of which six, including coronavirus HKU1, bat SARS coronavirus, group 1 bat coronavirus HKU2, groups 2c and 2d coronaviruses, were sequenced by our laboratory. To overcome the problems we encountered in the existing databases during comparative sequence analysis, we built a comprehensive database, CoVDB (http://covdb.microbiology.hku.hk), of annotated coronavirus genes and genomes. CoVDB provides a convenient platform for rapid and accurate batch sequence retrieval, the cornerstone and bottleneck for comparative gene or genome analysis. Sequences can be directly downloaded from the website in FASTA format. CoVDB also provides detailed annotation of all coronavirus sequences using a standardized nomenclature system, and overcomes the problems of duplicated and identical sequences in other databases. For complete genomes, a single representative sequence for each species is available for comparative analysis such as phylogenetic studies. With the annotated sequences in CoVDB, more specific blast search results can be generated for efficient downstream analysis. Oxford University Press 2008-01 2007-10-02 /pmc/articles/PMC2238867/ /pubmed/17913743 http://dx.doi.org/10.1093/nar/gkm754 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Huang, Yi
Lau, Susanna K. P.
Woo, Patrick C. Y.
Yuen, Kwok-yung
CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes
title CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes
title_full CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes
title_fullStr CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes
title_full_unstemmed CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes
title_short CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes
title_sort covdb: a comprehensive database for comparative analysis of coronavirus genes and genomes
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238867/
https://www.ncbi.nlm.nih.gov/pubmed/17913743
http://dx.doi.org/10.1093/nar/gkm754
work_keys_str_mv AT huangyi covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes
AT laususannakp covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes
AT woopatrickcy covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes
AT yuenkwokyung covdbacomprehensivedatabaseforcomparativeanalysisofcoronavirusgenesandgenomes