Cargando…

CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences

BACKGROUND: With the recent progress made in large-scale genome sequencing projects a vast amount of novel data is becoming available. A comparative sequence analysis, exploiting sequence information from various resources, can be used to uncover hidden information, such as genetic variation. Althou...

Descripción completa

Detalles Bibliográficos
Autores principales:	Guryev, Victor, Berezikov, Eugene, Cuppen, Edwin
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2005
Materias:	Database
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC548278/ https://www.ncbi.nlm.nih.gov/pubmed/15676075 http://dx.doi.org/10.1186/1471-2164-6-10

_version_	1782122331280244736
author	Guryev, Victor Berezikov, Eugene Cuppen, Edwin
author_facet	Guryev, Victor Berezikov, Eugene Cuppen, Edwin
author_sort	Guryev, Victor
collection	PubMed
description	BACKGROUND: With the recent progress made in large-scale genome sequencing projects a vast amount of novel data is becoming available. A comparative sequence analysis, exploiting sequence information from various resources, can be used to uncover hidden information, such as genetic variation. Although there are enormous amounts of SNPs for a wide variety of organisms submitted to NCBI dbSNP and annotated in most genome assembly viewers like Ensembl and the UCSC Genome Browser, these platforms do not easily allow for extensive annotation and incorporation of experimental data supporting the polymorphism. However, such information is very important for selecting the most promising and useful candidate polymorphisms for use in experimental setups. DESCRIPTION: The CASCAD database is designed for presentation and query of candidate SNPs that are retrieved by in silico mining of high-throughput sequencing data. Currently, the database provides collections of laboratory rat (Rattus norvegicus) and zebrafish (Danio rerio) candidate SNPs. The database stores detailed information about raw data supporting the candidate, extensive annotation and links to external databases (e.g. GenBank, Ensembl, UniGene, and LocusLink), verification information, and predictions of a potential effect for non-synonymous polymorphisms in coding regions. The CASCAD website allows search based on an arbitrary combination of 27 different parameters related to characteristics like candidate SNP quality, genomic localization, and sequence data source or strain. In addition, the database can be queried with any custom nucleotide sequences of interest. The interface is crosslinked to other public databases and tightly coupled with primer design and local genome assembly interfaces in order to facilitate experimental verification of candidates. CONCLUSIONS: The CASCAD database discloses detailed information on rat and zebrafish candidate SNPs, including the raw data underlying its discovery. An advanced web-based search interface allows universal access to the database content and allows various queries supporting many types of research utilizing single nucleotide polymorphisms.
format	Text
id	pubmed-548278
institution	National Center for Biotechnology Information
language	English
publishDate	2005
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-5482782005-02-06 CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences Guryev, Victor Berezikov, Eugene Cuppen, Edwin BMC Genomics Database BACKGROUND: With the recent progress made in large-scale genome sequencing projects a vast amount of novel data is becoming available. A comparative sequence analysis, exploiting sequence information from various resources, can be used to uncover hidden information, such as genetic variation. Although there are enormous amounts of SNPs for a wide variety of organisms submitted to NCBI dbSNP and annotated in most genome assembly viewers like Ensembl and the UCSC Genome Browser, these platforms do not easily allow for extensive annotation and incorporation of experimental data supporting the polymorphism. However, such information is very important for selecting the most promising and useful candidate polymorphisms for use in experimental setups. DESCRIPTION: The CASCAD database is designed for presentation and query of candidate SNPs that are retrieved by in silico mining of high-throughput sequencing data. Currently, the database provides collections of laboratory rat (Rattus norvegicus) and zebrafish (Danio rerio) candidate SNPs. The database stores detailed information about raw data supporting the candidate, extensive annotation and links to external databases (e.g. GenBank, Ensembl, UniGene, and LocusLink), verification information, and predictions of a potential effect for non-synonymous polymorphisms in coding regions. The CASCAD website allows search based on an arbitrary combination of 27 different parameters related to characteristics like candidate SNP quality, genomic localization, and sequence data source or strain. In addition, the database can be queried with any custom nucleotide sequences of interest. The interface is crosslinked to other public databases and tightly coupled with primer design and local genome assembly interfaces in order to facilitate experimental verification of candidates. CONCLUSIONS: The CASCAD database discloses detailed information on rat and zebrafish candidate SNPs, including the raw data underlying its discovery. An advanced web-based search interface allows universal access to the database content and allows various queries supporting many types of research utilizing single nucleotide polymorphisms. BioMed Central 2005-01-27 /pmc/articles/PMC548278/ /pubmed/15676075 http://dx.doi.org/10.1186/1471-2164-6-10 Text en Copyright © 2005 Guryev et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Database Guryev, Victor Berezikov, Eugene Cuppen, Edwin CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences
title	CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences
title_full	CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences
title_fullStr	CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences
title_full_unstemmed	CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences
title_short	CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences
title_sort	cascad: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences
topic	Database
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC548278/ https://www.ncbi.nlm.nih.gov/pubmed/15676075 http://dx.doi.org/10.1186/1471-2164-6-10
work_keys_str_mv	AT guryevvictor cascadadatabaseofannotatedcandidatesinglenucleotidepolymorphismsassociatedwithexpressedsequences AT berezikoveugene cascadadatabaseofannotatedcandidatesinglenucleotidepolymorphismsassociatedwithexpressedsequences AT cuppenedwin cascadadatabaseofannotatedcandidatesinglenucleotidepolymorphismsassociatedwithexpressedsequences

CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences

Ejemplares similares