Cargando…
Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets
BACKGROUND: Pancreatic cancer is the 5th leading cause of cancer death in both males and females. In recent years, a wealth of gene and protein expression studies have been published broadening our understanding of pancreatic cancer biology. Due to the explosive growth in publicly available data fro...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2216037/ https://www.ncbi.nlm.nih.gov/pubmed/18045474 http://dx.doi.org/10.1186/1471-2164-8-439 |
_version_ | 1782149097757605888 |
---|---|
author | Chelala, Claude Hahn, Stephan A Whiteman, Hannah J Barry, Sayka Hariharan, Deepak Radon, Tomasz P Lemoine, Nicholas R Crnogorac-Jurcevic, Tatjana |
author_facet | Chelala, Claude Hahn, Stephan A Whiteman, Hannah J Barry, Sayka Hariharan, Deepak Radon, Tomasz P Lemoine, Nicholas R Crnogorac-Jurcevic, Tatjana |
author_sort | Chelala, Claude |
collection | PubMed |
description | BACKGROUND: Pancreatic cancer is the 5th leading cause of cancer death in both males and females. In recent years, a wealth of gene and protein expression studies have been published broadening our understanding of pancreatic cancer biology. Due to the explosive growth in publicly available data from multiple different sources it is becoming increasingly difficult for individual researchers to integrate these into their current research programmes. The Pancreatic Expression database, a generic web-based system, is aiming to close this gap by providing the research community with an open access tool, not only to mine currently available pancreatic cancer data sets but also to include their own data in the database. DESCRIPTION: Currently, the database holds 32 datasets comprising 7636 gene expression measurements extracted from 20 different published gene or protein expression studies from various pancreatic cancer types, pancreatic precursor lesions (PanINs) and chronic pancreatitis. The pancreatic data are stored in a data management system based on the BioMart technology alongside the human genome gene and protein annotations, sequence, homologue, SNP and antibody data. Interrogation of the database can be achieved through both a web-based query interface and through web services using combined criteria from pancreatic (disease stages, regulation, differential expression, expression, platform technology, publication) and/or public data (antibodies, genomic region, gene-related accessions, ontology, expression patterns, multi-species comparisons, protein data, SNPs). Thus, our database enables connections between otherwise disparate data sources and allows relatively simple navigation between all data types and annotations. CONCLUSION: The database structure and content provides a powerful and high-speed data-mining tool for cancer research. It can be used for target discovery i.e. of biomarkers from body fluids, identification and analysis of genes associated with the progression of cancer, cross-platform meta-analysis, SNP selection for pancreatic cancer association studies, cancer gene promoter analysis as well as mining cancer ontology information. The data model is generic and can be easily extended and applied to other types of cancer. The database is available online with no restrictions for the scientific community at . |
format | Text |
id | pubmed-2216037 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-22160372008-01-29 Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets Chelala, Claude Hahn, Stephan A Whiteman, Hannah J Barry, Sayka Hariharan, Deepak Radon, Tomasz P Lemoine, Nicholas R Crnogorac-Jurcevic, Tatjana BMC Genomics Database BACKGROUND: Pancreatic cancer is the 5th leading cause of cancer death in both males and females. In recent years, a wealth of gene and protein expression studies have been published broadening our understanding of pancreatic cancer biology. Due to the explosive growth in publicly available data from multiple different sources it is becoming increasingly difficult for individual researchers to integrate these into their current research programmes. The Pancreatic Expression database, a generic web-based system, is aiming to close this gap by providing the research community with an open access tool, not only to mine currently available pancreatic cancer data sets but also to include their own data in the database. DESCRIPTION: Currently, the database holds 32 datasets comprising 7636 gene expression measurements extracted from 20 different published gene or protein expression studies from various pancreatic cancer types, pancreatic precursor lesions (PanINs) and chronic pancreatitis. The pancreatic data are stored in a data management system based on the BioMart technology alongside the human genome gene and protein annotations, sequence, homologue, SNP and antibody data. Interrogation of the database can be achieved through both a web-based query interface and through web services using combined criteria from pancreatic (disease stages, regulation, differential expression, expression, platform technology, publication) and/or public data (antibodies, genomic region, gene-related accessions, ontology, expression patterns, multi-species comparisons, protein data, SNPs). Thus, our database enables connections between otherwise disparate data sources and allows relatively simple navigation between all data types and annotations. CONCLUSION: The database structure and content provides a powerful and high-speed data-mining tool for cancer research. It can be used for target discovery i.e. of biomarkers from body fluids, identification and analysis of genes associated with the progression of cancer, cross-platform meta-analysis, SNP selection for pancreatic cancer association studies, cancer gene promoter analysis as well as mining cancer ontology information. The data model is generic and can be easily extended and applied to other types of cancer. The database is available online with no restrictions for the scientific community at . BioMed Central 2007-11-28 /pmc/articles/PMC2216037/ /pubmed/18045474 http://dx.doi.org/10.1186/1471-2164-8-439 Text en Copyright © 2007 Chelala et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Chelala, Claude Hahn, Stephan A Whiteman, Hannah J Barry, Sayka Hariharan, Deepak Radon, Tomasz P Lemoine, Nicholas R Crnogorac-Jurcevic, Tatjana Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets |
title | Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets |
title_full | Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets |
title_fullStr | Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets |
title_full_unstemmed | Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets |
title_short | Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets |
title_sort | pancreatic expression database: a generic model for the organization, integration and mining of complex cancer datasets |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2216037/ https://www.ncbi.nlm.nih.gov/pubmed/18045474 http://dx.doi.org/10.1186/1471-2164-8-439 |
work_keys_str_mv | AT chelalaclaude pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets AT hahnstephana pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets AT whitemanhannahj pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets AT barrysayka pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets AT hariharandeepak pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets AT radontomaszp pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets AT lemoinenicholasr pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets AT crnogoracjurcevictatjana pancreaticexpressiondatabaseagenericmodelfortheorganizationintegrationandminingofcomplexcancerdatasets |