Cargando…
Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data
We outline a paradigm for meta-microarray database creation and integration with clinical variables. We use as our implementation example a breast cancer database linking RNA expression measurements (by microarray) and clinical variables, such as survival metrics and tumor size. Such an endeavor inv...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Medical Informatics Association
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814460/ https://www.ncbi.nlm.nih.gov/pubmed/24303324 |
_version_ | 1782289259890212864 |
---|---|
author | Planey, Catherine R. Butte, Atul J. |
author_facet | Planey, Catherine R. Butte, Atul J. |
author_sort | Planey, Catherine R. |
collection | PubMed |
description | We outline a paradigm for meta-microarray database creation and integration with clinical variables. We use as our implementation example a breast cancer database linking RNA expression measurements (by microarray) and clinical variables, such as survival metrics and tumor size. Such an endeavor involves integrating across different microarray datasets as well as clinical parameters. To this end, we created a data curation and processing pipeline, formal database ontology, and SQL schema to optimally query, analyze and visualize data from over 30 publicly available breast cancer microarray studies listed in the Gene Expression Omnibus (GEO). We demonstrate several pilot examples using this database. This methodology serves as a model for future meta-analyses of complex public clinical datasets, in particular those in the field of cancer. |
format | Online Article Text |
id | pubmed-3814460 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | American Medical Informatics Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-38144602013-12-03 Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data Planey, Catherine R. Butte, Atul J. AMIA Jt Summits Transl Sci Proc Articles We outline a paradigm for meta-microarray database creation and integration with clinical variables. We use as our implementation example a breast cancer database linking RNA expression measurements (by microarray) and clinical variables, such as survival metrics and tumor size. Such an endeavor involves integrating across different microarray datasets as well as clinical parameters. To this end, we created a data curation and processing pipeline, formal database ontology, and SQL schema to optimally query, analyze and visualize data from over 30 publicly available breast cancer microarray studies listed in the Gene Expression Omnibus (GEO). We demonstrate several pilot examples using this database. This methodology serves as a model for future meta-analyses of complex public clinical datasets, in particular those in the field of cancer. American Medical Informatics Association 2013-03-18 /pmc/articles/PMC3814460/ /pubmed/24303324 Text en ©2013 AMIA - All rights reserved. |
spellingShingle | Articles Planey, Catherine R. Butte, Atul J. Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data |
title | Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data |
title_full | Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data |
title_fullStr | Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data |
title_full_unstemmed | Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data |
title_short | Database integration of 4923 publicly-available samples of breast cancer molecular and clinical data |
title_sort | database integration of 4923 publicly-available samples of breast cancer molecular and clinical data |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814460/ https://www.ncbi.nlm.nih.gov/pubmed/24303324 |
work_keys_str_mv | AT planeycatheriner databaseintegrationof4923publiclyavailablesamplesofbreastcancermolecularandclinicaldata AT butteatulj databaseintegrationof4923publiclyavailablesamplesofbreastcancermolecularandclinicaldata |