Cargando…

The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data

BACKGROUND: Systems-level approaches are increasingly common in both murine and human translational studies. These approaches employ multiple high information content assays. As a result, there is a need for tools to integrate heterogeneous types of laboratory and clinical/demographic data, and to a...

Descripción completa

Detalles Bibliográficos
Autores principales: Siebert, Janet C, Munsil, Wes, Rosenberg-Hasson, Yael, Davis, Mark M, Maecker, Holden T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3358233/
https://www.ncbi.nlm.nih.gov/pubmed/22452993
http://dx.doi.org/10.1186/1479-5876-10-62
_version_ 1782233755063156736
author Siebert, Janet C
Munsil, Wes
Rosenberg-Hasson, Yael
Davis, Mark M
Maecker, Holden T
author_facet Siebert, Janet C
Munsil, Wes
Rosenberg-Hasson, Yael
Davis, Mark M
Maecker, Holden T
author_sort Siebert, Janet C
collection PubMed
description BACKGROUND: Systems-level approaches are increasingly common in both murine and human translational studies. These approaches employ multiple high information content assays. As a result, there is a need for tools to integrate heterogeneous types of laboratory and clinical/demographic data, and to allow the exploration of that data by aggregating and/or segregating results based on particular variables (e.g., mean cytokine levels by age and gender). METHODS: Here we describe the application of standard data warehousing tools to create a novel environment for user-driven upload, integration, and exploration of heterogeneous data. The system presented here currently supports flow cytometry and immunoassays performed in the Stanford Human Immune Monitoring Center, but could be applied more generally. RESULTS: Users upload assay results contained in platform-specific spreadsheets of a defined format, and clinical and demographic data in spreadsheets of flexible format. Users then map sample IDs to connect the assay results with the metadata. An OLAP (on-line analytical processing) data exploration interface allows filtering and display of various dimensions (e.g., Luminex analytes in rows, treatment group in columns, filtered on a particular study). Statistics such as mean, median, and N can be displayed. The views can be expanded or contracted to aggregate or segregate data at various levels. Individual-level data is accessible with a single click. The result is a user-driven system that permits data integration and exploration in a variety of settings. We show how the system can be used to find gender-specific differences in serum cytokine levels, and compare them across experiments and assay types. CONCLUSIONS: We have used the tools and techniques of data warehousing, including open-source business intelligence software, to support investigator-driven data integration and mining of diverse immunological data.
format Online
Article
Text
id pubmed-3358233
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33582332012-05-23 The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data Siebert, Janet C Munsil, Wes Rosenberg-Hasson, Yael Davis, Mark M Maecker, Holden T J Transl Med Methodology BACKGROUND: Systems-level approaches are increasingly common in both murine and human translational studies. These approaches employ multiple high information content assays. As a result, there is a need for tools to integrate heterogeneous types of laboratory and clinical/demographic data, and to allow the exploration of that data by aggregating and/or segregating results based on particular variables (e.g., mean cytokine levels by age and gender). METHODS: Here we describe the application of standard data warehousing tools to create a novel environment for user-driven upload, integration, and exploration of heterogeneous data. The system presented here currently supports flow cytometry and immunoassays performed in the Stanford Human Immune Monitoring Center, but could be applied more generally. RESULTS: Users upload assay results contained in platform-specific spreadsheets of a defined format, and clinical and demographic data in spreadsheets of flexible format. Users then map sample IDs to connect the assay results with the metadata. An OLAP (on-line analytical processing) data exploration interface allows filtering and display of various dimensions (e.g., Luminex analytes in rows, treatment group in columns, filtered on a particular study). Statistics such as mean, median, and N can be displayed. The views can be expanded or contracted to aggregate or segregate data at various levels. Individual-level data is accessible with a single click. The result is a user-driven system that permits data integration and exploration in a variety of settings. We show how the system can be used to find gender-specific differences in serum cytokine levels, and compare them across experiments and assay types. CONCLUSIONS: We have used the tools and techniques of data warehousing, including open-source business intelligence software, to support investigator-driven data integration and mining of diverse immunological data. BioMed Central 2012-03-28 /pmc/articles/PMC3358233/ /pubmed/22452993 http://dx.doi.org/10.1186/1479-5876-10-62 Text en Copyright ©2012 Siebert et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Siebert, Janet C
Munsil, Wes
Rosenberg-Hasson, Yael
Davis, Mark M
Maecker, Holden T
The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data
title The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data
title_full The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data
title_fullStr The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data
title_full_unstemmed The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data
title_short The Stanford Data Miner: a novel approach for integrating and exploring heterogeneous immunological data
title_sort stanford data miner: a novel approach for integrating and exploring heterogeneous immunological data
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3358233/
https://www.ncbi.nlm.nih.gov/pubmed/22452993
http://dx.doi.org/10.1186/1479-5876-10-62
work_keys_str_mv AT siebertjanetc thestanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT munsilwes thestanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT rosenberghassonyael thestanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT davismarkm thestanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT maeckerholdent thestanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT siebertjanetc stanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT munsilwes stanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT rosenberghassonyael stanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT davismarkm stanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata
AT maeckerholdent stanforddatamineranovelapproachforintegratingandexploringheterogeneousimmunologicaldata