Cargando…

The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis

BACKGROUND: Statistics play a critical role in biological and clinical research. However, most reports of scientific results in the published literature make it difficult for the reader to reproduce the statistical analyses performed in achieving those results because they provide inadequate documen...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Jie, Harris, Marcelline R., Masci, Anna Maria, Lin, Yu, Hero, Alfred, Smith, Barry, He, Yongqun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5024438/
https://www.ncbi.nlm.nih.gov/pubmed/27627881
http://dx.doi.org/10.1186/s13326-016-0100-2
_version_ 1782453799278870528
author Zheng, Jie
Harris, Marcelline R.
Masci, Anna Maria
Lin, Yu
Hero, Alfred
Smith, Barry
He, Yongqun
author_facet Zheng, Jie
Harris, Marcelline R.
Masci, Anna Maria
Lin, Yu
Hero, Alfred
Smith, Barry
He, Yongqun
author_sort Zheng, Jie
collection PubMed
description BACKGROUND: Statistics play a critical role in biological and clinical research. However, most reports of scientific results in the published literature make it difficult for the reader to reproduce the statistical analyses performed in achieving those results because they provide inadequate documentation of the statistical tests and algorithms applied. The Ontology of Biological and Clinical Statistics (OBCS) is put forward here as a step towards solving this problem. RESULTS: The terms in OBCS including ‘data collection’, ‘data transformation in statistics’, ‘data visualization’, ‘statistical data analysis’, and ‘drawing a conclusion based on data’, cover the major types of statistical processes used in basic biological research and clinical outcome studies. OBCS is aligned with the Basic Formal Ontology (BFO) and extends the Ontology of Biomedical Investigations (OBI), an OBO (Open Biological and Biomedical Ontologies) Foundry ontology supported by over 20 research communities. Currently, OBCS comprehends 878 terms, representing 20 BFO classes, 403 OBI classes, 229 OBCS specific classes, and 122 classes imported from ten other OBO ontologies. We discuss two examples illustrating how the ontology is being applied. In the first (biological) use case, we describe how OBCS was applied to represent the high throughput microarray data analysis of immunological transcriptional profiles in human subjects vaccinated with an influenza vaccine. In the second (clinical outcomes) use case, we applied OBCS to represent the processing of electronic health care data to determine the associations between hospital staffing levels and patient mortality. Our case studies were designed to show how OBCS can be used for the consistent representation of statistical analysis pipelines under two different research paradigms. Other ongoing projects using OBCS for statistical data processing are also discussed. The OBCS source code and documentation are available at: https://github.com/obcs/obcs. CONCLUSIONS: The Ontology of Biological and Clinical Statistics (OBCS) is a community-based open source ontology in the domain of biological and clinical statistics. OBCS is a timely ontology that represents statistics-related terms and their relations in a rigorous fashion, facilitates standard data analysis and integration, and supports reproducible biological and clinical research.
format Online
Article
Text
id pubmed-5024438
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50244382016-09-20 The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis Zheng, Jie Harris, Marcelline R. Masci, Anna Maria Lin, Yu Hero, Alfred Smith, Barry He, Yongqun J Biomed Semantics Research BACKGROUND: Statistics play a critical role in biological and clinical research. However, most reports of scientific results in the published literature make it difficult for the reader to reproduce the statistical analyses performed in achieving those results because they provide inadequate documentation of the statistical tests and algorithms applied. The Ontology of Biological and Clinical Statistics (OBCS) is put forward here as a step towards solving this problem. RESULTS: The terms in OBCS including ‘data collection’, ‘data transformation in statistics’, ‘data visualization’, ‘statistical data analysis’, and ‘drawing a conclusion based on data’, cover the major types of statistical processes used in basic biological research and clinical outcome studies. OBCS is aligned with the Basic Formal Ontology (BFO) and extends the Ontology of Biomedical Investigations (OBI), an OBO (Open Biological and Biomedical Ontologies) Foundry ontology supported by over 20 research communities. Currently, OBCS comprehends 878 terms, representing 20 BFO classes, 403 OBI classes, 229 OBCS specific classes, and 122 classes imported from ten other OBO ontologies. We discuss two examples illustrating how the ontology is being applied. In the first (biological) use case, we describe how OBCS was applied to represent the high throughput microarray data analysis of immunological transcriptional profiles in human subjects vaccinated with an influenza vaccine. In the second (clinical outcomes) use case, we applied OBCS to represent the processing of electronic health care data to determine the associations between hospital staffing levels and patient mortality. Our case studies were designed to show how OBCS can be used for the consistent representation of statistical analysis pipelines under two different research paradigms. Other ongoing projects using OBCS for statistical data processing are also discussed. The OBCS source code and documentation are available at: https://github.com/obcs/obcs. CONCLUSIONS: The Ontology of Biological and Clinical Statistics (OBCS) is a community-based open source ontology in the domain of biological and clinical statistics. OBCS is a timely ontology that represents statistics-related terms and their relations in a rigorous fashion, facilitates standard data analysis and integration, and supports reproducible biological and clinical research. BioMed Central 2016-09-14 /pmc/articles/PMC5024438/ /pubmed/27627881 http://dx.doi.org/10.1186/s13326-016-0100-2 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zheng, Jie
Harris, Marcelline R.
Masci, Anna Maria
Lin, Yu
Hero, Alfred
Smith, Barry
He, Yongqun
The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis
title The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis
title_full The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis
title_fullStr The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis
title_full_unstemmed The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis
title_short The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis
title_sort ontology of biological and clinical statistics (obcs) for standardized and reproducible statistical analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5024438/
https://www.ncbi.nlm.nih.gov/pubmed/27627881
http://dx.doi.org/10.1186/s13326-016-0100-2
work_keys_str_mv AT zhengjie theontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT harrismarcelliner theontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT masciannamaria theontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT linyu theontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT heroalfred theontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT smithbarry theontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT heyongqun theontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT zhengjie ontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT harrismarcelliner ontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT masciannamaria ontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT linyu ontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT heroalfred ontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT smithbarry ontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis
AT heyongqun ontologyofbiologicalandclinicalstatisticsobcsforstandardizedandreproduciblestatisticalanalysis