Cargando…

An information model for computable cancer phenotypes

BACKGROUND: Standards, methods, and tools supporting the integration of clinical data and genomic information are an area of significant need and rapid growth in biomedical informatics. Integration of cancer clinical data and cancer genomic information poses unique challenges, because of the high vo...

Descripción completa

Detalles Bibliográficos
Autores principales: Hochheiser, Harry, Castine, Melissa, Harris, David, Savova, Guergana, Jacobson, Rebecca S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5024416/
https://www.ncbi.nlm.nih.gov/pubmed/27629872
http://dx.doi.org/10.1186/s12911-016-0358-4
_version_ 1782453794721759232
author Hochheiser, Harry
Castine, Melissa
Harris, David
Savova, Guergana
Jacobson, Rebecca S.
author_facet Hochheiser, Harry
Castine, Melissa
Harris, David
Savova, Guergana
Jacobson, Rebecca S.
author_sort Hochheiser, Harry
collection PubMed
description BACKGROUND: Standards, methods, and tools supporting the integration of clinical data and genomic information are an area of significant need and rapid growth in biomedical informatics. Integration of cancer clinical data and cancer genomic information poses unique challenges, because of the high volume and complexity of clinical data, as well as the heterogeneity and instability of cancer genome data when compared with germline data. Current information models of clinical and genomic data are not sufficiently expressive to represent individual observations and to aggregate those observations into longitudinal summaries over the course of cancer care. These models are acutely needed to support the development of systems and tools for generating the so called clinical “deep phenotype” of individual cancer patients, a process which remains almost entirely manual in cancer research and precision medicine. METHODS: Reviews of existing ontologies and interviews with cancer researchers were used to inform iterative development of a cancer phenotype information model. We translated a subset of the Fast Healthcare Interoperability Resources (FHIR) models into the OWL 2 Description Logic (DL) representation, and added extensions as needed for modeling cancer phenotypes with terms derived from the NCI Thesaurus. Models were validated with domain experts and evaluated against competency questions. RESULTS: The DeepPhe Information model represents cancer phenotype data at increasing levels of abstraction from mention level in clinical documents to summaries of key events and findings. We describe the model using breast cancer as an example, depicting methods to represent phenotypic features of cancers, tumors, treatment regimens, and specific biologic behaviors that span the entire course of a patient’s disease. CONCLUSIONS: We present a multi-scale information model for representing individual document mentions, document level classifications, episodes along a disease course, and phenotype summarization, linking individual observations to high-level summaries in support of subsequent integration and analysis.
format Online
Article
Text
id pubmed-5024416
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50244162016-09-20 An information model for computable cancer phenotypes Hochheiser, Harry Castine, Melissa Harris, David Savova, Guergana Jacobson, Rebecca S. BMC Med Inform Decis Mak Research Article BACKGROUND: Standards, methods, and tools supporting the integration of clinical data and genomic information are an area of significant need and rapid growth in biomedical informatics. Integration of cancer clinical data and cancer genomic information poses unique challenges, because of the high volume and complexity of clinical data, as well as the heterogeneity and instability of cancer genome data when compared with germline data. Current information models of clinical and genomic data are not sufficiently expressive to represent individual observations and to aggregate those observations into longitudinal summaries over the course of cancer care. These models are acutely needed to support the development of systems and tools for generating the so called clinical “deep phenotype” of individual cancer patients, a process which remains almost entirely manual in cancer research and precision medicine. METHODS: Reviews of existing ontologies and interviews with cancer researchers were used to inform iterative development of a cancer phenotype information model. We translated a subset of the Fast Healthcare Interoperability Resources (FHIR) models into the OWL 2 Description Logic (DL) representation, and added extensions as needed for modeling cancer phenotypes with terms derived from the NCI Thesaurus. Models were validated with domain experts and evaluated against competency questions. RESULTS: The DeepPhe Information model represents cancer phenotype data at increasing levels of abstraction from mention level in clinical documents to summaries of key events and findings. We describe the model using breast cancer as an example, depicting methods to represent phenotypic features of cancers, tumors, treatment regimens, and specific biologic behaviors that span the entire course of a patient’s disease. CONCLUSIONS: We present a multi-scale information model for representing individual document mentions, document level classifications, episodes along a disease course, and phenotype summarization, linking individual observations to high-level summaries in support of subsequent integration and analysis. BioMed Central 2016-09-15 /pmc/articles/PMC5024416/ /pubmed/27629872 http://dx.doi.org/10.1186/s12911-016-0358-4 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hochheiser, Harry
Castine, Melissa
Harris, David
Savova, Guergana
Jacobson, Rebecca S.
An information model for computable cancer phenotypes
title An information model for computable cancer phenotypes
title_full An information model for computable cancer phenotypes
title_fullStr An information model for computable cancer phenotypes
title_full_unstemmed An information model for computable cancer phenotypes
title_short An information model for computable cancer phenotypes
title_sort information model for computable cancer phenotypes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5024416/
https://www.ncbi.nlm.nih.gov/pubmed/27629872
http://dx.doi.org/10.1186/s12911-016-0358-4
work_keys_str_mv AT hochheiserharry aninformationmodelforcomputablecancerphenotypes
AT castinemelissa aninformationmodelforcomputablecancerphenotypes
AT harrisdavid aninformationmodelforcomputablecancerphenotypes
AT savovaguergana aninformationmodelforcomputablecancerphenotypes
AT jacobsonrebeccas aninformationmodelforcomputablecancerphenotypes
AT hochheiserharry informationmodelforcomputablecancerphenotypes
AT castinemelissa informationmodelforcomputablecancerphenotypes
AT harrisdavid informationmodelforcomputablecancerphenotypes
AT savovaguergana informationmodelforcomputablecancerphenotypes
AT jacobsonrebeccas informationmodelforcomputablecancerphenotypes