Cargando…

Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository

Animals are used to study the pathogenesis of various human diseases, but typically as animal models with induced disease. However, companion animals develop disease spontaneously in a way that mirrors disease development in humans. The purpose of this study is to develop a semantic and domain-speci...

Descripción completa

Detalles Bibliográficos
Autores principales: Boland, Mary Regina, Casal, Margret L., Kraus, Marc S., Gelzer, Anna R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6901510/
https://www.ncbi.nlm.nih.gov/pubmed/31819105
http://dx.doi.org/10.1038/s41598-019-55035-8
_version_ 1783477513804054528
author Boland, Mary Regina
Casal, Margret L.
Kraus, Marc S.
Gelzer, Anna R.
author_facet Boland, Mary Regina
Casal, Margret L.
Kraus, Marc S.
Gelzer, Anna R.
author_sort Boland, Mary Regina
collection PubMed
description Animals are used to study the pathogenesis of various human diseases, but typically as animal models with induced disease. However, companion animals develop disease spontaneously in a way that mirrors disease development in humans. The purpose of this study is to develop a semantic and domain-specific method to enable construction of a data repository from a veterinary hospital that would be useful for future studies. We developed a two-phase method that combines semantic and domain-specific approaches to construct a canine data repository of clinical data collected during routine care at the Matthew J Ryan Veterinary Hospital of the University of Pennsylvania (PennVet). Our framework consists of two phases: (1) a semantic data-cleaning phase and (2) a domain-specific data-cleaning phase. We validated our data repository using a gold standard of known breed predispositions for certain diseases (i.e., mitral valve disease, atrial fibrillation and osteosarcoma). Our two-phase method allowed us to maximize data retention (99.8% of data retained), while ensuring the quality of our result. Our final population contained 84,405 dogs treated between 2000 and 2017 from 194 distinct dog breeds. We observed the expected breed associations with mitral valve disease, atrial fibrillation, and osteosarcoma (P < 0.05) after adjusting for multiple comparisons. Precision ranged from 60.0 to 83.3 for the three diseases (avg. 74.2) and recall ranged from 31.6 to 83.3 (avg. 53.3). Our study describes a two-phase method to construct a clinical data repository using canine data obtained during routine clinical care at a veterinary hospital.
format Online
Article
Text
id pubmed-6901510
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69015102019-12-12 Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository Boland, Mary Regina Casal, Margret L. Kraus, Marc S. Gelzer, Anna R. Sci Rep Article Animals are used to study the pathogenesis of various human diseases, but typically as animal models with induced disease. However, companion animals develop disease spontaneously in a way that mirrors disease development in humans. The purpose of this study is to develop a semantic and domain-specific method to enable construction of a data repository from a veterinary hospital that would be useful for future studies. We developed a two-phase method that combines semantic and domain-specific approaches to construct a canine data repository of clinical data collected during routine care at the Matthew J Ryan Veterinary Hospital of the University of Pennsylvania (PennVet). Our framework consists of two phases: (1) a semantic data-cleaning phase and (2) a domain-specific data-cleaning phase. We validated our data repository using a gold standard of known breed predispositions for certain diseases (i.e., mitral valve disease, atrial fibrillation and osteosarcoma). Our two-phase method allowed us to maximize data retention (99.8% of data retained), while ensuring the quality of our result. Our final population contained 84,405 dogs treated between 2000 and 2017 from 194 distinct dog breeds. We observed the expected breed associations with mitral valve disease, atrial fibrillation, and osteosarcoma (P < 0.05) after adjusting for multiple comparisons. Precision ranged from 60.0 to 83.3 for the three diseases (avg. 74.2) and recall ranged from 31.6 to 83.3 (avg. 53.3). Our study describes a two-phase method to construct a clinical data repository using canine data obtained during routine clinical care at a veterinary hospital. Nature Publishing Group UK 2019-12-09 /pmc/articles/PMC6901510/ /pubmed/31819105 http://dx.doi.org/10.1038/s41598-019-55035-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Boland, Mary Regina
Casal, Margret L.
Kraus, Marc S.
Gelzer, Anna R.
Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository
title Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository
title_full Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository
title_fullStr Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository
title_full_unstemmed Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository
title_short Applied Veterinary Informatics: Development of a Semantic and Domain-Specific Method to Construct a Canine Data Repository
title_sort applied veterinary informatics: development of a semantic and domain-specific method to construct a canine data repository
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6901510/
https://www.ncbi.nlm.nih.gov/pubmed/31819105
http://dx.doi.org/10.1038/s41598-019-55035-8
work_keys_str_mv AT bolandmaryregina appliedveterinaryinformaticsdevelopmentofasemanticanddomainspecificmethodtoconstructacaninedatarepository
AT casalmargretl appliedveterinaryinformaticsdevelopmentofasemanticanddomainspecificmethodtoconstructacaninedatarepository
AT krausmarcs appliedveterinaryinformaticsdevelopmentofasemanticanddomainspecificmethodtoconstructacaninedatarepository
AT gelzerannar appliedveterinaryinformaticsdevelopmentofasemanticanddomainspecificmethodtoconstructacaninedatarepository