Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP)

The database of Genotypes and Phenotypes (dbGaP) contains various types of data generated from genome-wide association studies (GWAS). These data can be used to facilitate novel scientific discoveries and to reduce cost and time for exploratory research. However, idiosyncrasies and inconsistencies i...

Bibliographic Details
Main authors: Lin, Ko-Wei, Tharp, Melissa, Conway, Mike, Hsieh, Alexander, Ross, Mindy, Kim, Jihoon, Kim, Hyeon-Eui
Format: Online Article Text
Language: English
Published: Public Library of Science 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3776754/
https://www.ncbi.nlm.nih.gov/pubmed/24058713
http://dx.doi.org/10.1371/journal.pone.0076384
author Lin, Ko-Wei
Tharp, Melissa
Conway, Mike
Hsieh, Alexander
Ross, Mindy
Kim, Jihoon
Kim, Hyeon-Eui
collection PubMed
description The database of Genotypes and Phenotypes (dbGaP) contains various types of data generated from genome-wide association studies (GWAS). These data can be used to facilitate novel scientific discoveries and to reduce cost and time for exploratory research. However, idiosyncrasies and inconsistencies in phenotype variable names are a major barrier to reusing these data. We addressed these challenges in standardizing phenotype variables by formalizing their descriptions using Clinical Element Models (CEM). Designed to represent clinical data, CEMs were highly expressive and thus were able to represent a majority (77.5%) of the 215 phenotype variable descriptions. However, their high expressivity also made it difficult to directly apply them to research data such as phenotype variables in dbGaP. Our study suggested that simplification of the template models makes it more straightforward to formally represent the key semantics of phenotype variables.
format Online
Article
Text
id pubmed-3776754
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37767542013-09-20 Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) Lin, Ko-Wei Tharp, Melissa Conway, Mike Hsieh, Alexander Ross, Mindy Kim, Jihoon Kim, Hyeon-Eui PLoS One Research Article The database of Genotypes and Phenotypes (dbGaP) contains various types of data generated from genome-wide association studies (GWAS). These data can be used to facilitate novel scientific discoveries and to reduce cost and time for exploratory research. However, idiosyncrasies and inconsistencies in phenotype variable names are a major barrier to reusing these data. We addressed these challenges in standardizing phenotype variables by formalizing their descriptions using Clinical Element Models (CEM). Designed to represent clinical data, CEMs were highly expressive and thus were able to represent a majority (77.5%) of the 215 phenotype variable descriptions. However, their high expressivity also made it difficult to directly apply them to research data such as phenotype variables in dbGaP. Our study suggested that simplification of the template models makes it more straightforward to formally represent the key semantics of phenotype variables. Public Library of Science 2013-09-18 /pmc/articles/PMC3776754/ /pubmed/24058713 http://dx.doi.org/10.1371/journal.pone.0076384 Text en © 2013 Lin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3776754/
https://www.ncbi.nlm.nih.gov/pubmed/24058713
http://dx.doi.org/10.1371/journal.pone.0076384