Cargando…
Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP)
The database of Genotypes and Phenotypes (dbGaP) contains various types of data generated from genome-wide association studies (GWAS). These data can be used to facilitate novel scientific discoveries and to reduce cost and time for exploratory research. However, idiosyncrasies and inconsistencies i...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3776754/ https://www.ncbi.nlm.nih.gov/pubmed/24058713 http://dx.doi.org/10.1371/journal.pone.0076384 |
_version_ | 1782284875965923328 |
---|---|
author | Lin, Ko-Wei Tharp, Melissa Conway, Mike Hsieh, Alexander Ross, Mindy Kim, Jihoon Kim, Hyeon-Eui |
author_facet | Lin, Ko-Wei Tharp, Melissa Conway, Mike Hsieh, Alexander Ross, Mindy Kim, Jihoon Kim, Hyeon-Eui |
author_sort | Lin, Ko-Wei |
collection | PubMed |
description | The database of Genotypes and Phenotypes (dbGaP) contains various types of data generated from genome-wide association studies (GWAS). These data can be used to facilitate novel scientific discoveries and to reduce cost and time for exploratory research. However, idiosyncrasies and inconsistencies in phenotype variable names are a major barrier to reusing these data. We addressed these challenges in standardizing phenotype variables by formalizing their descriptions using Clinical Element Models (CEM). Designed to represent clinical data, CEMs were highly expressive and thus were able to represent a majority (77.5%) of the 215 phenotype variable descriptions. However, their high expressivity also made it difficult to directly apply them to research data such as phenotype variables in dbGaP. Our study suggested that simplification of the template models makes it more straightforward to formally represent the key semantics of phenotype variables. |
format | Online Article Text |
id | pubmed-3776754 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37767542013-09-20 Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) Lin, Ko-Wei Tharp, Melissa Conway, Mike Hsieh, Alexander Ross, Mindy Kim, Jihoon Kim, Hyeon-Eui PLoS One Research Article The database of Genotypes and Phenotypes (dbGaP) contains various types of data generated from genome-wide association studies (GWAS). These data can be used to facilitate novel scientific discoveries and to reduce cost and time for exploratory research. However, idiosyncrasies and inconsistencies in phenotype variable names are a major barrier to reusing these data. We addressed these challenges in standardizing phenotype variables by formalizing their descriptions using Clinical Element Models (CEM). Designed to represent clinical data, CEMs were highly expressive and thus were able to represent a majority (77.5%) of the 215 phenotype variable descriptions. However, their high expressivity also made it difficult to directly apply them to research data such as phenotype variables in dbGaP. Our study suggested that simplification of the template models makes it more straightforward to formally represent the key semantics of phenotype variables. Public Library of Science 2013-09-18 /pmc/articles/PMC3776754/ /pubmed/24058713 http://dx.doi.org/10.1371/journal.pone.0076384 Text en © 2013 Lin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Lin, Ko-Wei Tharp, Melissa Conway, Mike Hsieh, Alexander Ross, Mindy Kim, Jihoon Kim, Hyeon-Eui Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) |
title | Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) |
title_full | Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) |
title_fullStr | Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) |
title_full_unstemmed | Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) |
title_short | Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP) |
title_sort | feasibility of using clinical element models (cem) to standardize phenotype variables in the database of genotypes and phenotypes (dbgap) |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3776754/ https://www.ncbi.nlm.nih.gov/pubmed/24058713 http://dx.doi.org/10.1371/journal.pone.0076384 |
work_keys_str_mv | AT linkowei feasibilityofusingclinicalelementmodelscemtostandardizephenotypevariablesinthedatabaseofgenotypesandphenotypesdbgap AT tharpmelissa feasibilityofusingclinicalelementmodelscemtostandardizephenotypevariablesinthedatabaseofgenotypesandphenotypesdbgap AT conwaymike feasibilityofusingclinicalelementmodelscemtostandardizephenotypevariablesinthedatabaseofgenotypesandphenotypesdbgap AT hsiehalexander feasibilityofusingclinicalelementmodelscemtostandardizephenotypevariablesinthedatabaseofgenotypesandphenotypesdbgap AT rossmindy feasibilityofusingclinicalelementmodelscemtostandardizephenotypevariablesinthedatabaseofgenotypesandphenotypesdbgap AT kimjihoon feasibilityofusingclinicalelementmodelscemtostandardizephenotypevariablesinthedatabaseofgenotypesandphenotypesdbgap AT kimhyeoneui feasibilityofusingclinicalelementmodelscemtostandardizephenotypevariablesinthedatabaseofgenotypesandphenotypesdbgap |