Cargando…
Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams
BACKGROUND: In this work, we aimed to demonstrate how to utilize the lab test results and other clinical information to support precision medicine research and clinical decisions on complex diseases, with the support of electronic medical record facilities. We defined “clinotypes” as clinical inform...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7903607/ https://www.ncbi.nlm.nih.gov/pubmed/33627109 http://dx.doi.org/10.1186/s12911-021-01387-z |
_version_ | 1783654768460169216 |
---|---|
author | Nguyen, Thanh Zhang, Tongbin Fox, Geoffrey Zeng, Sisi Cao, Ni Pan, Chuandi Chen, Jake Y. |
author_facet | Nguyen, Thanh Zhang, Tongbin Fox, Geoffrey Zeng, Sisi Cao, Ni Pan, Chuandi Chen, Jake Y. |
author_sort | Nguyen, Thanh |
collection | PubMed |
description | BACKGROUND: In this work, we aimed to demonstrate how to utilize the lab test results and other clinical information to support precision medicine research and clinical decisions on complex diseases, with the support of electronic medical record facilities. We defined “clinotypes” as clinical information that could be observed and measured objectively using biomedical instruments. From well-known ‘omic’ problem definitions, we defined problems using clinotype information, including stratifying patients—identifying interested sub cohorts for future studies, mining significant associations between clinotypes and specific phenotypes-diseases, and discovering potential linkages between clinotype and genomic information. We solved these problems by integrating public omic databases and applying advanced machine learning and visual analytic techniques on two-year health exam records from a large population of healthy southern Chinese individuals (size n = 91,354). When developing the solution, we carefully addressed the missing information, imbalance and non-uniformed data annotation issues. RESULTS: We organized the techniques and solutions to address the problems and issues above into CPA framework (Clinotype Prediction and Association-finding). At the data preprocessing step, we handled the missing value issue with predicted accuracy of 0.760. We curated 12,635 clinotype-gene associations. We found 147 Associations between 147 chronic diseases-phenotype and clinotypes, which improved the disease predictive performance to AUC (average) of 0.967. We mined 182 significant clinotype-clinotype associations among 69 clinotypes. CONCLUSIONS: Our results showed strong potential connectivity between the omics information and the clinical lab test information. The results further emphasized the needs to utilize and integrate the clinical information, especially the lab test results, in future PheWas and omic studies. Furthermore, it showed that the clinotype information could initiate an alternative research direction and serve as an independent field of data to support the well-known ‘phenome’ and ‘genome’ researches. |
format | Online Article Text |
id | pubmed-7903607 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79036072021-03-01 Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams Nguyen, Thanh Zhang, Tongbin Fox, Geoffrey Zeng, Sisi Cao, Ni Pan, Chuandi Chen, Jake Y. BMC Med Inform Decis Mak Research BACKGROUND: In this work, we aimed to demonstrate how to utilize the lab test results and other clinical information to support precision medicine research and clinical decisions on complex diseases, with the support of electronic medical record facilities. We defined “clinotypes” as clinical information that could be observed and measured objectively using biomedical instruments. From well-known ‘omic’ problem definitions, we defined problems using clinotype information, including stratifying patients—identifying interested sub cohorts for future studies, mining significant associations between clinotypes and specific phenotypes-diseases, and discovering potential linkages between clinotype and genomic information. We solved these problems by integrating public omic databases and applying advanced machine learning and visual analytic techniques on two-year health exam records from a large population of healthy southern Chinese individuals (size n = 91,354). When developing the solution, we carefully addressed the missing information, imbalance and non-uniformed data annotation issues. RESULTS: We organized the techniques and solutions to address the problems and issues above into CPA framework (Clinotype Prediction and Association-finding). At the data preprocessing step, we handled the missing value issue with predicted accuracy of 0.760. We curated 12,635 clinotype-gene associations. We found 147 Associations between 147 chronic diseases-phenotype and clinotypes, which improved the disease predictive performance to AUC (average) of 0.967. We mined 182 significant clinotype-clinotype associations among 69 clinotypes. CONCLUSIONS: Our results showed strong potential connectivity between the omics information and the clinical lab test information. The results further emphasized the needs to utilize and integrate the clinical information, especially the lab test results, in future PheWas and omic studies. Furthermore, it showed that the clinotype information could initiate an alternative research direction and serve as an independent field of data to support the well-known ‘phenome’ and ‘genome’ researches. BioMed Central 2021-02-24 /pmc/articles/PMC7903607/ /pubmed/33627109 http://dx.doi.org/10.1186/s12911-021-01387-z Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Nguyen, Thanh Zhang, Tongbin Fox, Geoffrey Zeng, Sisi Cao, Ni Pan, Chuandi Chen, Jake Y. Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams |
title | Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams |
title_full | Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams |
title_fullStr | Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams |
title_full_unstemmed | Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams |
title_short | Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams |
title_sort | linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7903607/ https://www.ncbi.nlm.nih.gov/pubmed/33627109 http://dx.doi.org/10.1186/s12911-021-01387-z |
work_keys_str_mv | AT nguyenthanh linkingclinotypestophenotypesandgenotypesfromlaboratorytestresultsincomprehensivephysicalexams AT zhangtongbin linkingclinotypestophenotypesandgenotypesfromlaboratorytestresultsincomprehensivephysicalexams AT foxgeoffrey linkingclinotypestophenotypesandgenotypesfromlaboratorytestresultsincomprehensivephysicalexams AT zengsisi linkingclinotypestophenotypesandgenotypesfromlaboratorytestresultsincomprehensivephysicalexams AT caoni linkingclinotypestophenotypesandgenotypesfromlaboratorytestresultsincomprehensivephysicalexams AT panchuandi linkingclinotypestophenotypesandgenotypesfromlaboratorytestresultsincomprehensivephysicalexams AT chenjakey linkingclinotypestophenotypesandgenotypesfromlaboratorytestresultsincomprehensivephysicalexams |