Cargando…
Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study
Health-care initiatives are pushing the development and utilization of clinical data for medical discovery and translational research studies. Machine learning tools implemented for Big Data have been applied to detect patterns in complex diseases. This study focuses on hypertension and examines phe...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862746/ https://www.ncbi.nlm.nih.gov/pubmed/27199552 http://dx.doi.org/10.4137/BBI.S29473 |
_version_ | 1782431382923902976 |
---|---|
author | Seffens, William Evans, Chad Taylor, Herman |
author_facet | Seffens, William Evans, Chad Taylor, Herman |
author_sort | Seffens, William |
collection | PubMed |
description | Health-care initiatives are pushing the development and utilization of clinical data for medical discovery and translational research studies. Machine learning tools implemented for Big Data have been applied to detect patterns in complex diseases. This study focuses on hypertension and examines phenotype data across a major clinical study called Minority Health Genomics and Translational Research Repository Database composed of self-reported African American (AA) participants combined with related cohorts. Prior genome-wide association studies for hypertension in AAs presumed that an increase of disease burden in susceptible populations is due to rare variants. But genomic analysis of hypertension, even those designed to focus on rare variants, has yielded marginal genome-wide results over many studies. Machine learning and other nonparametric statistical methods have recently been shown to uncover relationships in complex phenotypes, genotypes, and clinical data. We trained neural networks with phenotype data for missing-data imputation to increase the usable size of a clinical data set. Validity was established by showing performance effects using the expanded data set for the association of phenotype variables with case/control status of patients. Data mining classification tools were used to generate association rules. |
format | Online Article Text |
id | pubmed-4862746 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-48627462016-05-19 Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study Seffens, William Evans, Chad Taylor, Herman Bioinform Biol Insights Original Research Health-care initiatives are pushing the development and utilization of clinical data for medical discovery and translational research studies. Machine learning tools implemented for Big Data have been applied to detect patterns in complex diseases. This study focuses on hypertension and examines phenotype data across a major clinical study called Minority Health Genomics and Translational Research Repository Database composed of self-reported African American (AA) participants combined with related cohorts. Prior genome-wide association studies for hypertension in AAs presumed that an increase of disease burden in susceptible populations is due to rare variants. But genomic analysis of hypertension, even those designed to focus on rare variants, has yielded marginal genome-wide results over many studies. Machine learning and other nonparametric statistical methods have recently been shown to uncover relationships in complex phenotypes, genotypes, and clinical data. We trained neural networks with phenotype data for missing-data imputation to increase the usable size of a clinical data set. Validity was established by showing performance effects using the expanded data set for the association of phenotype variables with case/control status of patients. Data mining classification tools were used to generate association rules. Libertas Academica 2016-05-09 /pmc/articles/PMC4862746/ /pubmed/27199552 http://dx.doi.org/10.4137/BBI.S29473 Text en © 2015 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license. |
spellingShingle | Original Research Seffens, William Evans, Chad Taylor, Herman Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study |
title | Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study |
title_full | Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study |
title_fullStr | Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study |
title_full_unstemmed | Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study |
title_short | Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study |
title_sort | machine learning data imputation and classification in a multicohort hypertension clinical study |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862746/ https://www.ncbi.nlm.nih.gov/pubmed/27199552 http://dx.doi.org/10.4137/BBI.S29473 |
work_keys_str_mv | AT seffenswilliam machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy AT evanschad machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy AT machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy AT taylorherman machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy |