Cargando…

Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study

Health-care initiatives are pushing the development and utilization of clinical data for medical discovery and translational research studies. Machine learning tools implemented for Big Data have been applied to detect patterns in complex diseases. This study focuses on hypertension and examines phe...

Descripción completa

Detalles Bibliográficos
Autores principales: Seffens, William, Evans, Chad, Taylor, Herman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862746/
https://www.ncbi.nlm.nih.gov/pubmed/27199552
http://dx.doi.org/10.4137/BBI.S29473
_version_ 1782431382923902976
author Seffens, William
Evans, Chad
Taylor, Herman
author_facet Seffens, William
Evans, Chad
Taylor, Herman
author_sort Seffens, William
collection PubMed
description Health-care initiatives are pushing the development and utilization of clinical data for medical discovery and translational research studies. Machine learning tools implemented for Big Data have been applied to detect patterns in complex diseases. This study focuses on hypertension and examines phenotype data across a major clinical study called Minority Health Genomics and Translational Research Repository Database composed of self-reported African American (AA) participants combined with related cohorts. Prior genome-wide association studies for hypertension in AAs presumed that an increase of disease burden in susceptible populations is due to rare variants. But genomic analysis of hypertension, even those designed to focus on rare variants, has yielded marginal genome-wide results over many studies. Machine learning and other nonparametric statistical methods have recently been shown to uncover relationships in complex phenotypes, genotypes, and clinical data. We trained neural networks with phenotype data for missing-data imputation to increase the usable size of a clinical data set. Validity was established by showing performance effects using the expanded data set for the association of phenotype variables with case/control status of patients. Data mining classification tools were used to generate association rules.
format Online
Article
Text
id pubmed-4862746
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-48627462016-05-19 Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study Seffens, William Evans, Chad Taylor, Herman Bioinform Biol Insights Original Research Health-care initiatives are pushing the development and utilization of clinical data for medical discovery and translational research studies. Machine learning tools implemented for Big Data have been applied to detect patterns in complex diseases. This study focuses on hypertension and examines phenotype data across a major clinical study called Minority Health Genomics and Translational Research Repository Database composed of self-reported African American (AA) participants combined with related cohorts. Prior genome-wide association studies for hypertension in AAs presumed that an increase of disease burden in susceptible populations is due to rare variants. But genomic analysis of hypertension, even those designed to focus on rare variants, has yielded marginal genome-wide results over many studies. Machine learning and other nonparametric statistical methods have recently been shown to uncover relationships in complex phenotypes, genotypes, and clinical data. We trained neural networks with phenotype data for missing-data imputation to increase the usable size of a clinical data set. Validity was established by showing performance effects using the expanded data set for the association of phenotype variables with case/control status of patients. Data mining classification tools were used to generate association rules. Libertas Academica 2016-05-09 /pmc/articles/PMC4862746/ /pubmed/27199552 http://dx.doi.org/10.4137/BBI.S29473 Text en © 2015 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license.
spellingShingle Original Research
Seffens, William
Evans, Chad
Taylor, Herman
Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study
title Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study
title_full Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study
title_fullStr Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study
title_full_unstemmed Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study
title_short Machine Learning Data Imputation and Classification in a Multicohort Hypertension Clinical Study
title_sort machine learning data imputation and classification in a multicohort hypertension clinical study
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862746/
https://www.ncbi.nlm.nih.gov/pubmed/27199552
http://dx.doi.org/10.4137/BBI.S29473
work_keys_str_mv AT seffenswilliam machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy
AT evanschad machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy
AT machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy
AT taylorherman machinelearningdataimputationandclassificationinamulticohorthypertensionclinicalstudy