Cargando…

Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension

Logistic regression is usually applied to investigate the association between inherited genetic variants and a binary disease phenotype. A limitation of standard methods used to estimate the parameters of logistic regression models is their strong dependence on a few observations deviating from the...

Descripción completa

Detalles Bibliográficos
Autores principales: Kesselmeier, Miriam, Legrand, Carine, Peil, Barbara, Kabisch, Maria, Fischer, Christine, Hamann, Ute, Lorenzo Bermejo, Justo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143696/
https://www.ncbi.nlm.nih.gov/pubmed/25519338
http://dx.doi.org/10.1186/1753-6561-8-S1-S65
_version_ 1782331941224185856
author Kesselmeier, Miriam
Legrand, Carine
Peil, Barbara
Kabisch, Maria
Fischer, Christine
Hamann, Ute
Lorenzo Bermejo, Justo
author_facet Kesselmeier, Miriam
Legrand, Carine
Peil, Barbara
Kabisch, Maria
Fischer, Christine
Hamann, Ute
Lorenzo Bermejo, Justo
author_sort Kesselmeier, Miriam
collection PubMed
description Logistic regression is usually applied to investigate the association between inherited genetic variants and a binary disease phenotype. A limitation of standard methods used to estimate the parameters of logistic regression models is their strong dependence on a few observations deviating from the majority of the data. We used data from the Genetic Analysis Workshop 18 to explore the possible benefit of robust logistic regression to estimate the genetic risk of hypertension. The comparison between standard and robust methods relied on the influence of departing hypertension profiles (outliers) on the estimated odds ratios, areas under the receiver operating characteristic curves, and clinical net benefit. Our results confirmed that single outliers may substantially affect the estimated genotype relative risks. The ranking of variants by probability values was different in standard and in robust logistic regression. For cutoff probabilities between 0.2 and 0.6, the clinical net benefit estimated by leave-one-out cross-validation in the investigated sample was slightly larger under robust regression, but the overall area under the receiver operating characteristic curve was larger for standard logistic regression. The potential advantage of robust statistics in the context of genetic association studies should be investigated in future analyses based on real and simulated data.
format Online
Article
Text
id pubmed-4143696
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41436962014-09-02 Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension Kesselmeier, Miriam Legrand, Carine Peil, Barbara Kabisch, Maria Fischer, Christine Hamann, Ute Lorenzo Bermejo, Justo BMC Proc Proceedings Logistic regression is usually applied to investigate the association between inherited genetic variants and a binary disease phenotype. A limitation of standard methods used to estimate the parameters of logistic regression models is their strong dependence on a few observations deviating from the majority of the data. We used data from the Genetic Analysis Workshop 18 to explore the possible benefit of robust logistic regression to estimate the genetic risk of hypertension. The comparison between standard and robust methods relied on the influence of departing hypertension profiles (outliers) on the estimated odds ratios, areas under the receiver operating characteristic curves, and clinical net benefit. Our results confirmed that single outliers may substantially affect the estimated genotype relative risks. The ranking of variants by probability values was different in standard and in robust logistic regression. For cutoff probabilities between 0.2 and 0.6, the clinical net benefit estimated by leave-one-out cross-validation in the investigated sample was slightly larger under robust regression, but the overall area under the receiver operating characteristic curve was larger for standard logistic regression. The potential advantage of robust statistics in the context of genetic association studies should be investigated in future analyses based on real and simulated data. BioMed Central 2014-06-17 /pmc/articles/PMC4143696/ /pubmed/25519338 http://dx.doi.org/10.1186/1753-6561-8-S1-S65 Text en Copyright © 2014 Kesselmeier et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Kesselmeier, Miriam
Legrand, Carine
Peil, Barbara
Kabisch, Maria
Fischer, Christine
Hamann, Ute
Lorenzo Bermejo, Justo
Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
title Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
title_full Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
title_fullStr Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
title_full_unstemmed Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
title_short Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
title_sort practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143696/
https://www.ncbi.nlm.nih.gov/pubmed/25519338
http://dx.doi.org/10.1186/1753-6561-8-S1-S65
work_keys_str_mv AT kesselmeiermiriam practicalinvestigationoftheperformanceofrobustlogisticregressiontopredictthegeneticriskofhypertension
AT legrandcarine practicalinvestigationoftheperformanceofrobustlogisticregressiontopredictthegeneticriskofhypertension
AT peilbarbara practicalinvestigationoftheperformanceofrobustlogisticregressiontopredictthegeneticriskofhypertension
AT kabischmaria practicalinvestigationoftheperformanceofrobustlogisticregressiontopredictthegeneticriskofhypertension
AT fischerchristine practicalinvestigationoftheperformanceofrobustlogisticregressiontopredictthegeneticriskofhypertension
AT hamannute practicalinvestigationoftheperformanceofrobustlogisticregressiontopredictthegeneticriskofhypertension
AT lorenzobermejojusto practicalinvestigationoftheperformanceofrobustlogisticregressiontopredictthegeneticriskofhypertension