Cargando…

Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers

BACKGROUND: Environmental Enteropathy (EE) is a subclinical condition caused by constant fecal-oral contamination and resulting in blunting of intestinal villi and intestinal inflammation. Of primary interest in the clinical research is to evaluate the association between non-invasive EE biomarkers...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Miao, Zhou, Jianhui, Naylor, Caitlin, Kirkpatrick, Beth D., Haque, Rashidul, Petri, William A., Ma, Jennie Z.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345248/
https://www.ncbi.nlm.nih.gov/pubmed/28293424
http://dx.doi.org/10.1186/s40364-017-0089-4
_version_ 1782513678617149440
author Lu, Miao
Zhou, Jianhui
Naylor, Caitlin
Kirkpatrick, Beth D.
Haque, Rashidul
Petri, William A.
Ma, Jennie Z.
author_facet Lu, Miao
Zhou, Jianhui
Naylor, Caitlin
Kirkpatrick, Beth D.
Haque, Rashidul
Petri, William A.
Ma, Jennie Z.
author_sort Lu, Miao
collection PubMed
description BACKGROUND: Environmental Enteropathy (EE) is a subclinical condition caused by constant fecal-oral contamination and resulting in blunting of intestinal villi and intestinal inflammation. Of primary interest in the clinical research is to evaluate the association between non-invasive EE biomarkers and malnutrition in a cohort of Bangladeshi children. The challenges are that the number of biomarkers/covariates is relatively large, and some of them are highly correlated. METHODS: Many variable selection methods are available in the literature, but which are most appropriate for EE biomarker selection remains unclear. In this study, different variable selection approaches were applied and the performance of these methods was assessed numerically through simulation studies, assuming the correlations among covariates were similar to those in the Bangladesh cohort. The suggested methods from simulations were applied to the Bangladesh cohort to select the most relevant biomarkers for the growth response, and bootstrapping methods were used to evaluate the consistency of selection results. RESULTS: Through simulation studies, SCAD (Smoothly Clipped Absolute Deviation), Adaptive LASSO (Least Absolute Shrinkage and Selection Operator) and MCP (Minimax Concave Penalty) are the suggested variable selection methods, compared to traditional stepwise regression method. In the Bangladesh data, predictors such as mother weight, height-for-age z-score (HAZ) at week 18, and inflammation markers (Myeloperoxidase (MPO) at week 12 and soluable CD14 at week 18) are informative biomarkers associated with children’s growth. CONCLUSIONS: Penalized linear regression methods are plausible alternatives to traditional variable selection methods, and the suggested methods are applicable to other biomedical studies. The selected early-stage biomarkers offer a potential explanation for the burden of malnutrition problems in low-income countries, allow early identification of infants at risk, and suggest pathways for intervention. TRIAL REGISTRATION: This study was retrospectively registered with ClinicalTrials.gov, number NCT01375647, on June 3, 2011.
format Online
Article
Text
id pubmed-5345248
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53452482017-03-14 Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers Lu, Miao Zhou, Jianhui Naylor, Caitlin Kirkpatrick, Beth D. Haque, Rashidul Petri, William A. Ma, Jennie Z. Biomark Res Research BACKGROUND: Environmental Enteropathy (EE) is a subclinical condition caused by constant fecal-oral contamination and resulting in blunting of intestinal villi and intestinal inflammation. Of primary interest in the clinical research is to evaluate the association between non-invasive EE biomarkers and malnutrition in a cohort of Bangladeshi children. The challenges are that the number of biomarkers/covariates is relatively large, and some of them are highly correlated. METHODS: Many variable selection methods are available in the literature, but which are most appropriate for EE biomarker selection remains unclear. In this study, different variable selection approaches were applied and the performance of these methods was assessed numerically through simulation studies, assuming the correlations among covariates were similar to those in the Bangladesh cohort. The suggested methods from simulations were applied to the Bangladesh cohort to select the most relevant biomarkers for the growth response, and bootstrapping methods were used to evaluate the consistency of selection results. RESULTS: Through simulation studies, SCAD (Smoothly Clipped Absolute Deviation), Adaptive LASSO (Least Absolute Shrinkage and Selection Operator) and MCP (Minimax Concave Penalty) are the suggested variable selection methods, compared to traditional stepwise regression method. In the Bangladesh data, predictors such as mother weight, height-for-age z-score (HAZ) at week 18, and inflammation markers (Myeloperoxidase (MPO) at week 12 and soluable CD14 at week 18) are informative biomarkers associated with children’s growth. CONCLUSIONS: Penalized linear regression methods are plausible alternatives to traditional variable selection methods, and the suggested methods are applicable to other biomedical studies. The selected early-stage biomarkers offer a potential explanation for the burden of malnutrition problems in low-income countries, allow early identification of infants at risk, and suggest pathways for intervention. TRIAL REGISTRATION: This study was retrospectively registered with ClinicalTrials.gov, number NCT01375647, on June 3, 2011. BioMed Central 2017-03-09 /pmc/articles/PMC5345248/ /pubmed/28293424 http://dx.doi.org/10.1186/s40364-017-0089-4 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lu, Miao
Zhou, Jianhui
Naylor, Caitlin
Kirkpatrick, Beth D.
Haque, Rashidul
Petri, William A.
Ma, Jennie Z.
Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers
title Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers
title_full Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers
title_fullStr Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers
title_full_unstemmed Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers
title_short Application of penalized linear regression methods to the selection of environmental enteropathy biomarkers
title_sort application of penalized linear regression methods to the selection of environmental enteropathy biomarkers
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345248/
https://www.ncbi.nlm.nih.gov/pubmed/28293424
http://dx.doi.org/10.1186/s40364-017-0089-4
work_keys_str_mv AT lumiao applicationofpenalizedlinearregressionmethodstotheselectionofenvironmentalenteropathybiomarkers
AT zhoujianhui applicationofpenalizedlinearregressionmethodstotheselectionofenvironmentalenteropathybiomarkers
AT naylorcaitlin applicationofpenalizedlinearregressionmethodstotheselectionofenvironmentalenteropathybiomarkers
AT kirkpatrickbethd applicationofpenalizedlinearregressionmethodstotheselectionofenvironmentalenteropathybiomarkers
AT haquerashidul applicationofpenalizedlinearregressionmethodstotheselectionofenvironmentalenteropathybiomarkers
AT petriwilliama applicationofpenalizedlinearregressionmethodstotheselectionofenvironmentalenteropathybiomarkers
AT majenniez applicationofpenalizedlinearregressionmethodstotheselectionofenvironmentalenteropathybiomarkers