Cargando…

Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality

BACKGROUND: The effects of tobacco smoking on epigenome-wide methylation signatures in white blood cells (WBCs) collected from persons living with HIV may have important implications for their immune-related outcomes, including frailty and mortality. The application of a machine learning approach to...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xinyu, Hu, Ying, Aouizerat, Bradley E., Peng, Gang, Marconi, Vincent C., Corley, Michael J., Hulgan, Todd, Bryant, Kendall J., Zhao, Hongyu, Krystal, John H., Justice, Amy C., Xu, Ke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6293604/
https://www.ncbi.nlm.nih.gov/pubmed/30545403
http://dx.doi.org/10.1186/s13148-018-0591-z
_version_ 1783380570896596992
author Zhang, Xinyu
Hu, Ying
Aouizerat, Bradley E.
Peng, Gang
Marconi, Vincent C.
Corley, Michael J.
Hulgan, Todd
Bryant, Kendall J.
Zhao, Hongyu
Krystal, John H.
Justice, Amy C.
Xu, Ke
author_facet Zhang, Xinyu
Hu, Ying
Aouizerat, Bradley E.
Peng, Gang
Marconi, Vincent C.
Corley, Michael J.
Hulgan, Todd
Bryant, Kendall J.
Zhao, Hongyu
Krystal, John H.
Justice, Amy C.
Xu, Ke
author_sort Zhang, Xinyu
collection PubMed
description BACKGROUND: The effects of tobacco smoking on epigenome-wide methylation signatures in white blood cells (WBCs) collected from persons living with HIV may have important implications for their immune-related outcomes, including frailty and mortality. The application of a machine learning approach to the analysis of CpG methylation in the epigenome enables the selection of phenotypically relevant features from high-dimensional data. Using this approach, we now report that a set of smoking-associated DNA-methylated CpGs predicts HIV prognosis and mortality in an HIV-positive veteran population. RESULTS: We first identified 137 epigenome-wide significant CpGs for smoking in WBCs from 1137 HIV-positive individuals (p < 1.70E−07). To examine whether smoking-associated CpGs were predictive of HIV frailty and mortality, we applied ensemble-based machine learning to build a model in a training sample employing 408,583 CpGs. A set of 698 CpGs was selected and predictive of high HIV frailty in a testing sample [(area under curve (AUC) = 0.73, 95%CI 0.63~0.83)] and was replicated in an independent sample [(AUC = 0.78, 95%CI 0.73~0.83)]. We further found an association of a DNA methylation index constructed from the 698 CpGs that were associated with a 5-year survival rate [HR = 1.46; 95%CI 1.06~2.02, p = 0.02]. Interestingly, the 698 CpGs located on 445 genes were enriched on the integrin signaling pathway (p = 9.55E−05, false discovery rate = 0.036), which is responsible for the regulation of the cell cycle, differentiation, and adhesion. CONCLUSION: We demonstrated that smoking-associated DNA methylation features in white blood cells predict HIV infection-related clinical outcomes in a population living with HIV. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13148-018-0591-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6293604
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62936042018-12-18 Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality Zhang, Xinyu Hu, Ying Aouizerat, Bradley E. Peng, Gang Marconi, Vincent C. Corley, Michael J. Hulgan, Todd Bryant, Kendall J. Zhao, Hongyu Krystal, John H. Justice, Amy C. Xu, Ke Clin Epigenetics Research BACKGROUND: The effects of tobacco smoking on epigenome-wide methylation signatures in white blood cells (WBCs) collected from persons living with HIV may have important implications for their immune-related outcomes, including frailty and mortality. The application of a machine learning approach to the analysis of CpG methylation in the epigenome enables the selection of phenotypically relevant features from high-dimensional data. Using this approach, we now report that a set of smoking-associated DNA-methylated CpGs predicts HIV prognosis and mortality in an HIV-positive veteran population. RESULTS: We first identified 137 epigenome-wide significant CpGs for smoking in WBCs from 1137 HIV-positive individuals (p < 1.70E−07). To examine whether smoking-associated CpGs were predictive of HIV frailty and mortality, we applied ensemble-based machine learning to build a model in a training sample employing 408,583 CpGs. A set of 698 CpGs was selected and predictive of high HIV frailty in a testing sample [(area under curve (AUC) = 0.73, 95%CI 0.63~0.83)] and was replicated in an independent sample [(AUC = 0.78, 95%CI 0.73~0.83)]. We further found an association of a DNA methylation index constructed from the 698 CpGs that were associated with a 5-year survival rate [HR = 1.46; 95%CI 1.06~2.02, p = 0.02]. Interestingly, the 698 CpGs located on 445 genes were enriched on the integrin signaling pathway (p = 9.55E−05, false discovery rate = 0.036), which is responsible for the regulation of the cell cycle, differentiation, and adhesion. CONCLUSION: We demonstrated that smoking-associated DNA methylation features in white blood cells predict HIV infection-related clinical outcomes in a population living with HIV. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13148-018-0591-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-13 /pmc/articles/PMC6293604/ /pubmed/30545403 http://dx.doi.org/10.1186/s13148-018-0591-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhang, Xinyu
Hu, Ying
Aouizerat, Bradley E.
Peng, Gang
Marconi, Vincent C.
Corley, Michael J.
Hulgan, Todd
Bryant, Kendall J.
Zhao, Hongyu
Krystal, John H.
Justice, Amy C.
Xu, Ke
Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality
title Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality
title_full Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality
title_fullStr Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality
title_full_unstemmed Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality
title_short Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality
title_sort machine learning selected smoking-associated dna methylation signatures that predict hiv prognosis and mortality
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6293604/
https://www.ncbi.nlm.nih.gov/pubmed/30545403
http://dx.doi.org/10.1186/s13148-018-0591-z
work_keys_str_mv AT zhangxinyu machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT huying machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT aouizeratbradleye machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT penggang machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT marconivincentc machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT corleymichaelj machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT hulgantodd machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT bryantkendallj machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT zhaohongyu machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT krystaljohnh machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT justiceamyc machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality
AT xuke machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality