Cargando…
Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality
BACKGROUND: The effects of tobacco smoking on epigenome-wide methylation signatures in white blood cells (WBCs) collected from persons living with HIV may have important implications for their immune-related outcomes, including frailty and mortality. The application of a machine learning approach to...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6293604/ https://www.ncbi.nlm.nih.gov/pubmed/30545403 http://dx.doi.org/10.1186/s13148-018-0591-z |
_version_ | 1783380570896596992 |
---|---|
author | Zhang, Xinyu Hu, Ying Aouizerat, Bradley E. Peng, Gang Marconi, Vincent C. Corley, Michael J. Hulgan, Todd Bryant, Kendall J. Zhao, Hongyu Krystal, John H. Justice, Amy C. Xu, Ke |
author_facet | Zhang, Xinyu Hu, Ying Aouizerat, Bradley E. Peng, Gang Marconi, Vincent C. Corley, Michael J. Hulgan, Todd Bryant, Kendall J. Zhao, Hongyu Krystal, John H. Justice, Amy C. Xu, Ke |
author_sort | Zhang, Xinyu |
collection | PubMed |
description | BACKGROUND: The effects of tobacco smoking on epigenome-wide methylation signatures in white blood cells (WBCs) collected from persons living with HIV may have important implications for their immune-related outcomes, including frailty and mortality. The application of a machine learning approach to the analysis of CpG methylation in the epigenome enables the selection of phenotypically relevant features from high-dimensional data. Using this approach, we now report that a set of smoking-associated DNA-methylated CpGs predicts HIV prognosis and mortality in an HIV-positive veteran population. RESULTS: We first identified 137 epigenome-wide significant CpGs for smoking in WBCs from 1137 HIV-positive individuals (p < 1.70E−07). To examine whether smoking-associated CpGs were predictive of HIV frailty and mortality, we applied ensemble-based machine learning to build a model in a training sample employing 408,583 CpGs. A set of 698 CpGs was selected and predictive of high HIV frailty in a testing sample [(area under curve (AUC) = 0.73, 95%CI 0.63~0.83)] and was replicated in an independent sample [(AUC = 0.78, 95%CI 0.73~0.83)]. We further found an association of a DNA methylation index constructed from the 698 CpGs that were associated with a 5-year survival rate [HR = 1.46; 95%CI 1.06~2.02, p = 0.02]. Interestingly, the 698 CpGs located on 445 genes were enriched on the integrin signaling pathway (p = 9.55E−05, false discovery rate = 0.036), which is responsible for the regulation of the cell cycle, differentiation, and adhesion. CONCLUSION: We demonstrated that smoking-associated DNA methylation features in white blood cells predict HIV infection-related clinical outcomes in a population living with HIV. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13148-018-0591-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6293604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62936042018-12-18 Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality Zhang, Xinyu Hu, Ying Aouizerat, Bradley E. Peng, Gang Marconi, Vincent C. Corley, Michael J. Hulgan, Todd Bryant, Kendall J. Zhao, Hongyu Krystal, John H. Justice, Amy C. Xu, Ke Clin Epigenetics Research BACKGROUND: The effects of tobacco smoking on epigenome-wide methylation signatures in white blood cells (WBCs) collected from persons living with HIV may have important implications for their immune-related outcomes, including frailty and mortality. The application of a machine learning approach to the analysis of CpG methylation in the epigenome enables the selection of phenotypically relevant features from high-dimensional data. Using this approach, we now report that a set of smoking-associated DNA-methylated CpGs predicts HIV prognosis and mortality in an HIV-positive veteran population. RESULTS: We first identified 137 epigenome-wide significant CpGs for smoking in WBCs from 1137 HIV-positive individuals (p < 1.70E−07). To examine whether smoking-associated CpGs were predictive of HIV frailty and mortality, we applied ensemble-based machine learning to build a model in a training sample employing 408,583 CpGs. A set of 698 CpGs was selected and predictive of high HIV frailty in a testing sample [(area under curve (AUC) = 0.73, 95%CI 0.63~0.83)] and was replicated in an independent sample [(AUC = 0.78, 95%CI 0.73~0.83)]. We further found an association of a DNA methylation index constructed from the 698 CpGs that were associated with a 5-year survival rate [HR = 1.46; 95%CI 1.06~2.02, p = 0.02]. Interestingly, the 698 CpGs located on 445 genes were enriched on the integrin signaling pathway (p = 9.55E−05, false discovery rate = 0.036), which is responsible for the regulation of the cell cycle, differentiation, and adhesion. CONCLUSION: We demonstrated that smoking-associated DNA methylation features in white blood cells predict HIV infection-related clinical outcomes in a population living with HIV. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13148-018-0591-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-13 /pmc/articles/PMC6293604/ /pubmed/30545403 http://dx.doi.org/10.1186/s13148-018-0591-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zhang, Xinyu Hu, Ying Aouizerat, Bradley E. Peng, Gang Marconi, Vincent C. Corley, Michael J. Hulgan, Todd Bryant, Kendall J. Zhao, Hongyu Krystal, John H. Justice, Amy C. Xu, Ke Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality |
title | Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality |
title_full | Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality |
title_fullStr | Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality |
title_full_unstemmed | Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality |
title_short | Machine learning selected smoking-associated DNA methylation signatures that predict HIV prognosis and mortality |
title_sort | machine learning selected smoking-associated dna methylation signatures that predict hiv prognosis and mortality |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6293604/ https://www.ncbi.nlm.nih.gov/pubmed/30545403 http://dx.doi.org/10.1186/s13148-018-0591-z |
work_keys_str_mv | AT zhangxinyu machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT huying machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT aouizeratbradleye machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT penggang machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT marconivincentc machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT corleymichaelj machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT hulgantodd machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT bryantkendallj machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT zhaohongyu machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT krystaljohnh machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT justiceamyc machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality AT xuke machinelearningselectedsmokingassociateddnamethylationsignaturesthatpredicthivprognosisandmortality |