Cargando…
A machine learning case–control classifier for schizophrenia based on DNA methylation in blood
Epigenetic dysregulation is thought to contribute to the etiology of schizophrenia (SZ), but the cell type-specificity of DNA methylation makes population-based epigenetic studies of SZ challenging. To train an SZ case–control classifier based on DNA methylation in blood, therefore, we focused on hu...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8329061/ https://www.ncbi.nlm.nih.gov/pubmed/34341337 http://dx.doi.org/10.1038/s41398-021-01496-3 |
_version_ | 1783732420381507584 |
---|---|
author | Gunasekara, Chathura J. Hannon, Eilis MacKay, Harry Coarfa, Cristian McQuillin, Andrew Clair, David St. Mill, Jonathan Waterland, Robert A. |
author_facet | Gunasekara, Chathura J. Hannon, Eilis MacKay, Harry Coarfa, Cristian McQuillin, Andrew Clair, David St. Mill, Jonathan Waterland, Robert A. |
author_sort | Gunasekara, Chathura J. |
collection | PubMed |
description | Epigenetic dysregulation is thought to contribute to the etiology of schizophrenia (SZ), but the cell type-specificity of DNA methylation makes population-based epigenetic studies of SZ challenging. To train an SZ case–control classifier based on DNA methylation in blood, therefore, we focused on human genomic regions of systemic interindividual epigenetic variation (CoRSIVs), a subset of which are represented on the Illumina Human Methylation 450K (HM450) array. HM450 DNA methylation data on whole blood of 414 SZ cases and 433 non-psychiatric controls were used as training data for a classification algorithm with built-in feature selection, sparse partial least squares discriminate analysis (SPLS-DA); application of SPLS-DA to HM450 data has not been previously reported. Using the first two SPLS-DA dimensions we calculated a “risk distance” to identify individuals with the highest probability of SZ. The model was then evaluated on an independent HM450 data set on 353 SZ cases and 322 non-psychiatric controls. Our CoRSIV-based model classified 303 individuals as cases with a positive predictive value (PPV) of 80%, far surpassing the performance of a model based on polygenic risk score (PRS). Importantly, risk distance (based on CoRSIV methylation) was not associated with medication use, arguing against reverse causality. Risk distance and PRS were positively correlated (Pearson r = 0.28, P = 1.28 × 10(−12)), and mediational analysis suggested that genetic effects on SZ are partially mediated by altered methylation at CoRSIVs. Our results indicate two innate dimensions of SZ risk: one based on genetic, and the other on systemic epigenetic variants. |
format | Online Article Text |
id | pubmed-8329061 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-83290612021-08-19 A machine learning case–control classifier for schizophrenia based on DNA methylation in blood Gunasekara, Chathura J. Hannon, Eilis MacKay, Harry Coarfa, Cristian McQuillin, Andrew Clair, David St. Mill, Jonathan Waterland, Robert A. Transl Psychiatry Article Epigenetic dysregulation is thought to contribute to the etiology of schizophrenia (SZ), but the cell type-specificity of DNA methylation makes population-based epigenetic studies of SZ challenging. To train an SZ case–control classifier based on DNA methylation in blood, therefore, we focused on human genomic regions of systemic interindividual epigenetic variation (CoRSIVs), a subset of which are represented on the Illumina Human Methylation 450K (HM450) array. HM450 DNA methylation data on whole blood of 414 SZ cases and 433 non-psychiatric controls were used as training data for a classification algorithm with built-in feature selection, sparse partial least squares discriminate analysis (SPLS-DA); application of SPLS-DA to HM450 data has not been previously reported. Using the first two SPLS-DA dimensions we calculated a “risk distance” to identify individuals with the highest probability of SZ. The model was then evaluated on an independent HM450 data set on 353 SZ cases and 322 non-psychiatric controls. Our CoRSIV-based model classified 303 individuals as cases with a positive predictive value (PPV) of 80%, far surpassing the performance of a model based on polygenic risk score (PRS). Importantly, risk distance (based on CoRSIV methylation) was not associated with medication use, arguing against reverse causality. Risk distance and PRS were positively correlated (Pearson r = 0.28, P = 1.28 × 10(−12)), and mediational analysis suggested that genetic effects on SZ are partially mediated by altered methylation at CoRSIVs. Our results indicate two innate dimensions of SZ risk: one based on genetic, and the other on systemic epigenetic variants. Nature Publishing Group UK 2021-08-03 /pmc/articles/PMC8329061/ /pubmed/34341337 http://dx.doi.org/10.1038/s41398-021-01496-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Gunasekara, Chathura J. Hannon, Eilis MacKay, Harry Coarfa, Cristian McQuillin, Andrew Clair, David St. Mill, Jonathan Waterland, Robert A. A machine learning case–control classifier for schizophrenia based on DNA methylation in blood |
title | A machine learning case–control classifier for schizophrenia based on DNA methylation in blood |
title_full | A machine learning case–control classifier for schizophrenia based on DNA methylation in blood |
title_fullStr | A machine learning case–control classifier for schizophrenia based on DNA methylation in blood |
title_full_unstemmed | A machine learning case–control classifier for schizophrenia based on DNA methylation in blood |
title_short | A machine learning case–control classifier for schizophrenia based on DNA methylation in blood |
title_sort | machine learning case–control classifier for schizophrenia based on dna methylation in blood |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8329061/ https://www.ncbi.nlm.nih.gov/pubmed/34341337 http://dx.doi.org/10.1038/s41398-021-01496-3 |
work_keys_str_mv | AT gunasekarachathuraj amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT hannoneilis amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT mackayharry amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT coarfacristian amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT mcquillinandrew amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT clairdavidst amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT milljonathan amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT waterlandroberta amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT gunasekarachathuraj machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT hannoneilis machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT mackayharry machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT coarfacristian machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT mcquillinandrew machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT clairdavidst machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT milljonathan machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood AT waterlandroberta machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood |