Cargando…

A machine learning case–control classifier for schizophrenia based on DNA methylation in blood

Epigenetic dysregulation is thought to contribute to the etiology of schizophrenia (SZ), but the cell type-specificity of DNA methylation makes population-based epigenetic studies of SZ challenging. To train an SZ case–control classifier based on DNA methylation in blood, therefore, we focused on hu...

Descripción completa

Detalles Bibliográficos
Autores principales: Gunasekara, Chathura J., Hannon, Eilis, MacKay, Harry, Coarfa, Cristian, McQuillin, Andrew, Clair, David St., Mill, Jonathan, Waterland, Robert A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8329061/
https://www.ncbi.nlm.nih.gov/pubmed/34341337
http://dx.doi.org/10.1038/s41398-021-01496-3
_version_ 1783732420381507584
author Gunasekara, Chathura J.
Hannon, Eilis
MacKay, Harry
Coarfa, Cristian
McQuillin, Andrew
Clair, David St.
Mill, Jonathan
Waterland, Robert A.
author_facet Gunasekara, Chathura J.
Hannon, Eilis
MacKay, Harry
Coarfa, Cristian
McQuillin, Andrew
Clair, David St.
Mill, Jonathan
Waterland, Robert A.
author_sort Gunasekara, Chathura J.
collection PubMed
description Epigenetic dysregulation is thought to contribute to the etiology of schizophrenia (SZ), but the cell type-specificity of DNA methylation makes population-based epigenetic studies of SZ challenging. To train an SZ case–control classifier based on DNA methylation in blood, therefore, we focused on human genomic regions of systemic interindividual epigenetic variation (CoRSIVs), a subset of which are represented on the Illumina Human Methylation 450K (HM450) array. HM450 DNA methylation data on whole blood of 414 SZ cases and 433 non-psychiatric controls were used as training data for a classification algorithm with built-in feature selection, sparse partial least squares discriminate analysis (SPLS-DA); application of SPLS-DA to HM450 data has not been previously reported. Using the first two SPLS-DA dimensions we calculated a “risk distance” to identify individuals with the highest probability of SZ. The model was then evaluated on an independent HM450 data set on 353 SZ cases and 322 non-psychiatric controls. Our CoRSIV-based model classified 303 individuals as cases with a positive predictive value (PPV) of 80%, far surpassing the performance of a model based on polygenic risk score (PRS). Importantly, risk distance (based on CoRSIV methylation) was not associated with medication use, arguing against reverse causality. Risk distance and PRS were positively correlated (Pearson r = 0.28, P = 1.28 × 10(−12)), and mediational analysis suggested that genetic effects on SZ are partially mediated by altered methylation at CoRSIVs. Our results indicate two innate dimensions of SZ risk: one based on genetic, and the other on systemic epigenetic variants.
format Online
Article
Text
id pubmed-8329061
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-83290612021-08-19 A machine learning case–control classifier for schizophrenia based on DNA methylation in blood Gunasekara, Chathura J. Hannon, Eilis MacKay, Harry Coarfa, Cristian McQuillin, Andrew Clair, David St. Mill, Jonathan Waterland, Robert A. Transl Psychiatry Article Epigenetic dysregulation is thought to contribute to the etiology of schizophrenia (SZ), but the cell type-specificity of DNA methylation makes population-based epigenetic studies of SZ challenging. To train an SZ case–control classifier based on DNA methylation in blood, therefore, we focused on human genomic regions of systemic interindividual epigenetic variation (CoRSIVs), a subset of which are represented on the Illumina Human Methylation 450K (HM450) array. HM450 DNA methylation data on whole blood of 414 SZ cases and 433 non-psychiatric controls were used as training data for a classification algorithm with built-in feature selection, sparse partial least squares discriminate analysis (SPLS-DA); application of SPLS-DA to HM450 data has not been previously reported. Using the first two SPLS-DA dimensions we calculated a “risk distance” to identify individuals with the highest probability of SZ. The model was then evaluated on an independent HM450 data set on 353 SZ cases and 322 non-psychiatric controls. Our CoRSIV-based model classified 303 individuals as cases with a positive predictive value (PPV) of 80%, far surpassing the performance of a model based on polygenic risk score (PRS). Importantly, risk distance (based on CoRSIV methylation) was not associated with medication use, arguing against reverse causality. Risk distance and PRS were positively correlated (Pearson r = 0.28, P = 1.28 × 10(−12)), and mediational analysis suggested that genetic effects on SZ are partially mediated by altered methylation at CoRSIVs. Our results indicate two innate dimensions of SZ risk: one based on genetic, and the other on systemic epigenetic variants. Nature Publishing Group UK 2021-08-03 /pmc/articles/PMC8329061/ /pubmed/34341337 http://dx.doi.org/10.1038/s41398-021-01496-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Gunasekara, Chathura J.
Hannon, Eilis
MacKay, Harry
Coarfa, Cristian
McQuillin, Andrew
Clair, David St.
Mill, Jonathan
Waterland, Robert A.
A machine learning case–control classifier for schizophrenia based on DNA methylation in blood
title A machine learning case–control classifier for schizophrenia based on DNA methylation in blood
title_full A machine learning case–control classifier for schizophrenia based on DNA methylation in blood
title_fullStr A machine learning case–control classifier for schizophrenia based on DNA methylation in blood
title_full_unstemmed A machine learning case–control classifier for schizophrenia based on DNA methylation in blood
title_short A machine learning case–control classifier for schizophrenia based on DNA methylation in blood
title_sort machine learning case–control classifier for schizophrenia based on dna methylation in blood
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8329061/
https://www.ncbi.nlm.nih.gov/pubmed/34341337
http://dx.doi.org/10.1038/s41398-021-01496-3
work_keys_str_mv AT gunasekarachathuraj amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT hannoneilis amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT mackayharry amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT coarfacristian amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT mcquillinandrew amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT clairdavidst amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT milljonathan amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT waterlandroberta amachinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT gunasekarachathuraj machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT hannoneilis machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT mackayharry machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT coarfacristian machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT mcquillinandrew machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT clairdavidst machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT milljonathan machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood
AT waterlandroberta machinelearningcasecontrolclassifierforschizophreniabasedondnamethylationinblood