Cargando…
Novel risk genes for systemic lupus erythematosus predicted by random forest classification
Genome-wide association studies have identified risk loci for SLE, but a large proportion of the genetic contribution to SLE still remains unexplained. To detect novel risk genes, and to predict an individual’s SLE risk we designed a random forest classifier using SNP genotype data generated on the...
Autores principales: | , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524838/ https://www.ncbi.nlm.nih.gov/pubmed/28740209 http://dx.doi.org/10.1038/s41598-017-06516-1 |
_version_ | 1783252533057159168 |
---|---|
author | Almlöf, Jonas Carlsson Alexsson, Andrei Imgenberg-Kreuz, Juliana Sylwan, Lina Bäcklin, Christofer Leonard, Dag Nordmark, Gunnel Tandre, Karolina Eloranta, Maija-Leena Padyukov, Leonid Bengtsson, Christine Jönsen, Andreas Dahlqvist, Solbritt Rantapää Sjöwall, Christopher Bengtsson, Anders A. Gunnarsson, Iva Svenungsson, Elisabet Rönnblom, Lars Sandling, Johanna K. Syvänen, Ann-Christine |
author_facet | Almlöf, Jonas Carlsson Alexsson, Andrei Imgenberg-Kreuz, Juliana Sylwan, Lina Bäcklin, Christofer Leonard, Dag Nordmark, Gunnel Tandre, Karolina Eloranta, Maija-Leena Padyukov, Leonid Bengtsson, Christine Jönsen, Andreas Dahlqvist, Solbritt Rantapää Sjöwall, Christopher Bengtsson, Anders A. Gunnarsson, Iva Svenungsson, Elisabet Rönnblom, Lars Sandling, Johanna K. Syvänen, Ann-Christine |
author_sort | Almlöf, Jonas Carlsson |
collection | PubMed |
description | Genome-wide association studies have identified risk loci for SLE, but a large proportion of the genetic contribution to SLE still remains unexplained. To detect novel risk genes, and to predict an individual’s SLE risk we designed a random forest classifier using SNP genotype data generated on the “Immunochip” from 1,160 patients with SLE and 2,711 controls. Using gene importance scores defined by the random forest classifier, we identified 15 potential novel risk genes for SLE. Of them 12 are associated with other autoimmune diseases than SLE, whereas three genes (ZNF804A, CDK1, and MANF) have not previously been associated with autoimmunity. Random forest classification also allowed prediction of patients at risk for lupus nephritis with an area under the curve of 0.94. By allele-specific gene expression analysis we detected cis-regulatory SNPs that affect the expression levels of six of the top 40 genes designed by the random forest analysis, indicating a regulatory role for the identified risk variants. The 40 top genes from the prediction were overrepresented for differential expression in B and T cells according to RNA-sequencing of samples from five healthy donors, with more frequent over-expression in B cells compared to T cells. |
format | Online Article Text |
id | pubmed-5524838 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-55248382017-07-26 Novel risk genes for systemic lupus erythematosus predicted by random forest classification Almlöf, Jonas Carlsson Alexsson, Andrei Imgenberg-Kreuz, Juliana Sylwan, Lina Bäcklin, Christofer Leonard, Dag Nordmark, Gunnel Tandre, Karolina Eloranta, Maija-Leena Padyukov, Leonid Bengtsson, Christine Jönsen, Andreas Dahlqvist, Solbritt Rantapää Sjöwall, Christopher Bengtsson, Anders A. Gunnarsson, Iva Svenungsson, Elisabet Rönnblom, Lars Sandling, Johanna K. Syvänen, Ann-Christine Sci Rep Article Genome-wide association studies have identified risk loci for SLE, but a large proportion of the genetic contribution to SLE still remains unexplained. To detect novel risk genes, and to predict an individual’s SLE risk we designed a random forest classifier using SNP genotype data generated on the “Immunochip” from 1,160 patients with SLE and 2,711 controls. Using gene importance scores defined by the random forest classifier, we identified 15 potential novel risk genes for SLE. Of them 12 are associated with other autoimmune diseases than SLE, whereas three genes (ZNF804A, CDK1, and MANF) have not previously been associated with autoimmunity. Random forest classification also allowed prediction of patients at risk for lupus nephritis with an area under the curve of 0.94. By allele-specific gene expression analysis we detected cis-regulatory SNPs that affect the expression levels of six of the top 40 genes designed by the random forest analysis, indicating a regulatory role for the identified risk variants. The 40 top genes from the prediction were overrepresented for differential expression in B and T cells according to RNA-sequencing of samples from five healthy donors, with more frequent over-expression in B cells compared to T cells. Nature Publishing Group UK 2017-07-24 /pmc/articles/PMC5524838/ /pubmed/28740209 http://dx.doi.org/10.1038/s41598-017-06516-1 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Almlöf, Jonas Carlsson Alexsson, Andrei Imgenberg-Kreuz, Juliana Sylwan, Lina Bäcklin, Christofer Leonard, Dag Nordmark, Gunnel Tandre, Karolina Eloranta, Maija-Leena Padyukov, Leonid Bengtsson, Christine Jönsen, Andreas Dahlqvist, Solbritt Rantapää Sjöwall, Christopher Bengtsson, Anders A. Gunnarsson, Iva Svenungsson, Elisabet Rönnblom, Lars Sandling, Johanna K. Syvänen, Ann-Christine Novel risk genes for systemic lupus erythematosus predicted by random forest classification |
title | Novel risk genes for systemic lupus erythematosus predicted by random forest classification |
title_full | Novel risk genes for systemic lupus erythematosus predicted by random forest classification |
title_fullStr | Novel risk genes for systemic lupus erythematosus predicted by random forest classification |
title_full_unstemmed | Novel risk genes for systemic lupus erythematosus predicted by random forest classification |
title_short | Novel risk genes for systemic lupus erythematosus predicted by random forest classification |
title_sort | novel risk genes for systemic lupus erythematosus predicted by random forest classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524838/ https://www.ncbi.nlm.nih.gov/pubmed/28740209 http://dx.doi.org/10.1038/s41598-017-06516-1 |
work_keys_str_mv | AT almlofjonascarlsson novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT alexssonandrei novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT imgenbergkreuzjuliana novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT sylwanlina novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT backlinchristofer novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT leonarddag novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT nordmarkgunnel novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT tandrekarolina novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT elorantamaijaleena novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT padyukovleonid novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT bengtssonchristine novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT jonsenandreas novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT dahlqvistsolbrittrantapaa novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT sjowallchristopher novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT bengtssonandersa novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT gunnarssoniva novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT svenungssonelisabet novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT ronnblomlars novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT sandlingjohannak novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification AT syvanenannchristine novelriskgenesforsystemiclupuserythematosuspredictedbyrandomforestclassification |