Cargando…

Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions

Multiple sclerosis (MS) is the most common neurological disorder in young adults. Despite extensive studies, only a fraction of MS heritability has been explained, with association studies focusing primarily on protein-coding genes, essentially for the difficulty of interpreting non-coding features....

Descripción completa

Detalles Bibliográficos
Autores principales: Paraboschi, Elvezia Maria, Cardamone, Giulia, Soldà, Giulia, Duga, Stefano, Asselta, Rosanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6304422/
https://www.ncbi.nlm.nih.gov/pubmed/30619471
http://dx.doi.org/10.3389/fgene.2018.00647
_version_ 1783382356592164864
author Paraboschi, Elvezia Maria
Cardamone, Giulia
Soldà, Giulia
Duga, Stefano
Asselta, Rosanna
author_facet Paraboschi, Elvezia Maria
Cardamone, Giulia
Soldà, Giulia
Duga, Stefano
Asselta, Rosanna
author_sort Paraboschi, Elvezia Maria
collection PubMed
description Multiple sclerosis (MS) is the most common neurological disorder in young adults. Despite extensive studies, only a fraction of MS heritability has been explained, with association studies focusing primarily on protein-coding genes, essentially for the difficulty of interpreting non-coding features. However, non-coding RNAs (ncRNAs) and functional elements, such as super-enhancers (SE), are crucial regulators of many pathways and cellular mechanisms, and they have been implicated in a growing number of diseases. In this work, we searched for possible enrichments in non-coding elements at MS genome-wide associated loci, with the aim to highlight their possible involvement in the susceptibility to the disease. We first reconstructed the linkage disequilibrium (LD) structure of the Italian population using data of 727,478 single-nucleotide polymorphisms (SNPs) from 1,668 healthy individuals. The genomic coordinates of the obtained LD blocks were intersected with those of the top hits identified in previously published MS genome-wide association studies (GWAS). By a bootstrapping approach, we hence demonstrated a striking enrichment of non-coding elements, especially of circular RNAs (circRNAs) mapping in the 73 LD blocks harboring MS-associated SNPs. In particular, we found a total of 482 circRNAs (annotated in publicly available databases) vs. a mean of 194 ± 65 in the random sets of LD blocks, using 1,000 iterations. As a proof of concept of a possible functional relevance of this observation, we experimentally verified that the expression levels of a circRNA derived from an MS-associated locus, i.e., hsa_circ_0043813 from the STAT3 gene, can be modulated by the three genotypes at the disease-associated SNP. Finally, by evaluating RNA-seq data of two cell lines, SH-SY5Y and Jurkat cells, representing tissues relevant for MS, we identified 18 (two novel) circRNAs derived from MS-associated genes. In conclusion, this work showed for the first time that MS-GWAS top hits map in LD blocks enriched in circRNAs, suggesting circRNAs as possible novel contributors to the disease pathogenesis.
format Online
Article
Text
id pubmed-6304422
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63044222019-01-07 Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions Paraboschi, Elvezia Maria Cardamone, Giulia Soldà, Giulia Duga, Stefano Asselta, Rosanna Front Genet Genetics Multiple sclerosis (MS) is the most common neurological disorder in young adults. Despite extensive studies, only a fraction of MS heritability has been explained, with association studies focusing primarily on protein-coding genes, essentially for the difficulty of interpreting non-coding features. However, non-coding RNAs (ncRNAs) and functional elements, such as super-enhancers (SE), are crucial regulators of many pathways and cellular mechanisms, and they have been implicated in a growing number of diseases. In this work, we searched for possible enrichments in non-coding elements at MS genome-wide associated loci, with the aim to highlight their possible involvement in the susceptibility to the disease. We first reconstructed the linkage disequilibrium (LD) structure of the Italian population using data of 727,478 single-nucleotide polymorphisms (SNPs) from 1,668 healthy individuals. The genomic coordinates of the obtained LD blocks were intersected with those of the top hits identified in previously published MS genome-wide association studies (GWAS). By a bootstrapping approach, we hence demonstrated a striking enrichment of non-coding elements, especially of circular RNAs (circRNAs) mapping in the 73 LD blocks harboring MS-associated SNPs. In particular, we found a total of 482 circRNAs (annotated in publicly available databases) vs. a mean of 194 ± 65 in the random sets of LD blocks, using 1,000 iterations. As a proof of concept of a possible functional relevance of this observation, we experimentally verified that the expression levels of a circRNA derived from an MS-associated locus, i.e., hsa_circ_0043813 from the STAT3 gene, can be modulated by the three genotypes at the disease-associated SNP. Finally, by evaluating RNA-seq data of two cell lines, SH-SY5Y and Jurkat cells, representing tissues relevant for MS, we identified 18 (two novel) circRNAs derived from MS-associated genes. In conclusion, this work showed for the first time that MS-GWAS top hits map in LD blocks enriched in circRNAs, suggesting circRNAs as possible novel contributors to the disease pathogenesis. Frontiers Media S.A. 2018-12-17 /pmc/articles/PMC6304422/ /pubmed/30619471 http://dx.doi.org/10.3389/fgene.2018.00647 Text en Copyright © 2018 Paraboschi, Cardamone, Soldà, Duga and Asselta. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Paraboschi, Elvezia Maria
Cardamone, Giulia
Soldà, Giulia
Duga, Stefano
Asselta, Rosanna
Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions
title Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions
title_full Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions
title_fullStr Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions
title_full_unstemmed Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions
title_short Interpreting Non-coding Genetic Variation in Multiple Sclerosis Genome-Wide Associated Regions
title_sort interpreting non-coding genetic variation in multiple sclerosis genome-wide associated regions
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6304422/
https://www.ncbi.nlm.nih.gov/pubmed/30619471
http://dx.doi.org/10.3389/fgene.2018.00647
work_keys_str_mv AT paraboschielveziamaria interpretingnoncodinggeneticvariationinmultiplesclerosisgenomewideassociatedregions
AT cardamonegiulia interpretingnoncodinggeneticvariationinmultiplesclerosisgenomewideassociatedregions
AT soldagiulia interpretingnoncodinggeneticvariationinmultiplesclerosisgenomewideassociatedregions
AT dugastefano interpretingnoncodinggeneticvariationinmultiplesclerosisgenomewideassociatedregions
AT asseltarosanna interpretingnoncodinggeneticvariationinmultiplesclerosisgenomewideassociatedregions