Cargando…

Identification of genetic variants associated with alternative splicing using sQTLseekeR

Identification of genetic variants affecting splicing in RNA sequencing population studies is still in its infancy. Splicing phenotype is more complex than gene expression and ought to be treated as a multivariate phenotype to be recapitulated completely. Here we represent the splicing pattern of a...

Descripción completa

Detalles Bibliográficos
Autores principales: Monlong, Jean, Calvo, Miquel, Ferreira, Pedro G., Guigó, Roderic
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Pub. Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143934/
https://www.ncbi.nlm.nih.gov/pubmed/25140736
http://dx.doi.org/10.1038/ncomms5698
_version_ 1782331986769084416
author Monlong, Jean
Calvo, Miquel
Ferreira, Pedro G.
Guigó, Roderic
author_facet Monlong, Jean
Calvo, Miquel
Ferreira, Pedro G.
Guigó, Roderic
author_sort Monlong, Jean
collection PubMed
description Identification of genetic variants affecting splicing in RNA sequencing population studies is still in its infancy. Splicing phenotype is more complex than gene expression and ought to be treated as a multivariate phenotype to be recapitulated completely. Here we represent the splicing pattern of a gene as the distribution of the relative abundances of a gene’s alternative transcript isoforms. We develop a statistical framework that uses a distance-based approach to compute the variability of splicing ratios across observations, and a non-parametric analogue to multivariate analysis of variance. We implement this approach in the R package sQTLseekeR and use it to analyze RNA-Seq data from the Geuvadis project in 465 individuals. We identify hundreds of single nucleotide polymorphisms (SNPs) as splicing QTLs (sQTLs), including some falling in genome-wide association study SNPs. By developing the appropriate metrics, we show that sQTLseekeR compares favorably with existing methods that rely on univariate approaches, predicting variants that behave as expected from mutations affecting splicing.
format Online
Article
Text
id pubmed-4143934
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Nature Pub. Group
record_format MEDLINE/PubMed
spelling pubmed-41439342014-09-03 Identification of genetic variants associated with alternative splicing using sQTLseekeR Monlong, Jean Calvo, Miquel Ferreira, Pedro G. Guigó, Roderic Nat Commun Article Identification of genetic variants affecting splicing in RNA sequencing population studies is still in its infancy. Splicing phenotype is more complex than gene expression and ought to be treated as a multivariate phenotype to be recapitulated completely. Here we represent the splicing pattern of a gene as the distribution of the relative abundances of a gene’s alternative transcript isoforms. We develop a statistical framework that uses a distance-based approach to compute the variability of splicing ratios across observations, and a non-parametric analogue to multivariate analysis of variance. We implement this approach in the R package sQTLseekeR and use it to analyze RNA-Seq data from the Geuvadis project in 465 individuals. We identify hundreds of single nucleotide polymorphisms (SNPs) as splicing QTLs (sQTLs), including some falling in genome-wide association study SNPs. By developing the appropriate metrics, we show that sQTLseekeR compares favorably with existing methods that rely on univariate approaches, predicting variants that behave as expected from mutations affecting splicing. Nature Pub. Group 2014-08-20 /pmc/articles/PMC4143934/ /pubmed/25140736 http://dx.doi.org/10.1038/ncomms5698 Text en Copyright © 2014, Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved. http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit Creative Commons Attribution-Noncommercial-No Derivative 4.0 license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
spellingShingle Article
Monlong, Jean
Calvo, Miquel
Ferreira, Pedro G.
Guigó, Roderic
Identification of genetic variants associated with alternative splicing using sQTLseekeR
title Identification of genetic variants associated with alternative splicing using sQTLseekeR
title_full Identification of genetic variants associated with alternative splicing using sQTLseekeR
title_fullStr Identification of genetic variants associated with alternative splicing using sQTLseekeR
title_full_unstemmed Identification of genetic variants associated with alternative splicing using sQTLseekeR
title_short Identification of genetic variants associated with alternative splicing using sQTLseekeR
title_sort identification of genetic variants associated with alternative splicing using sqtlseeker
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143934/
https://www.ncbi.nlm.nih.gov/pubmed/25140736
http://dx.doi.org/10.1038/ncomms5698
work_keys_str_mv AT monlongjean identificationofgeneticvariantsassociatedwithalternativesplicingusingsqtlseeker
AT calvomiquel identificationofgeneticvariantsassociatedwithalternativesplicingusingsqtlseeker
AT ferreirapedrog identificationofgeneticvariantsassociatedwithalternativesplicingusingsqtlseeker
AT guigoroderic identificationofgeneticvariantsassociatedwithalternativesplicingusingsqtlseeker