Cargando…

Robust Linear Models for Cis-eQTL Analysis

Expression Quantitative Trait Loci (eQTL) analysis enables characterisation of functional genetic variation influencing expression levels of individual genes. In outbread populations, including humans, eQTLs are commonly analysed using the conventional linear model, adjusting for relevant covariates...

Descripción completa

Detalles Bibliográficos
Autores principales: Rantalainen, Mattias, Lindgren, Cecilia M., Holmes, Christopher C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4436354/
https://www.ncbi.nlm.nih.gov/pubmed/25992607
http://dx.doi.org/10.1371/journal.pone.0127882
_version_ 1782372064055787520
author Rantalainen, Mattias
Lindgren, Cecilia M.
Holmes, Christopher C.
author_facet Rantalainen, Mattias
Lindgren, Cecilia M.
Holmes, Christopher C.
author_sort Rantalainen, Mattias
collection PubMed
description Expression Quantitative Trait Loci (eQTL) analysis enables characterisation of functional genetic variation influencing expression levels of individual genes. In outbread populations, including humans, eQTLs are commonly analysed using the conventional linear model, adjusting for relevant covariates, assuming an allelic dosage model and a Gaussian error term. However, gene expression data generally have noise that induces heavy-tailed errors relative to the Gaussian distribution and often include atypical observations, or outliers. Such departures from modelling assumptions can lead to an increased rate of type II errors (false negatives), and to some extent also type I errors (false positives). Careful model checking can reduce the risk of type-I errors but often not type II errors, since it is generally too time-consuming to carefully check all models with a non-significant effect in large-scale and genome-wide studies. Here we propose the application of a robust linear model for eQTL analysis to reduce adverse effects of deviations from the assumption of Gaussian residuals. We present results from a simulation study as well as results from the analysis of real eQTL data sets. Our findings suggest that in many situations robust models have the potential to provide more reliable eQTL results compared to conventional linear models, particularly in respect to reducing type II errors due to non-Gaussian noise. Post-genomic data, such as that generated in genome-wide eQTL studies, are often noisy and frequently contain atypical observations. Robust statistical models have the potential to provide more reliable results and increased statistical power under non-Gaussian conditions. The results presented here suggest that robust models should be considered routinely alongside other commonly used methodologies for eQTL analysis.
format Online
Article
Text
id pubmed-4436354
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44363542015-05-27 Robust Linear Models for Cis-eQTL Analysis Rantalainen, Mattias Lindgren, Cecilia M. Holmes, Christopher C. PLoS One Research Article Expression Quantitative Trait Loci (eQTL) analysis enables characterisation of functional genetic variation influencing expression levels of individual genes. In outbread populations, including humans, eQTLs are commonly analysed using the conventional linear model, adjusting for relevant covariates, assuming an allelic dosage model and a Gaussian error term. However, gene expression data generally have noise that induces heavy-tailed errors relative to the Gaussian distribution and often include atypical observations, or outliers. Such departures from modelling assumptions can lead to an increased rate of type II errors (false negatives), and to some extent also type I errors (false positives). Careful model checking can reduce the risk of type-I errors but often not type II errors, since it is generally too time-consuming to carefully check all models with a non-significant effect in large-scale and genome-wide studies. Here we propose the application of a robust linear model for eQTL analysis to reduce adverse effects of deviations from the assumption of Gaussian residuals. We present results from a simulation study as well as results from the analysis of real eQTL data sets. Our findings suggest that in many situations robust models have the potential to provide more reliable eQTL results compared to conventional linear models, particularly in respect to reducing type II errors due to non-Gaussian noise. Post-genomic data, such as that generated in genome-wide eQTL studies, are often noisy and frequently contain atypical observations. Robust statistical models have the potential to provide more reliable results and increased statistical power under non-Gaussian conditions. The results presented here suggest that robust models should be considered routinely alongside other commonly used methodologies for eQTL analysis. Public Library of Science 2015-05-18 /pmc/articles/PMC4436354/ /pubmed/25992607 http://dx.doi.org/10.1371/journal.pone.0127882 Text en © 2015 Rantalainen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Rantalainen, Mattias
Lindgren, Cecilia M.
Holmes, Christopher C.
Robust Linear Models for Cis-eQTL Analysis
title Robust Linear Models for Cis-eQTL Analysis
title_full Robust Linear Models for Cis-eQTL Analysis
title_fullStr Robust Linear Models for Cis-eQTL Analysis
title_full_unstemmed Robust Linear Models for Cis-eQTL Analysis
title_short Robust Linear Models for Cis-eQTL Analysis
title_sort robust linear models for cis-eqtl analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4436354/
https://www.ncbi.nlm.nih.gov/pubmed/25992607
http://dx.doi.org/10.1371/journal.pone.0127882
work_keys_str_mv AT rantalainenmattias robustlinearmodelsforciseqtlanalysis
AT lindgrenceciliam robustlinearmodelsforciseqtlanalysis
AT holmeschristopherc robustlinearmodelsforciseqtlanalysis