Cargando…
kruX: matrix-based non-parametric eQTL discovery
BACKGROUND: The Kruskal-Wallis test is a popular non-parametric statistical test for identifying expression quantitative trait loci (eQTLs) from genome-wide data due to its robustness against variations in the underlying genetic model and expression trait distribution, but testing billions of marker...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897912/ https://www.ncbi.nlm.nih.gov/pubmed/24423115 http://dx.doi.org/10.1186/1471-2105-15-11 |
_version_ | 1782300320806731776 |
---|---|
author | Qi, Jianlong Asl, Hassan Foroughi Björkegren, Johan Michoel, Tom |
author_facet | Qi, Jianlong Asl, Hassan Foroughi Björkegren, Johan Michoel, Tom |
author_sort | Qi, Jianlong |
collection | PubMed |
description | BACKGROUND: The Kruskal-Wallis test is a popular non-parametric statistical test for identifying expression quantitative trait loci (eQTLs) from genome-wide data due to its robustness against variations in the underlying genetic model and expression trait distribution, but testing billions of marker-trait combinations one-by-one can become computationally prohibitive. RESULTS: We developed kruX, an algorithm implemented in Matlab, Python and R that uses matrix multiplications to simultaneously calculate the Kruskal-Wallis test statistic for several millions of marker-trait combinations at once. KruX is more than ten thousand times faster than computing associations one-by-one on a typical human dataset. We used kruX and a dataset of more than 500k SNPs and 20k expression traits measured in 102 human blood samples to compare eQTLs detected by the Kruskal-Wallis test to eQTLs detected by the parametric ANOVA and linear model methods. We found that the Kruskal-Wallis test is more robust against data outliers and heterogeneous genotype group sizes and detects a higher proportion of non-linear associations, but is more conservative for calling additive linear associations. CONCLUSION: kruX enables the use of robust non-parametric methods for massive eQTL mapping without the need for a high-performance computing infrastructure and is freely available from http://krux.googlecode.com. |
format | Online Article Text |
id | pubmed-3897912 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38979122014-02-05 kruX: matrix-based non-parametric eQTL discovery Qi, Jianlong Asl, Hassan Foroughi Björkegren, Johan Michoel, Tom BMC Bioinformatics Software BACKGROUND: The Kruskal-Wallis test is a popular non-parametric statistical test for identifying expression quantitative trait loci (eQTLs) from genome-wide data due to its robustness against variations in the underlying genetic model and expression trait distribution, but testing billions of marker-trait combinations one-by-one can become computationally prohibitive. RESULTS: We developed kruX, an algorithm implemented in Matlab, Python and R that uses matrix multiplications to simultaneously calculate the Kruskal-Wallis test statistic for several millions of marker-trait combinations at once. KruX is more than ten thousand times faster than computing associations one-by-one on a typical human dataset. We used kruX and a dataset of more than 500k SNPs and 20k expression traits measured in 102 human blood samples to compare eQTLs detected by the Kruskal-Wallis test to eQTLs detected by the parametric ANOVA and linear model methods. We found that the Kruskal-Wallis test is more robust against data outliers and heterogeneous genotype group sizes and detects a higher proportion of non-linear associations, but is more conservative for calling additive linear associations. CONCLUSION: kruX enables the use of robust non-parametric methods for massive eQTL mapping without the need for a high-performance computing infrastructure and is freely available from http://krux.googlecode.com. BioMed Central 2014-01-14 /pmc/articles/PMC3897912/ /pubmed/24423115 http://dx.doi.org/10.1186/1471-2105-15-11 Text en Copyright © 2014 Qi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Qi, Jianlong Asl, Hassan Foroughi Björkegren, Johan Michoel, Tom kruX: matrix-based non-parametric eQTL discovery |
title | kruX: matrix-based non-parametric eQTL discovery |
title_full | kruX: matrix-based non-parametric eQTL discovery |
title_fullStr | kruX: matrix-based non-parametric eQTL discovery |
title_full_unstemmed | kruX: matrix-based non-parametric eQTL discovery |
title_short | kruX: matrix-based non-parametric eQTL discovery |
title_sort | krux: matrix-based non-parametric eqtl discovery |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897912/ https://www.ncbi.nlm.nih.gov/pubmed/24423115 http://dx.doi.org/10.1186/1471-2105-15-11 |
work_keys_str_mv | AT qijianlong kruxmatrixbasednonparametriceqtldiscovery AT aslhassanforoughi kruxmatrixbasednonparametriceqtldiscovery AT bjorkegrenjohan kruxmatrixbasednonparametriceqtldiscovery AT michoeltom kruxmatrixbasednonparametriceqtldiscovery |