Cargando…
eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches
Expression quantitative trait loci (eQTL) provide insight on transcription regulation and illuminate the molecular basis of phenotypic outcomes. High-throughput RNA sequencing (RNA-seq) is becoming a popular technique to measure gene expression abundance. Traditional eQTL mapping methods for microar...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6776318/ https://www.ncbi.nlm.nih.gov/pubmed/31581212 http://dx.doi.org/10.1371/journal.pone.0223273 |
_version_ | 1783456398089125888 |
---|---|
author | Lutz, Sharon Marie Thwing, Annie Fingerlin, Tasha |
author_facet | Lutz, Sharon Marie Thwing, Annie Fingerlin, Tasha |
author_sort | Lutz, Sharon Marie |
collection | PubMed |
description | Expression quantitative trait loci (eQTL) provide insight on transcription regulation and illuminate the molecular basis of phenotypic outcomes. High-throughput RNA sequencing (RNA-seq) is becoming a popular technique to measure gene expression abundance. Traditional eQTL mapping methods for microarray expression data often assume the expression data follow a normal distribution. As a result, for RNA-seq data, total read count measurements can be normalized by normal quantile transformation in order to fit the data using a linear regression. Other approaches model the total read counts using a negative binomial regression. While these methods work well for common variants (minor allele frequencies > 5% or 1%), an extension of existing methodology is needed to accommodate a collection of rare variants in RNA-seq data. Here, we examine 2 approaches that are direct applications of existing methodology and apply these approaches to RNAseq studies: 1) collapsing the rare variants in the region and using either negative binomial regression or Poisson regression and 2) using the normalized read counts with the Sequence Kernel Association Test (SKAT), the burden test for SKAT (SKAT-Burden), or an optimal combination of these two tests (SKAT-O). We evaluated these approaches via simulation studies under numerous scenarios and applied these approaches to the 1,000 Genomes Project. |
format | Online Article Text |
id | pubmed-6776318 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-67763182019-10-11 eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches Lutz, Sharon Marie Thwing, Annie Fingerlin, Tasha PLoS One Research Article Expression quantitative trait loci (eQTL) provide insight on transcription regulation and illuminate the molecular basis of phenotypic outcomes. High-throughput RNA sequencing (RNA-seq) is becoming a popular technique to measure gene expression abundance. Traditional eQTL mapping methods for microarray expression data often assume the expression data follow a normal distribution. As a result, for RNA-seq data, total read count measurements can be normalized by normal quantile transformation in order to fit the data using a linear regression. Other approaches model the total read counts using a negative binomial regression. While these methods work well for common variants (minor allele frequencies > 5% or 1%), an extension of existing methodology is needed to accommodate a collection of rare variants in RNA-seq data. Here, we examine 2 approaches that are direct applications of existing methodology and apply these approaches to RNAseq studies: 1) collapsing the rare variants in the region and using either negative binomial regression or Poisson regression and 2) using the normalized read counts with the Sequence Kernel Association Test (SKAT), the burden test for SKAT (SKAT-Burden), or an optimal combination of these two tests (SKAT-O). We evaluated these approaches via simulation studies under numerous scenarios and applied these approaches to the 1,000 Genomes Project. Public Library of Science 2019-10-03 /pmc/articles/PMC6776318/ /pubmed/31581212 http://dx.doi.org/10.1371/journal.pone.0223273 Text en © 2019 Lutz et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lutz, Sharon Marie Thwing, Annie Fingerlin, Tasha eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches |
title | eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches |
title_full | eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches |
title_fullStr | eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches |
title_full_unstemmed | eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches |
title_short | eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches |
title_sort | eqtl mapping of rare variant associations using rna-seq data: an evaluation of approaches |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6776318/ https://www.ncbi.nlm.nih.gov/pubmed/31581212 http://dx.doi.org/10.1371/journal.pone.0223273 |
work_keys_str_mv | AT lutzsharonmarie eqtlmappingofrarevariantassociationsusingrnaseqdataanevaluationofapproaches AT thwingannie eqtlmappingofrarevariantassociationsusingrnaseqdataanevaluationofapproaches AT fingerlintasha eqtlmappingofrarevariantassociationsusingrnaseqdataanevaluationofapproaches |