Cargando…

eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches

Expression quantitative trait loci (eQTL) provide insight on transcription regulation and illuminate the molecular basis of phenotypic outcomes. High-throughput RNA sequencing (RNA-seq) is becoming a popular technique to measure gene expression abundance. Traditional eQTL mapping methods for microar...

Descripción completa

Detalles Bibliográficos
Autores principales: Lutz, Sharon Marie, Thwing, Annie, Fingerlin, Tasha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6776318/
https://www.ncbi.nlm.nih.gov/pubmed/31581212
http://dx.doi.org/10.1371/journal.pone.0223273
_version_ 1783456398089125888
author Lutz, Sharon Marie
Thwing, Annie
Fingerlin, Tasha
author_facet Lutz, Sharon Marie
Thwing, Annie
Fingerlin, Tasha
author_sort Lutz, Sharon Marie
collection PubMed
description Expression quantitative trait loci (eQTL) provide insight on transcription regulation and illuminate the molecular basis of phenotypic outcomes. High-throughput RNA sequencing (RNA-seq) is becoming a popular technique to measure gene expression abundance. Traditional eQTL mapping methods for microarray expression data often assume the expression data follow a normal distribution. As a result, for RNA-seq data, total read count measurements can be normalized by normal quantile transformation in order to fit the data using a linear regression. Other approaches model the total read counts using a negative binomial regression. While these methods work well for common variants (minor allele frequencies > 5% or 1%), an extension of existing methodology is needed to accommodate a collection of rare variants in RNA-seq data. Here, we examine 2 approaches that are direct applications of existing methodology and apply these approaches to RNAseq studies: 1) collapsing the rare variants in the region and using either negative binomial regression or Poisson regression and 2) using the normalized read counts with the Sequence Kernel Association Test (SKAT), the burden test for SKAT (SKAT-Burden), or an optimal combination of these two tests (SKAT-O). We evaluated these approaches via simulation studies under numerous scenarios and applied these approaches to the 1,000 Genomes Project.
format Online
Article
Text
id pubmed-6776318
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-67763182019-10-11 eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches Lutz, Sharon Marie Thwing, Annie Fingerlin, Tasha PLoS One Research Article Expression quantitative trait loci (eQTL) provide insight on transcription regulation and illuminate the molecular basis of phenotypic outcomes. High-throughput RNA sequencing (RNA-seq) is becoming a popular technique to measure gene expression abundance. Traditional eQTL mapping methods for microarray expression data often assume the expression data follow a normal distribution. As a result, for RNA-seq data, total read count measurements can be normalized by normal quantile transformation in order to fit the data using a linear regression. Other approaches model the total read counts using a negative binomial regression. While these methods work well for common variants (minor allele frequencies > 5% or 1%), an extension of existing methodology is needed to accommodate a collection of rare variants in RNA-seq data. Here, we examine 2 approaches that are direct applications of existing methodology and apply these approaches to RNAseq studies: 1) collapsing the rare variants in the region and using either negative binomial regression or Poisson regression and 2) using the normalized read counts with the Sequence Kernel Association Test (SKAT), the burden test for SKAT (SKAT-Burden), or an optimal combination of these two tests (SKAT-O). We evaluated these approaches via simulation studies under numerous scenarios and applied these approaches to the 1,000 Genomes Project. Public Library of Science 2019-10-03 /pmc/articles/PMC6776318/ /pubmed/31581212 http://dx.doi.org/10.1371/journal.pone.0223273 Text en © 2019 Lutz et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lutz, Sharon Marie
Thwing, Annie
Fingerlin, Tasha
eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches
title eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches
title_full eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches
title_fullStr eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches
title_full_unstemmed eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches
title_short eQTL mapping of rare variant associations using RNA-seq data: An evaluation of approaches
title_sort eqtl mapping of rare variant associations using rna-seq data: an evaluation of approaches
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6776318/
https://www.ncbi.nlm.nih.gov/pubmed/31581212
http://dx.doi.org/10.1371/journal.pone.0223273
work_keys_str_mv AT lutzsharonmarie eqtlmappingofrarevariantassociationsusingrnaseqdataanevaluationofapproaches
AT thwingannie eqtlmappingofrarevariantassociationsusingrnaseqdataanevaluationofapproaches
AT fingerlintasha eqtlmappingofrarevariantassociationsusingrnaseqdataanevaluationofapproaches