Cargando…

Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing

BACKGROUND: A gap currently exists between genetic variants and the underlying cell and tissue biology of a trait, and expression quantitative trait loci (eQTL) studies provide important information to help close that gap. However, two concerns that arise with eQTL analyses using RNA-sequencing data...

Descripción completa

Detalles Bibliográficos
Autores principales: Marrella, Mackenzie A., Biase, Fernando H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161580/
https://www.ncbi.nlm.nih.gov/pubmed/37143150
http://dx.doi.org/10.1186/s40104-023-00861-0
_version_ 1785037523192381440
author Marrella, Mackenzie A.
Biase, Fernando H.
author_facet Marrella, Mackenzie A.
Biase, Fernando H.
author_sort Marrella, Mackenzie A.
collection PubMed
description BACKGROUND: A gap currently exists between genetic variants and the underlying cell and tissue biology of a trait, and expression quantitative trait loci (eQTL) studies provide important information to help close that gap. However, two concerns that arise with eQTL analyses using RNA-sequencing data are normalization of data across samples and the data not following a normal distribution. Multiple pipelines have been suggested to address this. For instance, the most recent analysis of the human and farm Genotype-Tissue Expression (GTEx) project proposes using trimmed means of M-values (TMM) to normalize the data followed by an inverse normal transformation. RESULTS: In this study, we reasoned that eQTL analysis could be carried out using the same framework used for differential gene expression (DGE), which uses a negative binomial model, a statistical test feasible for count data. Using the GTEx framework, we identified 35 significant eQTLs (P < 5 × 10(–8)) following the ANOVA model and 39 significant eQTLs (P < 5 × 10(–8)) following the additive model. Using a differential gene expression framework, we identified 930 and six significant eQTLs (P < 5 × 10(–8)) following an analytical framework equivalent to the ANOVA and additive model, respectively. When we compared the two approaches, there was no overlap of significant eQTLs between the two frameworks. Because we defined specific contrasts, we identified trans eQTLs that more closely resembled what we expect from genetic variants showing complete dominance between alleles. Yet, these were not identified by the GTEx framework. CONCLUSIONS: Our results show that transforming RNA-sequencing data to fit a normal distribution prior to eQTL analysis is not required when the DGE framework is employed. Our proposed approach detected biologically relevant variants that otherwise would not have been identified due to data transformation to fit a normal distribution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40104-023-00861-0.
format Online
Article
Text
id pubmed-10161580
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101615802023-05-06 Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing Marrella, Mackenzie A. Biase, Fernando H. J Anim Sci Biotechnol Research BACKGROUND: A gap currently exists between genetic variants and the underlying cell and tissue biology of a trait, and expression quantitative trait loci (eQTL) studies provide important information to help close that gap. However, two concerns that arise with eQTL analyses using RNA-sequencing data are normalization of data across samples and the data not following a normal distribution. Multiple pipelines have been suggested to address this. For instance, the most recent analysis of the human and farm Genotype-Tissue Expression (GTEx) project proposes using trimmed means of M-values (TMM) to normalize the data followed by an inverse normal transformation. RESULTS: In this study, we reasoned that eQTL analysis could be carried out using the same framework used for differential gene expression (DGE), which uses a negative binomial model, a statistical test feasible for count data. Using the GTEx framework, we identified 35 significant eQTLs (P < 5 × 10(–8)) following the ANOVA model and 39 significant eQTLs (P < 5 × 10(–8)) following the additive model. Using a differential gene expression framework, we identified 930 and six significant eQTLs (P < 5 × 10(–8)) following an analytical framework equivalent to the ANOVA and additive model, respectively. When we compared the two approaches, there was no overlap of significant eQTLs between the two frameworks. Because we defined specific contrasts, we identified trans eQTLs that more closely resembled what we expect from genetic variants showing complete dominance between alleles. Yet, these were not identified by the GTEx framework. CONCLUSIONS: Our results show that transforming RNA-sequencing data to fit a normal distribution prior to eQTL analysis is not required when the DGE framework is employed. Our proposed approach detected biologically relevant variants that otherwise would not have been identified due to data transformation to fit a normal distribution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40104-023-00861-0. BioMed Central 2023-05-05 /pmc/articles/PMC10161580/ /pubmed/37143150 http://dx.doi.org/10.1186/s40104-023-00861-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Marrella, Mackenzie A.
Biase, Fernando H.
Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
title Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
title_full Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
title_fullStr Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
title_full_unstemmed Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
title_short Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
title_sort robust identification of regulatory variants (eqtls) using a differential expression framework developed for rna-sequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161580/
https://www.ncbi.nlm.nih.gov/pubmed/37143150
http://dx.doi.org/10.1186/s40104-023-00861-0
work_keys_str_mv AT marrellamackenziea robustidentificationofregulatoryvariantseqtlsusingadifferentialexpressionframeworkdevelopedforrnasequencing
AT biasefernandoh robustidentificationofregulatoryvariantseqtlsusingadifferentialexpressionframeworkdevelopedforrnasequencing