Cargando…
Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
BACKGROUND: A gap currently exists between genetic variants and the underlying cell and tissue biology of a trait, and expression quantitative trait loci (eQTL) studies provide important information to help close that gap. However, two concerns that arise with eQTL analyses using RNA-sequencing data...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161580/ https://www.ncbi.nlm.nih.gov/pubmed/37143150 http://dx.doi.org/10.1186/s40104-023-00861-0 |
_version_ | 1785037523192381440 |
---|---|
author | Marrella, Mackenzie A. Biase, Fernando H. |
author_facet | Marrella, Mackenzie A. Biase, Fernando H. |
author_sort | Marrella, Mackenzie A. |
collection | PubMed |
description | BACKGROUND: A gap currently exists between genetic variants and the underlying cell and tissue biology of a trait, and expression quantitative trait loci (eQTL) studies provide important information to help close that gap. However, two concerns that arise with eQTL analyses using RNA-sequencing data are normalization of data across samples and the data not following a normal distribution. Multiple pipelines have been suggested to address this. For instance, the most recent analysis of the human and farm Genotype-Tissue Expression (GTEx) project proposes using trimmed means of M-values (TMM) to normalize the data followed by an inverse normal transformation. RESULTS: In this study, we reasoned that eQTL analysis could be carried out using the same framework used for differential gene expression (DGE), which uses a negative binomial model, a statistical test feasible for count data. Using the GTEx framework, we identified 35 significant eQTLs (P < 5 × 10(–8)) following the ANOVA model and 39 significant eQTLs (P < 5 × 10(–8)) following the additive model. Using a differential gene expression framework, we identified 930 and six significant eQTLs (P < 5 × 10(–8)) following an analytical framework equivalent to the ANOVA and additive model, respectively. When we compared the two approaches, there was no overlap of significant eQTLs between the two frameworks. Because we defined specific contrasts, we identified trans eQTLs that more closely resembled what we expect from genetic variants showing complete dominance between alleles. Yet, these were not identified by the GTEx framework. CONCLUSIONS: Our results show that transforming RNA-sequencing data to fit a normal distribution prior to eQTL analysis is not required when the DGE framework is employed. Our proposed approach detected biologically relevant variants that otherwise would not have been identified due to data transformation to fit a normal distribution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40104-023-00861-0. |
format | Online Article Text |
id | pubmed-10161580 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101615802023-05-06 Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing Marrella, Mackenzie A. Biase, Fernando H. J Anim Sci Biotechnol Research BACKGROUND: A gap currently exists between genetic variants and the underlying cell and tissue biology of a trait, and expression quantitative trait loci (eQTL) studies provide important information to help close that gap. However, two concerns that arise with eQTL analyses using RNA-sequencing data are normalization of data across samples and the data not following a normal distribution. Multiple pipelines have been suggested to address this. For instance, the most recent analysis of the human and farm Genotype-Tissue Expression (GTEx) project proposes using trimmed means of M-values (TMM) to normalize the data followed by an inverse normal transformation. RESULTS: In this study, we reasoned that eQTL analysis could be carried out using the same framework used for differential gene expression (DGE), which uses a negative binomial model, a statistical test feasible for count data. Using the GTEx framework, we identified 35 significant eQTLs (P < 5 × 10(–8)) following the ANOVA model and 39 significant eQTLs (P < 5 × 10(–8)) following the additive model. Using a differential gene expression framework, we identified 930 and six significant eQTLs (P < 5 × 10(–8)) following an analytical framework equivalent to the ANOVA and additive model, respectively. When we compared the two approaches, there was no overlap of significant eQTLs between the two frameworks. Because we defined specific contrasts, we identified trans eQTLs that more closely resembled what we expect from genetic variants showing complete dominance between alleles. Yet, these were not identified by the GTEx framework. CONCLUSIONS: Our results show that transforming RNA-sequencing data to fit a normal distribution prior to eQTL analysis is not required when the DGE framework is employed. Our proposed approach detected biologically relevant variants that otherwise would not have been identified due to data transformation to fit a normal distribution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40104-023-00861-0. BioMed Central 2023-05-05 /pmc/articles/PMC10161580/ /pubmed/37143150 http://dx.doi.org/10.1186/s40104-023-00861-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Marrella, Mackenzie A. Biase, Fernando H. Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing |
title | Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing |
title_full | Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing |
title_fullStr | Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing |
title_full_unstemmed | Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing |
title_short | Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing |
title_sort | robust identification of regulatory variants (eqtls) using a differential expression framework developed for rna-sequencing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161580/ https://www.ncbi.nlm.nih.gov/pubmed/37143150 http://dx.doi.org/10.1186/s40104-023-00861-0 |
work_keys_str_mv | AT marrellamackenziea robustidentificationofregulatoryvariantseqtlsusingadifferentialexpressionframeworkdevelopedforrnasequencing AT biasefernandoh robustidentificationofregulatoryvariantseqtlsusingadifferentialexpressionframeworkdevelopedforrnasequencing |