AeQTL: eQTL analysis using region-based aggregation of rare genomic variants

Bibliographic Details
Main Authors: Dong, Guanlan; Wendl, Michael C.; Zhang, Bin; Ding, Li; Huang, Kuan-lin
Format: Online Article Text
Language: English
Published: 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8050802/
https://www.ncbi.nlm.nih.gov/pubmed/33691015
collection PubMed
description Concurrently available genomic and transcriptomic data from large cohorts provide opportunities to discover expression quantitative trait loci (eQTLs), genetic variants associated with changes in gene expression. However, statistical power to detect rare-variant eQTLs is often limited, and most existing eQTL tools are not compatible with sequence variant file formats. We have developed AeQTL (Aggregated eQTL), a software tool that performs eQTL analysis on variants aggregated according to user-specified regions and is designed to accommodate standard genomic files. AeQTL consistently yielded similar or higher power for identifying rare-variant eQTLs than single-variant tests. Using AeQTL, we discovered that aggregated rare germline truncations in cis exomic regions are significantly associated with the expression of BRCA1 and SLC25A39 in breast tumors. In a somatic mutation pan-cancer analysis, aggregated mutations predicted to be missense and those predicted to be truncations were differentially associated with the expression of cancer driver genes, and somatic truncation eQTLs were further identified as a new multi-omic classifier of oncogenes versus tumor-suppressor genes. AeQTL is easy to use and customize, allowing broad application for discovering rare variants, both coding and noncoding, associated with gene expression. AeQTL is implemented in Python, and the source code is freely available at https://github.com/Huang-lab/AeQTL under the MIT license.
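The description above says that variants are aggregated by user-specified region before the eQTL test, but it does not give the statistical model. The sketch below illustrates the general idea only and is not AeQTL's implementation or interface. It assumes a simple carrier-indicator aggregation, an ordinary least-squares slope, and a permutation p-value. All names (`genotypes`, `regions`, `expression`, `aggregate`, `ols_slope`, `permutation_pvalue`) and the toy data are hypothetical.

```python
import random
from statistics import mean

# Hypothetical toy inputs; the layout is an illustrative assumption,
# not AeQTL's actual input format.
# genotypes[variant_id][sample_id] = 1 if the sample carries the rare variant
genotypes = {
    "var1": {"s1": 1, "s2": 0, "s3": 0, "s4": 0, "s5": 0, "s6": 0},
    "var2": {"s1": 0, "s2": 1, "s3": 0, "s4": 0, "s5": 0, "s6": 0},
    "var3": {"s1": 0, "s2": 0, "s3": 1, "s4": 0, "s5": 0, "s6": 0},
}
regions = {"GENE_A_exons": ["var1", "var2", "var3"]}
expression = {"s1": 2.1, "s2": 1.9, "s3": 2.4, "s4": 5.0, "s5": 5.3, "s6": 4.8}


def aggregate(region_variants, genotypes, samples):
    """Collapse all variants in a region into one carrier indicator per sample."""
    return [
        int(any(genotypes[v].get(s, 0) > 0 for v in region_variants))
        for s in samples
    ]


def ols_slope(x, y):
    """Slope of the least-squares line y ~ x (simple linear regression)."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("no variation in the aggregated genotype")
    return sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx


def permutation_pvalue(x, y, n_perm=2000, seed=0):
    """Two-sided permutation p-value for the regression slope."""
    rng = random.Random(seed)
    observed = abs(ols_slope(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        # Small tolerance guards against floating-point ties.
        if abs(ols_slope(x, shuffled)) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)


samples = sorted(expression)
y = [expression[s] for s in samples]
burden = aggregate(regions["GENE_A_exons"], genotypes, samples)
slope = ols_slope(burden, y)
p_value = permutation_pvalue(burden, y)
print(burden, round(slope, 3), round(p_value, 3))
```

In this toy data each variant has a single carrier, so a per-variant test would compare one sample against five, while the aggregated indicator compares three carriers against three non-carriers. That is the intuition behind the power gain the description reports for region-based aggregation.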
id pubmed-8050802
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-8050802 2021-04-16 Pac Symp Biocomput Article 2021 /pmc/articles/PMC8050802/ /pubmed/33691015 Text en https://creativecommons.org/licenses/by/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.