Cargando…

Expression estimation and eQTL mapping for HLA genes with a personalized pipeline

The HLA (Human Leukocyte Antigens) genes are well-documented targets of balancing selection, and variation at these loci is associated with many disease phenotypes. Variation in expression levels also influences disease susceptibility and resistance, but little information exists about the regulatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Aguiar, Vitor R. C., César, Jônatas, Delaneau, Olivier, Dermitzakis, Emmanouil T., Meyer, Diogo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6497317/
https://www.ncbi.nlm.nih.gov/pubmed/31009447
http://dx.doi.org/10.1371/journal.pgen.1008091
_version_ 1783415451066302464
author Aguiar, Vitor R. C.
César, Jônatas
Delaneau, Olivier
Dermitzakis, Emmanouil T.
Meyer, Diogo
author_facet Aguiar, Vitor R. C.
César, Jônatas
Delaneau, Olivier
Dermitzakis, Emmanouil T.
Meyer, Diogo
author_sort Aguiar, Vitor R. C.
collection PubMed
description The HLA (Human Leukocyte Antigens) genes are well-documented targets of balancing selection, and variation at these loci is associated with many disease phenotypes. Variation in expression levels also influences disease susceptibility and resistance, but little information exists about the regulation and population-level patterns of expression. This results from the difficulty in mapping short reads originated from these highly polymorphic loci, and in accounting for the existence of several paralogues. We developed a computational pipeline to accurately estimate expression for HLA genes based on RNA-seq, improving both locus-level and allele-level estimates. First, reads are aligned to all known HLA sequences in order to infer HLA genotypes, then quantification of expression is carried out using a personalized index. We use simulations to show that expression estimates obtained in this way are not biased due to divergence from the reference genome. We applied our pipeline to the GEUVADIS dataset, and compared the quantifications to those obtained with reference transcriptome. Although the personalized pipeline recovers more reads, we found that using the reference transcriptome produces estimates similar to the personalized pipeline (r ≥ 0.87) with the exception of HLA-DQA1. We describe the impact of the HLA-personalized approach on downstream analyses for nine classical HLA loci (HLA-A, HLA-C, HLA-B, HLA-DRA, HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1). Although the influence of the HLA-personalized approach is modest for eQTL mapping, the p-values and the causality of the eQTLs obtained are better than when the reference transcriptome is used. We investigate how the eQTLs we identified explain variation in expression among lineages of HLA alleles. Finally, we discuss possible causes underlying differences between expression estimates obtained using RNA-seq, antibody-based approaches and qPCR.
format Online
Article
Text
id pubmed-6497317
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64973172019-05-17 Expression estimation and eQTL mapping for HLA genes with a personalized pipeline Aguiar, Vitor R. C. César, Jônatas Delaneau, Olivier Dermitzakis, Emmanouil T. Meyer, Diogo PLoS Genet Research Article The HLA (Human Leukocyte Antigens) genes are well-documented targets of balancing selection, and variation at these loci is associated with many disease phenotypes. Variation in expression levels also influences disease susceptibility and resistance, but little information exists about the regulation and population-level patterns of expression. This results from the difficulty in mapping short reads originated from these highly polymorphic loci, and in accounting for the existence of several paralogues. We developed a computational pipeline to accurately estimate expression for HLA genes based on RNA-seq, improving both locus-level and allele-level estimates. First, reads are aligned to all known HLA sequences in order to infer HLA genotypes, then quantification of expression is carried out using a personalized index. We use simulations to show that expression estimates obtained in this way are not biased due to divergence from the reference genome. We applied our pipeline to the GEUVADIS dataset, and compared the quantifications to those obtained with reference transcriptome. Although the personalized pipeline recovers more reads, we found that using the reference transcriptome produces estimates similar to the personalized pipeline (r ≥ 0.87) with the exception of HLA-DQA1. We describe the impact of the HLA-personalized approach on downstream analyses for nine classical HLA loci (HLA-A, HLA-C, HLA-B, HLA-DRA, HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1). Although the influence of the HLA-personalized approach is modest for eQTL mapping, the p-values and the causality of the eQTLs obtained are better than when the reference transcriptome is used. We investigate how the eQTLs we identified explain variation in expression among lineages of HLA alleles. Finally, we discuss possible causes underlying differences between expression estimates obtained using RNA-seq, antibody-based approaches and qPCR. Public Library of Science 2019-04-22 /pmc/articles/PMC6497317/ /pubmed/31009447 http://dx.doi.org/10.1371/journal.pgen.1008091 Text en © 2019 Aguiar et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Aguiar, Vitor R. C.
César, Jônatas
Delaneau, Olivier
Dermitzakis, Emmanouil T.
Meyer, Diogo
Expression estimation and eQTL mapping for HLA genes with a personalized pipeline
title Expression estimation and eQTL mapping for HLA genes with a personalized pipeline
title_full Expression estimation and eQTL mapping for HLA genes with a personalized pipeline
title_fullStr Expression estimation and eQTL mapping for HLA genes with a personalized pipeline
title_full_unstemmed Expression estimation and eQTL mapping for HLA genes with a personalized pipeline
title_short Expression estimation and eQTL mapping for HLA genes with a personalized pipeline
title_sort expression estimation and eqtl mapping for hla genes with a personalized pipeline
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6497317/
https://www.ncbi.nlm.nih.gov/pubmed/31009447
http://dx.doi.org/10.1371/journal.pgen.1008091
work_keys_str_mv AT aguiarvitorrc expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline
AT cesarjonatas expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline
AT delaneauolivier expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline
AT dermitzakisemmanouilt expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline
AT meyerdiogo expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline