Cargando…
Expression estimation and eQTL mapping for HLA genes with a personalized pipeline
The HLA (Human Leukocyte Antigens) genes are well-documented targets of balancing selection, and variation at these loci is associated with many disease phenotypes. Variation in expression levels also influences disease susceptibility and resistance, but little information exists about the regulatio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6497317/ https://www.ncbi.nlm.nih.gov/pubmed/31009447 http://dx.doi.org/10.1371/journal.pgen.1008091 |
_version_ | 1783415451066302464 |
---|---|
author | Aguiar, Vitor R. C. César, Jônatas Delaneau, Olivier Dermitzakis, Emmanouil T. Meyer, Diogo |
author_facet | Aguiar, Vitor R. C. César, Jônatas Delaneau, Olivier Dermitzakis, Emmanouil T. Meyer, Diogo |
author_sort | Aguiar, Vitor R. C. |
collection | PubMed |
description | The HLA (Human Leukocyte Antigens) genes are well-documented targets of balancing selection, and variation at these loci is associated with many disease phenotypes. Variation in expression levels also influences disease susceptibility and resistance, but little information exists about the regulation and population-level patterns of expression. This results from the difficulty in mapping short reads originated from these highly polymorphic loci, and in accounting for the existence of several paralogues. We developed a computational pipeline to accurately estimate expression for HLA genes based on RNA-seq, improving both locus-level and allele-level estimates. First, reads are aligned to all known HLA sequences in order to infer HLA genotypes, then quantification of expression is carried out using a personalized index. We use simulations to show that expression estimates obtained in this way are not biased due to divergence from the reference genome. We applied our pipeline to the GEUVADIS dataset, and compared the quantifications to those obtained with reference transcriptome. Although the personalized pipeline recovers more reads, we found that using the reference transcriptome produces estimates similar to the personalized pipeline (r ≥ 0.87) with the exception of HLA-DQA1. We describe the impact of the HLA-personalized approach on downstream analyses for nine classical HLA loci (HLA-A, HLA-C, HLA-B, HLA-DRA, HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1). Although the influence of the HLA-personalized approach is modest for eQTL mapping, the p-values and the causality of the eQTLs obtained are better than when the reference transcriptome is used. We investigate how the eQTLs we identified explain variation in expression among lineages of HLA alleles. Finally, we discuss possible causes underlying differences between expression estimates obtained using RNA-seq, antibody-based approaches and qPCR. |
format | Online Article Text |
id | pubmed-6497317 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-64973172019-05-17 Expression estimation and eQTL mapping for HLA genes with a personalized pipeline Aguiar, Vitor R. C. César, Jônatas Delaneau, Olivier Dermitzakis, Emmanouil T. Meyer, Diogo PLoS Genet Research Article The HLA (Human Leukocyte Antigens) genes are well-documented targets of balancing selection, and variation at these loci is associated with many disease phenotypes. Variation in expression levels also influences disease susceptibility and resistance, but little information exists about the regulation and population-level patterns of expression. This results from the difficulty in mapping short reads originated from these highly polymorphic loci, and in accounting for the existence of several paralogues. We developed a computational pipeline to accurately estimate expression for HLA genes based on RNA-seq, improving both locus-level and allele-level estimates. First, reads are aligned to all known HLA sequences in order to infer HLA genotypes, then quantification of expression is carried out using a personalized index. We use simulations to show that expression estimates obtained in this way are not biased due to divergence from the reference genome. We applied our pipeline to the GEUVADIS dataset, and compared the quantifications to those obtained with reference transcriptome. Although the personalized pipeline recovers more reads, we found that using the reference transcriptome produces estimates similar to the personalized pipeline (r ≥ 0.87) with the exception of HLA-DQA1. We describe the impact of the HLA-personalized approach on downstream analyses for nine classical HLA loci (HLA-A, HLA-C, HLA-B, HLA-DRA, HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1). Although the influence of the HLA-personalized approach is modest for eQTL mapping, the p-values and the causality of the eQTLs obtained are better than when the reference transcriptome is used. We investigate how the eQTLs we identified explain variation in expression among lineages of HLA alleles. Finally, we discuss possible causes underlying differences between expression estimates obtained using RNA-seq, antibody-based approaches and qPCR. Public Library of Science 2019-04-22 /pmc/articles/PMC6497317/ /pubmed/31009447 http://dx.doi.org/10.1371/journal.pgen.1008091 Text en © 2019 Aguiar et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Aguiar, Vitor R. C. César, Jônatas Delaneau, Olivier Dermitzakis, Emmanouil T. Meyer, Diogo Expression estimation and eQTL mapping for HLA genes with a personalized pipeline |
title | Expression estimation and eQTL mapping for HLA genes with a personalized pipeline |
title_full | Expression estimation and eQTL mapping for HLA genes with a personalized pipeline |
title_fullStr | Expression estimation and eQTL mapping for HLA genes with a personalized pipeline |
title_full_unstemmed | Expression estimation and eQTL mapping for HLA genes with a personalized pipeline |
title_short | Expression estimation and eQTL mapping for HLA genes with a personalized pipeline |
title_sort | expression estimation and eqtl mapping for hla genes with a personalized pipeline |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6497317/ https://www.ncbi.nlm.nih.gov/pubmed/31009447 http://dx.doi.org/10.1371/journal.pgen.1008091 |
work_keys_str_mv | AT aguiarvitorrc expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline AT cesarjonatas expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline AT delaneauolivier expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline AT dermitzakisemmanouilt expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline AT meyerdiogo expressionestimationandeqtlmappingforhlageneswithapersonalizedpipeline |