Cargando…

Prediction of HLA genotypes from single-cell transcriptome data

The human leukocyte antigen (HLA) locus plays a central role in adaptive immune function and has significant clinical implications for tissue transplant compatibility and allelic disease associations. Studies using bulk-cell RNA sequencing have demonstrated that HLA transcription may be regulated in...

Descripción completa

Detalles Bibliográficos
Autores principales: Solomon, Benjamin D., Zheng, Hong, Dillon, Laura W., Goldman, Jason D., Hourigan, Christopher S., Heath, James R., Khatri, Purvesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10167300/
https://www.ncbi.nlm.nih.gov/pubmed/37180102
http://dx.doi.org/10.3389/fimmu.2023.1146826
_version_ 1785038634346348544
author Solomon, Benjamin D.
Zheng, Hong
Dillon, Laura W.
Goldman, Jason D.
Hourigan, Christopher S.
Heath, James R.
Khatri, Purvesh
author_facet Solomon, Benjamin D.
Zheng, Hong
Dillon, Laura W.
Goldman, Jason D.
Hourigan, Christopher S.
Heath, James R.
Khatri, Purvesh
author_sort Solomon, Benjamin D.
collection PubMed
description The human leukocyte antigen (HLA) locus plays a central role in adaptive immune function and has significant clinical implications for tissue transplant compatibility and allelic disease associations. Studies using bulk-cell RNA sequencing have demonstrated that HLA transcription may be regulated in an allele-specific manner and single-cell RNA sequencing (scRNA-seq) has the potential to better characterize these expression patterns. However, quantification of allele-specific expression (ASE) for HLA loci requires sample-specific reference genotyping due to extensive polymorphism. While genotype prediction from bulk RNA sequencing is well described, the feasibility of predicting HLA genotypes directly from single-cell data is unknown. Here we evaluate and expand upon several computational HLA genotyping tools by comparing predictions from human single-cell data to gold-standard, molecular genotyping. The highest 2-field accuracy averaged across all loci was 76% by arcasHLA and increased to 86% using a composite model of multiple genotyping tools. We also developed a highly accurate model (AUC 0.93) for predicting HLA-DRB345 copy number in order to improve genotyping accuracy of the HLA-DRB locus. Genotyping accuracy improved with read depth and was reproducible at repeat sampling. Using a metanalytic approach, we also show that HLA genotypes from PHLAT and OptiType can generate ASE ratios that are highly correlated (R(2) = 0.8 and 0.94, respectively) with those derived from gold-standard genotyping.
format Online
Article
Text
id pubmed-10167300
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-101673002023-05-10 Prediction of HLA genotypes from single-cell transcriptome data Solomon, Benjamin D. Zheng, Hong Dillon, Laura W. Goldman, Jason D. Hourigan, Christopher S. Heath, James R. Khatri, Purvesh Front Immunol Immunology The human leukocyte antigen (HLA) locus plays a central role in adaptive immune function and has significant clinical implications for tissue transplant compatibility and allelic disease associations. Studies using bulk-cell RNA sequencing have demonstrated that HLA transcription may be regulated in an allele-specific manner and single-cell RNA sequencing (scRNA-seq) has the potential to better characterize these expression patterns. However, quantification of allele-specific expression (ASE) for HLA loci requires sample-specific reference genotyping due to extensive polymorphism. While genotype prediction from bulk RNA sequencing is well described, the feasibility of predicting HLA genotypes directly from single-cell data is unknown. Here we evaluate and expand upon several computational HLA genotyping tools by comparing predictions from human single-cell data to gold-standard, molecular genotyping. The highest 2-field accuracy averaged across all loci was 76% by arcasHLA and increased to 86% using a composite model of multiple genotyping tools. We also developed a highly accurate model (AUC 0.93) for predicting HLA-DRB345 copy number in order to improve genotyping accuracy of the HLA-DRB locus. Genotyping accuracy improved with read depth and was reproducible at repeat sampling. Using a metanalytic approach, we also show that HLA genotypes from PHLAT and OptiType can generate ASE ratios that are highly correlated (R(2) = 0.8 and 0.94, respectively) with those derived from gold-standard genotyping. Frontiers Media S.A. 2023-04-25 /pmc/articles/PMC10167300/ /pubmed/37180102 http://dx.doi.org/10.3389/fimmu.2023.1146826 Text en Copyright © 2023 Solomon, Zheng, Dillon, Goldman, Hourigan, Heath and Khatri https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Solomon, Benjamin D.
Zheng, Hong
Dillon, Laura W.
Goldman, Jason D.
Hourigan, Christopher S.
Heath, James R.
Khatri, Purvesh
Prediction of HLA genotypes from single-cell transcriptome data
title Prediction of HLA genotypes from single-cell transcriptome data
title_full Prediction of HLA genotypes from single-cell transcriptome data
title_fullStr Prediction of HLA genotypes from single-cell transcriptome data
title_full_unstemmed Prediction of HLA genotypes from single-cell transcriptome data
title_short Prediction of HLA genotypes from single-cell transcriptome data
title_sort prediction of hla genotypes from single-cell transcriptome data
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10167300/
https://www.ncbi.nlm.nih.gov/pubmed/37180102
http://dx.doi.org/10.3389/fimmu.2023.1146826
work_keys_str_mv AT solomonbenjamind predictionofhlagenotypesfromsinglecelltranscriptomedata
AT zhenghong predictionofhlagenotypesfromsinglecelltranscriptomedata
AT dillonlauraw predictionofhlagenotypesfromsinglecelltranscriptomedata
AT goldmanjasond predictionofhlagenotypesfromsinglecelltranscriptomedata
AT houriganchristophers predictionofhlagenotypesfromsinglecelltranscriptomedata
AT heathjamesr predictionofhlagenotypesfromsinglecelltranscriptomedata
AT khatripurvesh predictionofhlagenotypesfromsinglecelltranscriptomedata