Cargando…

Statistical Resolution of Ambiguous HLA Typing Data

High-resolution HLA typing plays a central role in many areas of immunology, such as in identifying immunogenetic risk factors for disease, in studying how the genomes of pathogens evolve in response to immune selection pressures, and also in vaccine design, where identification of HLA-restricted ep...

Descripción completa

Detalles Bibliográficos
Autores principales: Listgarten, Jennifer, Brumme, Zabrina, Kadie, Carl, Xiaojiang, Gao, Walker, Bruce, Carrington, Mary, Goulder, Philip, Heckerman, David
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2289775/
https://www.ncbi.nlm.nih.gov/pubmed/18392148
http://dx.doi.org/10.1371/journal.pcbi.1000016
_version_ 1782152336929456128
author Listgarten, Jennifer
Brumme, Zabrina
Kadie, Carl
Xiaojiang, Gao
Walker, Bruce
Carrington, Mary
Goulder, Philip
Heckerman, David
author_facet Listgarten, Jennifer
Brumme, Zabrina
Kadie, Carl
Xiaojiang, Gao
Walker, Bruce
Carrington, Mary
Goulder, Philip
Heckerman, David
author_sort Listgarten, Jennifer
collection PubMed
description High-resolution HLA typing plays a central role in many areas of immunology, such as in identifying immunogenetic risk factors for disease, in studying how the genomes of pathogens evolve in response to immune selection pressures, and also in vaccine design, where identification of HLA-restricted epitopes may be used to guide the selection of vaccine immunogens. Perhaps one of the most immediate applications is in direct medical decisions concerning the matching of stem cell transplant donors to unrelated recipients. However, high-resolution HLA typing is frequently unavailable due to its high cost or the inability to re-type historical data. In this paper, we introduce and evaluate a method for statistical, in silico refinement of ambiguous and/or low-resolution HLA data. Our method, which requires an independent, high-resolution training data set drawn from the same population as the data to be refined, uses linkage disequilibrium in HLA haplotypes as well as four-digit allele frequency data to probabilistically refine HLA typings. Central to our approach is the use of haplotype inference. We introduce new methodology to this area, improving upon the Expectation-Maximization (EM)-based approaches currently used within the HLA community. Our improvements are achieved by using a parsimonious parameterization for haplotype distributions and by smoothing the maximum likelihood (ML) solution. These improvements make it possible to scale the refinement to a larger number of alleles and loci in a more computationally efficient and stable manner. We also show how to augment our method in order to incorporate ethnicity information (as HLA allele distributions vary widely according to race/ethnicity as well as geographic area), and demonstrate the potential utility of this experimentally. A tool based on our approach is freely available for research purposes at http://microsoft.com/science.
format Text
id pubmed-2289775
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-22897752008-04-08 Statistical Resolution of Ambiguous HLA Typing Data Listgarten, Jennifer Brumme, Zabrina Kadie, Carl Xiaojiang, Gao Walker, Bruce Carrington, Mary Goulder, Philip Heckerman, David PLoS Comput Biol Research Article High-resolution HLA typing plays a central role in many areas of immunology, such as in identifying immunogenetic risk factors for disease, in studying how the genomes of pathogens evolve in response to immune selection pressures, and also in vaccine design, where identification of HLA-restricted epitopes may be used to guide the selection of vaccine immunogens. Perhaps one of the most immediate applications is in direct medical decisions concerning the matching of stem cell transplant donors to unrelated recipients. However, high-resolution HLA typing is frequently unavailable due to its high cost or the inability to re-type historical data. In this paper, we introduce and evaluate a method for statistical, in silico refinement of ambiguous and/or low-resolution HLA data. Our method, which requires an independent, high-resolution training data set drawn from the same population as the data to be refined, uses linkage disequilibrium in HLA haplotypes as well as four-digit allele frequency data to probabilistically refine HLA typings. Central to our approach is the use of haplotype inference. We introduce new methodology to this area, improving upon the Expectation-Maximization (EM)-based approaches currently used within the HLA community. Our improvements are achieved by using a parsimonious parameterization for haplotype distributions and by smoothing the maximum likelihood (ML) solution. These improvements make it possible to scale the refinement to a larger number of alleles and loci in a more computationally efficient and stable manner. We also show how to augment our method in order to incorporate ethnicity information (as HLA allele distributions vary widely according to race/ethnicity as well as geographic area), and demonstrate the potential utility of this experimentally. A tool based on our approach is freely available for research purposes at http://microsoft.com/science. Public Library of Science 2008-02-29 /pmc/articles/PMC2289775/ /pubmed/18392148 http://dx.doi.org/10.1371/journal.pcbi.1000016 Text en Listgarten et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Listgarten, Jennifer
Brumme, Zabrina
Kadie, Carl
Xiaojiang, Gao
Walker, Bruce
Carrington, Mary
Goulder, Philip
Heckerman, David
Statistical Resolution of Ambiguous HLA Typing Data
title Statistical Resolution of Ambiguous HLA Typing Data
title_full Statistical Resolution of Ambiguous HLA Typing Data
title_fullStr Statistical Resolution of Ambiguous HLA Typing Data
title_full_unstemmed Statistical Resolution of Ambiguous HLA Typing Data
title_short Statistical Resolution of Ambiguous HLA Typing Data
title_sort statistical resolution of ambiguous hla typing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2289775/
https://www.ncbi.nlm.nih.gov/pubmed/18392148
http://dx.doi.org/10.1371/journal.pcbi.1000016
work_keys_str_mv AT listgartenjennifer statisticalresolutionofambiguoushlatypingdata
AT brummezabrina statisticalresolutionofambiguoushlatypingdata
AT kadiecarl statisticalresolutionofambiguoushlatypingdata
AT xiaojianggao statisticalresolutionofambiguoushlatypingdata
AT walkerbruce statisticalresolutionofambiguoushlatypingdata
AT carringtonmary statisticalresolutionofambiguoushlatypingdata
AT goulderphilip statisticalresolutionofambiguoushlatypingdata
AT heckermandavid statisticalresolutionofambiguoushlatypingdata