Cargando…

HLA-Clus: HLA class I clustering based on 3D structure

BACKGROUND: In a previous paper, we classified populated HLA class I alleles into supertypes and subtypes based on the similarity of 3D landscape of peptide binding grooves, using newly defined structure distance metric and hierarchical clustering approach. Compared to other approaches, our method a...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Yue, Parks, Jerry M., Smith, Jeremy C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10169335/
https://www.ncbi.nlm.nih.gov/pubmed/37161375
http://dx.doi.org/10.1186/s12859-023-05297-x
_version_ 1785039033412354048
author Shen, Yue
Parks, Jerry M.
Smith, Jeremy C.
author_facet Shen, Yue
Parks, Jerry M.
Smith, Jeremy C.
author_sort Shen, Yue
collection PubMed
description BACKGROUND: In a previous paper, we classified populated HLA class I alleles into supertypes and subtypes based on the similarity of 3D landscape of peptide binding grooves, using newly defined structure distance metric and hierarchical clustering approach. Compared to other approaches, our method achieves higher correlation with peptide binding specificity, intra-cluster similarity (cohesion), and robustness. Here we introduce HLA-Clus, a Python package for clustering HLA Class I alleles using the method we developed recently and describe additional features including a new nearest neighbor clustering method that facilitates clustering based on user-defined criteria. RESULTS: The HLA-Clus pipeline includes three stages: First, HLA Class I structural models are coarse grained and transformed into clouds of labeled points. Second, similarities between alleles are determined using a newly defined structure distance metric that accounts for spatial and physicochemical similarities. Finally, alleles are clustered via hierarchical or nearest-neighbor approaches. We also interfaced HLA-Clus with the peptide:HLA affinity predictor MHCnuggets. By using the nearest neighbor clustering method to select optimal allele-specific deep learning models in MHCnuggets, the average accuracy of peptide binding prediction of rare alleles was improved. CONCLUSIONS: The HLA-Clus package offers a solution for characterizing the peptide binding specificities of a large number of HLA alleles. This method can be applied in HLA functional studies, such as the development of peptide affinity predictors, disease association studies, and HLA matching for grafting. HLA-Clus is freely available at our GitHub repository (https://github.com/yshen25/HLA-Clus). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05297-x.
format Online
Article
Text
id pubmed-10169335
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101693352023-05-11 HLA-Clus: HLA class I clustering based on 3D structure Shen, Yue Parks, Jerry M. Smith, Jeremy C. BMC Bioinformatics Software BACKGROUND: In a previous paper, we classified populated HLA class I alleles into supertypes and subtypes based on the similarity of 3D landscape of peptide binding grooves, using newly defined structure distance metric and hierarchical clustering approach. Compared to other approaches, our method achieves higher correlation with peptide binding specificity, intra-cluster similarity (cohesion), and robustness. Here we introduce HLA-Clus, a Python package for clustering HLA Class I alleles using the method we developed recently and describe additional features including a new nearest neighbor clustering method that facilitates clustering based on user-defined criteria. RESULTS: The HLA-Clus pipeline includes three stages: First, HLA Class I structural models are coarse grained and transformed into clouds of labeled points. Second, similarities between alleles are determined using a newly defined structure distance metric that accounts for spatial and physicochemical similarities. Finally, alleles are clustered via hierarchical or nearest-neighbor approaches. We also interfaced HLA-Clus with the peptide:HLA affinity predictor MHCnuggets. By using the nearest neighbor clustering method to select optimal allele-specific deep learning models in MHCnuggets, the average accuracy of peptide binding prediction of rare alleles was improved. CONCLUSIONS: The HLA-Clus package offers a solution for characterizing the peptide binding specificities of a large number of HLA alleles. This method can be applied in HLA functional studies, such as the development of peptide affinity predictors, disease association studies, and HLA matching for grafting. HLA-Clus is freely available at our GitHub repository (https://github.com/yshen25/HLA-Clus). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05297-x. BioMed Central 2023-05-09 /pmc/articles/PMC10169335/ /pubmed/37161375 http://dx.doi.org/10.1186/s12859-023-05297-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Shen, Yue
Parks, Jerry M.
Smith, Jeremy C.
HLA-Clus: HLA class I clustering based on 3D structure
title HLA-Clus: HLA class I clustering based on 3D structure
title_full HLA-Clus: HLA class I clustering based on 3D structure
title_fullStr HLA-Clus: HLA class I clustering based on 3D structure
title_full_unstemmed HLA-Clus: HLA class I clustering based on 3D structure
title_short HLA-Clus: HLA class I clustering based on 3D structure
title_sort hla-clus: hla class i clustering based on 3d structure
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10169335/
https://www.ncbi.nlm.nih.gov/pubmed/37161375
http://dx.doi.org/10.1186/s12859-023-05297-x
work_keys_str_mv AT shenyue hlaclushlaclassiclusteringbasedon3dstructure
AT parksjerrym hlaclushlaclassiclusteringbasedon3dstructure
AT smithjeremyc hlaclushlaclassiclusteringbasedon3dstructure