Cargando…
HLA-Clus: HLA class I clustering based on 3D structure
BACKGROUND: In a previous paper, we classified populated HLA class I alleles into supertypes and subtypes based on the similarity of 3D landscape of peptide binding grooves, using newly defined structure distance metric and hierarchical clustering approach. Compared to other approaches, our method a...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10169335/ https://www.ncbi.nlm.nih.gov/pubmed/37161375 http://dx.doi.org/10.1186/s12859-023-05297-x |
_version_ | 1785039033412354048 |
---|---|
author | Shen, Yue Parks, Jerry M. Smith, Jeremy C. |
author_facet | Shen, Yue Parks, Jerry M. Smith, Jeremy C. |
author_sort | Shen, Yue |
collection | PubMed |
description | BACKGROUND: In a previous paper, we classified populated HLA class I alleles into supertypes and subtypes based on the similarity of 3D landscape of peptide binding grooves, using newly defined structure distance metric and hierarchical clustering approach. Compared to other approaches, our method achieves higher correlation with peptide binding specificity, intra-cluster similarity (cohesion), and robustness. Here we introduce HLA-Clus, a Python package for clustering HLA Class I alleles using the method we developed recently and describe additional features including a new nearest neighbor clustering method that facilitates clustering based on user-defined criteria. RESULTS: The HLA-Clus pipeline includes three stages: First, HLA Class I structural models are coarse grained and transformed into clouds of labeled points. Second, similarities between alleles are determined using a newly defined structure distance metric that accounts for spatial and physicochemical similarities. Finally, alleles are clustered via hierarchical or nearest-neighbor approaches. We also interfaced HLA-Clus with the peptide:HLA affinity predictor MHCnuggets. By using the nearest neighbor clustering method to select optimal allele-specific deep learning models in MHCnuggets, the average accuracy of peptide binding prediction of rare alleles was improved. CONCLUSIONS: The HLA-Clus package offers a solution for characterizing the peptide binding specificities of a large number of HLA alleles. This method can be applied in HLA functional studies, such as the development of peptide affinity predictors, disease association studies, and HLA matching for grafting. HLA-Clus is freely available at our GitHub repository (https://github.com/yshen25/HLA-Clus). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05297-x. |
format | Online Article Text |
id | pubmed-10169335 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101693352023-05-11 HLA-Clus: HLA class I clustering based on 3D structure Shen, Yue Parks, Jerry M. Smith, Jeremy C. BMC Bioinformatics Software BACKGROUND: In a previous paper, we classified populated HLA class I alleles into supertypes and subtypes based on the similarity of 3D landscape of peptide binding grooves, using newly defined structure distance metric and hierarchical clustering approach. Compared to other approaches, our method achieves higher correlation with peptide binding specificity, intra-cluster similarity (cohesion), and robustness. Here we introduce HLA-Clus, a Python package for clustering HLA Class I alleles using the method we developed recently and describe additional features including a new nearest neighbor clustering method that facilitates clustering based on user-defined criteria. RESULTS: The HLA-Clus pipeline includes three stages: First, HLA Class I structural models are coarse grained and transformed into clouds of labeled points. Second, similarities between alleles are determined using a newly defined structure distance metric that accounts for spatial and physicochemical similarities. Finally, alleles are clustered via hierarchical or nearest-neighbor approaches. We also interfaced HLA-Clus with the peptide:HLA affinity predictor MHCnuggets. By using the nearest neighbor clustering method to select optimal allele-specific deep learning models in MHCnuggets, the average accuracy of peptide binding prediction of rare alleles was improved. CONCLUSIONS: The HLA-Clus package offers a solution for characterizing the peptide binding specificities of a large number of HLA alleles. This method can be applied in HLA functional studies, such as the development of peptide affinity predictors, disease association studies, and HLA matching for grafting. HLA-Clus is freely available at our GitHub repository (https://github.com/yshen25/HLA-Clus). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05297-x. BioMed Central 2023-05-09 /pmc/articles/PMC10169335/ /pubmed/37161375 http://dx.doi.org/10.1186/s12859-023-05297-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Shen, Yue Parks, Jerry M. Smith, Jeremy C. HLA-Clus: HLA class I clustering based on 3D structure |
title | HLA-Clus: HLA class I clustering based on 3D structure |
title_full | HLA-Clus: HLA class I clustering based on 3D structure |
title_fullStr | HLA-Clus: HLA class I clustering based on 3D structure |
title_full_unstemmed | HLA-Clus: HLA class I clustering based on 3D structure |
title_short | HLA-Clus: HLA class I clustering based on 3D structure |
title_sort | hla-clus: hla class i clustering based on 3d structure |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10169335/ https://www.ncbi.nlm.nih.gov/pubmed/37161375 http://dx.doi.org/10.1186/s12859-023-05297-x |
work_keys_str_mv | AT shenyue hlaclushlaclassiclusteringbasedon3dstructure AT parksjerrym hlaclushlaclassiclusteringbasedon3dstructure AT smithjeremyc hlaclushlaclassiclusteringbasedon3dstructure |