PredMHC: An Effective Predictor of Major Histocompatibility Complex Using Mixed Features
The major histocompatibility complex (MHC) is a large locus on vertebrate DNA that contains a tightly linked set of polymorphic genes encoding cell surface proteins essential for the adaptive immune system. The groups of proteins encoded in the MHC play an important role in the adaptive immune syste...
Main authors: | Chen, Dong; Li, Yanjuan |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2022 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9081368/ https://www.ncbi.nlm.nih.gov/pubmed/35547252 http://dx.doi.org/10.3389/fgene.2022.875112 |
_version_ | 1784702970623950848 |
---|---|
author | Chen, Dong; Li, Yanjuan |
author_facet | Chen, Dong; Li, Yanjuan |
author_sort | Chen, Dong |
collection | PubMed |
description | The major histocompatibility complex (MHC) is a large locus on vertebrate DNA that contains a tightly linked set of polymorphic genes encoding cell surface proteins essential for the adaptive immune system. Because the proteins encoded in the MHC play an important role in adaptive immunity, accurate identification of the MHC is necessary to understand that role. This study establishes an effective predictor, PredMHC, to identify the MHC from protein sequences. First, PredMHC encodes a protein sequence with mixed features including 188D, APAAC, KSCTriad, CKSAAGP, and PAAC. Second, three classifiers (SGD, SMO, and random forest) are trained on the mixed features of the protein sequence. Finally, the prediction is obtained by voting among the three classifiers. In a 10-fold cross-validation test on the training dataset, PredMHC achieved 91.69% accuracy. Comparisons with other features, classifiers, and existing methods showed the effectiveness of PredMHC in predicting the MHC. |
format | Online Article Text |
id | pubmed-9081368 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-9081368 2022-05-10 Chen, Dong; Li, Yanjuan. Front Genet. Genetics. Frontiers Media S.A. 2022-04-25 /pmc/articles/PMC9081368/ /pubmed/35547252 http://dx.doi.org/10.3389/fgene.2022.875112 Text en Copyright © 2022 Chen and Li. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
title | PredMHC: An Effective Predictor of Major Histocompatibility Complex Using Mixed Features |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9081368/ https://www.ncbi.nlm.nih.gov/pubmed/35547252 http://dx.doi.org/10.3389/fgene.2022.875112 |
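
The pipeline summarized in the description (encode each sequence with several descriptors, train three classifiers, combine them by voting, evaluate with 10-fold cross-validation) can be illustrated with a minimal, standard-library-only sketch. Everything below is an assumption made for illustration and is not the authors' code: amino-acid composition stands in for the real descriptors (188D, APAAC, KSCTriad, CKSAAGP, PAAC are not implemented here), the classifiers themselves are left out, and the helper names `amino_acid_composition`, `mixed_features`, `majority_vote`, and `k_fold_indices` are hypothetical.

```python
from collections import Counter
from typing import Callable, Iterator, List, Sequence, Tuple

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def amino_acid_composition(seq: str) -> List[float]:
    """Fraction of each standard residue; a simple stand-in descriptor."""
    counts = Counter(seq.upper())
    total = sum(counts[a] for a in AMINO_ACIDS) or 1  # avoid division by zero
    return [counts[a] / total for a in AMINO_ACIDS]


def mixed_features(seq: str,
                   encoders: Sequence[Callable[[str], List[float]]]) -> List[float]:
    """Concatenate the outputs of several encoders into one feature vector."""
    features: List[float] = []
    for encode in encoders:
        features.extend(encode(seq))
    return features


def majority_vote(predictions: Sequence[int]) -> int:
    """Hard voting: return the label predicted by most classifiers."""
    return Counter(predictions).most_common(1)[0][0]


def k_fold_indices(n_samples: int, k: int = 10) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train, test) index lists for k-fold cross-validation (no shuffling)."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        stop = start + size
        test = list(range(start, stop))
        train = [i for i in range(n_samples) if i < start or i >= stop]
        yield train, test
        start = stop
```

With three binary classifiers a tie is impossible, so `majority_vote` always returns a well-defined label; in a fuller version each fold from `k_fold_indices` would be used to fit the three models on the training indices and vote on the held-out indices.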