Cargando…
Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model
The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve t...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9326258/ https://www.ncbi.nlm.nih.gov/pubmed/35910197 http://dx.doi.org/10.3389/fgene.2022.960388 |
_version_ | 1784757243303952384 |
---|---|
author | Fan, Rui Suo, Bing Ding, Yijie |
author_facet | Fan, Rui Suo, Bing Ding, Yijie |
author_sort | Fan, Rui |
collection | PubMed |
description | The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve the prediction performance, an approach that is time-consuming and costly. In this study, we leveraged traditional features and machine learning classifiers to boost the performance of vesicle transport protein identification and make the prediction process faster. We adopt the pseudo position-specific scoring matrix (PsePSSM) feature and our proposed new classifier hypergraph regularized k-local hyperplane distance nearest neighbour (HG-HKNN) to classify vesicular transport proteins. We address dataset imbalances with random undersampling. The results show that our strategy has an area under the receiver operating characteristic curve (AUC) of 0.870 and a Matthews correlation coefficient (MCC) of 0.53 on the benchmark dataset, outperforming all state-of-the-art methods on the same dataset, and other metrics of our model are also comparable to existing methods. |
format | Online Article Text |
id | pubmed-9326258 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93262582022-07-28 Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model Fan, Rui Suo, Bing Ding, Yijie Front Genet Genetics The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve the prediction performance, an approach that is time-consuming and costly. In this study, we leveraged traditional features and machine learning classifiers to boost the performance of vesicle transport protein identification and make the prediction process faster. We adopt the pseudo position-specific scoring matrix (PsePSSM) feature and our proposed new classifier hypergraph regularized k-local hyperplane distance nearest neighbour (HG-HKNN) to classify vesicular transport proteins. We address dataset imbalances with random undersampling. The results show that our strategy has an area under the receiver operating characteristic curve (AUC) of 0.870 and a Matthews correlation coefficient (MCC) of 0.53 on the benchmark dataset, outperforming all state-of-the-art methods on the same dataset, and other metrics of our model are also comparable to existing methods. Frontiers Media S.A. 2022-07-13 /pmc/articles/PMC9326258/ /pubmed/35910197 http://dx.doi.org/10.3389/fgene.2022.960388 Text en Copyright © 2022 Fan, Suo and Ding. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Fan, Rui Suo, Bing Ding, Yijie Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model |
title | Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model |
title_full | Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model |
title_fullStr | Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model |
title_full_unstemmed | Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model |
title_short | Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model |
title_sort | identification of vesicle transport proteins via hypergraph regularized k-local hyperplane distance nearest neighbour model |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9326258/ https://www.ncbi.nlm.nih.gov/pubmed/35910197 http://dx.doi.org/10.3389/fgene.2022.960388 |
work_keys_str_mv | AT fanrui identificationofvesicletransportproteinsviahypergraphregularizedklocalhyperplanedistancenearestneighbourmodel AT suobing identificationofvesicletransportproteinsviahypergraphregularizedklocalhyperplanedistancenearestneighbourmodel AT dingyijie identificationofvesicletransportproteinsviahypergraphregularizedklocalhyperplanedistancenearestneighbourmodel |