Cargando…

Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model

The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve t...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Rui, Suo, Bing, Ding, Yijie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9326258/
https://www.ncbi.nlm.nih.gov/pubmed/35910197
http://dx.doi.org/10.3389/fgene.2022.960388
_version_ 1784757243303952384
author Fan, Rui
Suo, Bing
Ding, Yijie
author_facet Fan, Rui
Suo, Bing
Ding, Yijie
author_sort Fan, Rui
collection PubMed
description The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve the prediction performance, an approach that is time-consuming and costly. In this study, we leveraged traditional features and machine learning classifiers to boost the performance of vesicle transport protein identification and make the prediction process faster. We adopt the pseudo position-specific scoring matrix (PsePSSM) feature and our proposed new classifier hypergraph regularized k-local hyperplane distance nearest neighbour (HG-HKNN) to classify vesicular transport proteins. We address dataset imbalances with random undersampling. The results show that our strategy has an area under the receiver operating characteristic curve (AUC) of 0.870 and a Matthews correlation coefficient (MCC) of 0.53 on the benchmark dataset, outperforming all state-of-the-art methods on the same dataset, and other metrics of our model are also comparable to existing methods.
format Online
Article
Text
id pubmed-9326258
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93262582022-07-28 Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model Fan, Rui Suo, Bing Ding, Yijie Front Genet Genetics The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve the prediction performance, an approach that is time-consuming and costly. In this study, we leveraged traditional features and machine learning classifiers to boost the performance of vesicle transport protein identification and make the prediction process faster. We adopt the pseudo position-specific scoring matrix (PsePSSM) feature and our proposed new classifier hypergraph regularized k-local hyperplane distance nearest neighbour (HG-HKNN) to classify vesicular transport proteins. We address dataset imbalances with random undersampling. The results show that our strategy has an area under the receiver operating characteristic curve (AUC) of 0.870 and a Matthews correlation coefficient (MCC) of 0.53 on the benchmark dataset, outperforming all state-of-the-art methods on the same dataset, and other metrics of our model are also comparable to existing methods. Frontiers Media S.A. 2022-07-13 /pmc/articles/PMC9326258/ /pubmed/35910197 http://dx.doi.org/10.3389/fgene.2022.960388 Text en Copyright © 2022 Fan, Suo and Ding. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Fan, Rui
Suo, Bing
Ding, Yijie
Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model
title Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model
title_full Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model
title_fullStr Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model
title_full_unstemmed Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model
title_short Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model
title_sort identification of vesicle transport proteins via hypergraph regularized k-local hyperplane distance nearest neighbour model
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9326258/
https://www.ncbi.nlm.nih.gov/pubmed/35910197
http://dx.doi.org/10.3389/fgene.2022.960388
work_keys_str_mv AT fanrui identificationofvesicletransportproteinsviahypergraphregularizedklocalhyperplanedistancenearestneighbourmodel
AT suobing identificationofvesicletransportproteinsviahypergraphregularizedklocalhyperplanedistancenearestneighbourmodel
AT dingyijie identificationofvesicletransportproteinsviahypergraphregularizedklocalhyperplanedistancenearestneighbourmodel