Cargando…

Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines

Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs). We firs...

Descripción completa

Detalles Bibliográficos
Autores principales: Someya, Seizi, Kakuta, Masanori, Morita, Mizuki, Sumikoshi, Kazuya, Cao, Wei, Ge, Zhenyi, Hirose, Osamu, Nakamura, Shugo, Terada, Tohru, Shimizu, Kentaro
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2948896/
https://www.ncbi.nlm.nih.gov/pubmed/20936154
http://dx.doi.org/10.1155/2010/289301
_version_ 1782187501927006208
author Someya, Seizi
Kakuta, Masanori
Morita, Mizuki
Sumikoshi, Kazuya
Cao, Wei
Ge, Zhenyi
Hirose, Osamu
Nakamura, Shugo
Terada, Tohru
Shimizu, Kentaro
author_facet Someya, Seizi
Kakuta, Masanori
Morita, Mizuki
Sumikoshi, Kazuya
Cao, Wei
Ge, Zhenyi
Hirose, Osamu
Nakamura, Shugo
Terada, Tohru
Shimizu, Kentaro
author_sort Someya, Seizi
collection PubMed
description Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs). We first clarified the definition of carbohydrate-binding proteins and then constructed positive and negative datasets with which the SVMs were trained. By applying the leave-one-out test to these datasets, our method delivered 0.92 of the area under the receiver operating characteristic (ROC) curve. We also examined two amino acid grouping methods that enable effective learning of sequence patterns and evaluated the performance of these methods. When we applied our method in combination with the homology-based prediction method to the annotated human genome database, H-invDB, we found that the true positive rate of prediction was improved.
format Text
id pubmed-2948896
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-29488962010-10-08 Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines Someya, Seizi Kakuta, Masanori Morita, Mizuki Sumikoshi, Kazuya Cao, Wei Ge, Zhenyi Hirose, Osamu Nakamura, Shugo Terada, Tohru Shimizu, Kentaro Adv Bioinformatics Research Article Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs). We first clarified the definition of carbohydrate-binding proteins and then constructed positive and negative datasets with which the SVMs were trained. By applying the leave-one-out test to these datasets, our method delivered 0.92 of the area under the receiver operating characteristic (ROC) curve. We also examined two amino acid grouping methods that enable effective learning of sequence patterns and evaluated the performance of these methods. When we applied our method in combination with the homology-based prediction method to the annotated human genome database, H-invDB, we found that the true positive rate of prediction was improved. Hindawi Publishing Corporation 2010 2010-09-27 /pmc/articles/PMC2948896/ /pubmed/20936154 http://dx.doi.org/10.1155/2010/289301 Text en Copyright © 2010 Seizi Someya et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Someya, Seizi
Kakuta, Masanori
Morita, Mizuki
Sumikoshi, Kazuya
Cao, Wei
Ge, Zhenyi
Hirose, Osamu
Nakamura, Shugo
Terada, Tohru
Shimizu, Kentaro
Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
title Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
title_full Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
title_fullStr Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
title_full_unstemmed Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
title_short Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
title_sort prediction of carbohydrate-binding proteins from sequences using support vector machines
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2948896/
https://www.ncbi.nlm.nih.gov/pubmed/20936154
http://dx.doi.org/10.1155/2010/289301
work_keys_str_mv AT someyaseizi predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT kakutamasanori predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT moritamizuki predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT sumikoshikazuya predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT caowei predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT gezhenyi predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT hiroseosamu predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT nakamurashugo predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT teradatohru predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines
AT shimizukentaro predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines