Cargando…
Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs). We firs...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2948896/ https://www.ncbi.nlm.nih.gov/pubmed/20936154 http://dx.doi.org/10.1155/2010/289301 |
_version_ | 1782187501927006208 |
---|---|
author | Someya, Seizi Kakuta, Masanori Morita, Mizuki Sumikoshi, Kazuya Cao, Wei Ge, Zhenyi Hirose, Osamu Nakamura, Shugo Terada, Tohru Shimizu, Kentaro |
author_facet | Someya, Seizi Kakuta, Masanori Morita, Mizuki Sumikoshi, Kazuya Cao, Wei Ge, Zhenyi Hirose, Osamu Nakamura, Shugo Terada, Tohru Shimizu, Kentaro |
author_sort | Someya, Seizi |
collection | PubMed |
description | Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs). We first clarified the definition of carbohydrate-binding proteins and then constructed positive and negative datasets with which the SVMs were trained. By applying the leave-one-out test to these datasets, our method delivered 0.92 of the area under the receiver operating characteristic (ROC) curve. We also examined two amino acid grouping methods that enable effective learning of sequence patterns and evaluated the performance of these methods. When we applied our method in combination with the homology-based prediction method to the annotated human genome database, H-invDB, we found that the true positive rate of prediction was improved. |
format | Text |
id | pubmed-2948896 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-29488962010-10-08 Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines Someya, Seizi Kakuta, Masanori Morita, Mizuki Sumikoshi, Kazuya Cao, Wei Ge, Zhenyi Hirose, Osamu Nakamura, Shugo Terada, Tohru Shimizu, Kentaro Adv Bioinformatics Research Article Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs). We first clarified the definition of carbohydrate-binding proteins and then constructed positive and negative datasets with which the SVMs were trained. By applying the leave-one-out test to these datasets, our method delivered 0.92 of the area under the receiver operating characteristic (ROC) curve. We also examined two amino acid grouping methods that enable effective learning of sequence patterns and evaluated the performance of these methods. When we applied our method in combination with the homology-based prediction method to the annotated human genome database, H-invDB, we found that the true positive rate of prediction was improved. Hindawi Publishing Corporation 2010 2010-09-27 /pmc/articles/PMC2948896/ /pubmed/20936154 http://dx.doi.org/10.1155/2010/289301 Text en Copyright © 2010 Seizi Someya et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Someya, Seizi Kakuta, Masanori Morita, Mizuki Sumikoshi, Kazuya Cao, Wei Ge, Zhenyi Hirose, Osamu Nakamura, Shugo Terada, Tohru Shimizu, Kentaro Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines |
title | Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines |
title_full | Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines |
title_fullStr | Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines |
title_full_unstemmed | Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines |
title_short | Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines |
title_sort | prediction of carbohydrate-binding proteins from sequences using support vector machines |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2948896/ https://www.ncbi.nlm.nih.gov/pubmed/20936154 http://dx.doi.org/10.1155/2010/289301 |
work_keys_str_mv | AT someyaseizi predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT kakutamasanori predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT moritamizuki predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT sumikoshikazuya predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT caowei predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT gezhenyi predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT hiroseosamu predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT nakamurashugo predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT teradatohru predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines AT shimizukentaro predictionofcarbohydratebindingproteinsfromsequencesusingsupportvectormachines |