Cargando…
Sequence-based predictive modeling to identify cancerlectins
Lectins are a diverse type of glycoproteins or carbohydrate-binding proteins that have a wide distribution to various species. They can specially identify and exclusively bind to a certain kind of saccharide groups. Cancerlectins are a group of lectins that are closely related to cancer and play a m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Impact Journals LLC
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5438640/ https://www.ncbi.nlm.nih.gov/pubmed/28423655 http://dx.doi.org/10.18632/oncotarget.15963 |
_version_ | 1783237808592257024 |
---|---|
author | Lai, Hong-Yan Chen, Xin-Xin Chen, Wei Tang, Hua Lin, Hao |
author_facet | Lai, Hong-Yan Chen, Xin-Xin Chen, Wei Tang, Hua Lin, Hao |
author_sort | Lai, Hong-Yan |
collection | PubMed |
description | Lectins are a diverse type of glycoproteins or carbohydrate-binding proteins that have a wide distribution to various species. They can specially identify and exclusively bind to a certain kind of saccharide groups. Cancerlectins are a group of lectins that are closely related to cancer and play a major role in the initiation, survival, growth, metastasis and spread of tumor. Several computational methods have emerged to discriminate cancerlectins from non-cancerlectins, which promote the study on pathogenic mechanisms and clinical treatment of cancer. However, the predictive accuracies of most of these techniques are very limited. In this work, by constructing a benchmark dataset based on the CancerLectinDB database, a new amino acid sequence-based strategy for feature description was developed, and then the binomial distribution was applied to screen the optimal feature set. Ultimately, an SVM-based predictor was performed to distinguish cancerlectins from non-cancerlectins, and achieved an accuracy of 77.48% with AUC of 85.52% in jackknife cross-validation. The results revealed that our prediction model could perform better comparing with published predictive tools. |
format | Online Article Text |
id | pubmed-5438640 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Impact Journals LLC |
record_format | MEDLINE/PubMed |
spelling | pubmed-54386402017-05-24 Sequence-based predictive modeling to identify cancerlectins Lai, Hong-Yan Chen, Xin-Xin Chen, Wei Tang, Hua Lin, Hao Oncotarget Research Paper Lectins are a diverse type of glycoproteins or carbohydrate-binding proteins that have a wide distribution to various species. They can specially identify and exclusively bind to a certain kind of saccharide groups. Cancerlectins are a group of lectins that are closely related to cancer and play a major role in the initiation, survival, growth, metastasis and spread of tumor. Several computational methods have emerged to discriminate cancerlectins from non-cancerlectins, which promote the study on pathogenic mechanisms and clinical treatment of cancer. However, the predictive accuracies of most of these techniques are very limited. In this work, by constructing a benchmark dataset based on the CancerLectinDB database, a new amino acid sequence-based strategy for feature description was developed, and then the binomial distribution was applied to screen the optimal feature set. Ultimately, an SVM-based predictor was performed to distinguish cancerlectins from non-cancerlectins, and achieved an accuracy of 77.48% with AUC of 85.52% in jackknife cross-validation. The results revealed that our prediction model could perform better comparing with published predictive tools. Impact Journals LLC 2017-03-07 /pmc/articles/PMC5438640/ /pubmed/28423655 http://dx.doi.org/10.18632/oncotarget.15963 Text en Copyright: © 2017 Lai et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/) (CC-BY), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Paper Lai, Hong-Yan Chen, Xin-Xin Chen, Wei Tang, Hua Lin, Hao Sequence-based predictive modeling to identify cancerlectins |
title | Sequence-based predictive modeling to identify cancerlectins |
title_full | Sequence-based predictive modeling to identify cancerlectins |
title_fullStr | Sequence-based predictive modeling to identify cancerlectins |
title_full_unstemmed | Sequence-based predictive modeling to identify cancerlectins |
title_short | Sequence-based predictive modeling to identify cancerlectins |
title_sort | sequence-based predictive modeling to identify cancerlectins |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5438640/ https://www.ncbi.nlm.nih.gov/pubmed/28423655 http://dx.doi.org/10.18632/oncotarget.15963 |
work_keys_str_mv | AT laihongyan sequencebasedpredictivemodelingtoidentifycancerlectins AT chenxinxin sequencebasedpredictivemodelingtoidentifycancerlectins AT chenwei sequencebasedpredictivemodelingtoidentifycancerlectins AT tanghua sequencebasedpredictivemodelingtoidentifycancerlectins AT linhao sequencebasedpredictivemodelingtoidentifycancerlectins |