Cargando…
Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology
Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identificati...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4961832/ https://www.ncbi.nlm.nih.gov/pubmed/27478823 http://dx.doi.org/10.1155/2016/7604641 |
_version_ | 1782444715366416384 |
---|---|
author | Zhang, Jieru Ju, Ying Lu, Huijuan Xuan, Ping Zou, Quan |
author_facet | Zhang, Jieru Ju, Ying Lu, Huijuan Xuan, Ping Zou, Quan |
author_sort | Zhang, Jieru |
collection | PubMed |
description | Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identification methods, such as support vector machine and basic sequence features (n-gram), have also been used to identify cancerlectins. In this study, various protein fingerprint features and advanced classifiers, including ensemble learning techniques, were utilized to identify this group of proteins. We improved the prediction accuracy of the original feature extraction methods and classification algorithms by more than 10% on average. Our work provides a basis for the computational identification of cancerlectins and reveals the power of hybrid machine learning techniques in computational proteomics. |
format | Online Article Text |
id | pubmed-4961832 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-49618322016-07-31 Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology Zhang, Jieru Ju, Ying Lu, Huijuan Xuan, Ping Zou, Quan Int J Genomics Research Article Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identification methods, such as support vector machine and basic sequence features (n-gram), have also been used to identify cancerlectins. In this study, various protein fingerprint features and advanced classifiers, including ensemble learning techniques, were utilized to identify this group of proteins. We improved the prediction accuracy of the original feature extraction methods and classification algorithms by more than 10% on average. Our work provides a basis for the computational identification of cancerlectins and reveals the power of hybrid machine learning techniques in computational proteomics. Hindawi Publishing Corporation 2016 2016-07-13 /pmc/articles/PMC4961832/ /pubmed/27478823 http://dx.doi.org/10.1155/2016/7604641 Text en Copyright © 2016 Jieru Zhang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhang, Jieru Ju, Ying Lu, Huijuan Xuan, Ping Zou, Quan Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology |
title | Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology |
title_full | Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology |
title_fullStr | Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology |
title_full_unstemmed | Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology |
title_short | Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology |
title_sort | accurate identification of cancerlectins through hybrid machine learning technology |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4961832/ https://www.ncbi.nlm.nih.gov/pubmed/27478823 http://dx.doi.org/10.1155/2016/7604641 |
work_keys_str_mv | AT zhangjieru accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology AT juying accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology AT luhuijuan accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology AT xuanping accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology AT zouquan accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology |