Cargando…

Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology

Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identificati...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jieru, Ju, Ying, Lu, Huijuan, Xuan, Ping, Zou, Quan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4961832/
https://www.ncbi.nlm.nih.gov/pubmed/27478823
http://dx.doi.org/10.1155/2016/7604641
_version_ 1782444715366416384
author Zhang, Jieru
Ju, Ying
Lu, Huijuan
Xuan, Ping
Zou, Quan
author_facet Zhang, Jieru
Ju, Ying
Lu, Huijuan
Xuan, Ping
Zou, Quan
author_sort Zhang, Jieru
collection PubMed
description Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identification methods, such as support vector machine and basic sequence features (n-gram), have also been used to identify cancerlectins. In this study, various protein fingerprint features and advanced classifiers, including ensemble learning techniques, were utilized to identify this group of proteins. We improved the prediction accuracy of the original feature extraction methods and classification algorithms by more than 10% on average. Our work provides a basis for the computational identification of cancerlectins and reveals the power of hybrid machine learning techniques in computational proteomics.
format Online
Article
Text
id pubmed-4961832
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-49618322016-07-31 Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology Zhang, Jieru Ju, Ying Lu, Huijuan Xuan, Ping Zou, Quan Int J Genomics Research Article Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identification methods, such as support vector machine and basic sequence features (n-gram), have also been used to identify cancerlectins. In this study, various protein fingerprint features and advanced classifiers, including ensemble learning techniques, were utilized to identify this group of proteins. We improved the prediction accuracy of the original feature extraction methods and classification algorithms by more than 10% on average. Our work provides a basis for the computational identification of cancerlectins and reveals the power of hybrid machine learning techniques in computational proteomics. Hindawi Publishing Corporation 2016 2016-07-13 /pmc/articles/PMC4961832/ /pubmed/27478823 http://dx.doi.org/10.1155/2016/7604641 Text en Copyright © 2016 Jieru Zhang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Jieru
Ju, Ying
Lu, Huijuan
Xuan, Ping
Zou, Quan
Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology
title Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology
title_full Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology
title_fullStr Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology
title_full_unstemmed Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology
title_short Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology
title_sort accurate identification of cancerlectins through hybrid machine learning technology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4961832/
https://www.ncbi.nlm.nih.gov/pubmed/27478823
http://dx.doi.org/10.1155/2016/7604641
work_keys_str_mv AT zhangjieru accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology
AT juying accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology
AT luhuijuan accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology
AT xuanping accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology
AT zouquan accurateidentificationofcancerlectinsthroughhybridmachinelearningtechnology