Cargando…

FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule

DNA-binding proteins play an important role in cell metabolism. In biological laboratories, the detection methods of DNA-binding proteins includes yeast one-hybrid methods, bacterial singles and X-ray crystallography methods and others, but these methods involve a lot of labor, material and time. In...

Descripción completa

Detalles Bibliográficos
Autores principales: Zou, Yi, Ding, Yijie, Tang, Jijun, Guo, Fei, Peng, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6747228/
https://www.ncbi.nlm.nih.gov/pubmed/31454964
http://dx.doi.org/10.3390/ijms20174175
_version_ 1783451853374095360
author Zou, Yi
Ding, Yijie
Tang, Jijun
Guo, Fei
Peng, Li
author_facet Zou, Yi
Ding, Yijie
Tang, Jijun
Guo, Fei
Peng, Li
author_sort Zou, Yi
collection PubMed
description DNA-binding proteins play an important role in cell metabolism. In biological laboratories, the detection methods of DNA-binding proteins includes yeast one-hybrid methods, bacterial singles and X-ray crystallography methods and others, but these methods involve a lot of labor, material and time. In recent years, many computation-based approachs have been proposed to detect DNA-binding proteins. In this paper, a machine learning-based method, which is called the Fuzzy Kernel Ridge Regression model based on Multi-View Sequence Features (FKRR-MVSF), is proposed to identifying DNA-binding proteins. First of all, multi-view sequence features are extracted from protein sequences. Next, a Multiple Kernel Learning (MKL) algorithm is employed to combine multiple features. Finally, a Fuzzy Kernel Ridge Regression (FKRR) model is built to detect DNA-binding proteins. Compared with other methods, our model achieves good results. Our method obtains an accuracy of 83.26% and 81.72% on two benchmark datasets (PDB1075 and compared with PDB186), respectively.
format Online
Article
Text
id pubmed-6747228
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-67472282019-09-27 FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule Zou, Yi Ding, Yijie Tang, Jijun Guo, Fei Peng, Li Int J Mol Sci Article DNA-binding proteins play an important role in cell metabolism. In biological laboratories, the detection methods of DNA-binding proteins includes yeast one-hybrid methods, bacterial singles and X-ray crystallography methods and others, but these methods involve a lot of labor, material and time. In recent years, many computation-based approachs have been proposed to detect DNA-binding proteins. In this paper, a machine learning-based method, which is called the Fuzzy Kernel Ridge Regression model based on Multi-View Sequence Features (FKRR-MVSF), is proposed to identifying DNA-binding proteins. First of all, multi-view sequence features are extracted from protein sequences. Next, a Multiple Kernel Learning (MKL) algorithm is employed to combine multiple features. Finally, a Fuzzy Kernel Ridge Regression (FKRR) model is built to detect DNA-binding proteins. Compared with other methods, our model achieves good results. Our method obtains an accuracy of 83.26% and 81.72% on two benchmark datasets (PDB1075 and compared with PDB186), respectively. MDPI 2019-08-26 /pmc/articles/PMC6747228/ /pubmed/31454964 http://dx.doi.org/10.3390/ijms20174175 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zou, Yi
Ding, Yijie
Tang, Jijun
Guo, Fei
Peng, Li
FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule
title FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule
title_full FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule
title_fullStr FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule
title_full_unstemmed FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule
title_short FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou’s Five-Step Rule
title_sort fkrr-mvsf: a fuzzy kernel ridge regression model for identifying dna-binding proteins by multi-view sequence features via chou’s five-step rule
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6747228/
https://www.ncbi.nlm.nih.gov/pubmed/31454964
http://dx.doi.org/10.3390/ijms20174175
work_keys_str_mv AT zouyi fkrrmvsfafuzzykernelridgeregressionmodelforidentifyingdnabindingproteinsbymultiviewsequencefeaturesviachousfivesteprule
AT dingyijie fkrrmvsfafuzzykernelridgeregressionmodelforidentifyingdnabindingproteinsbymultiviewsequencefeaturesviachousfivesteprule
AT tangjijun fkrrmvsfafuzzykernelridgeregressionmodelforidentifyingdnabindingproteinsbymultiviewsequencefeaturesviachousfivesteprule
AT guofei fkrrmvsfafuzzykernelridgeregressionmodelforidentifyingdnabindingproteinsbymultiviewsequencefeaturesviachousfivesteprule
AT pengli fkrrmvsfafuzzykernelridgeregressionmodelforidentifyingdnabindingproteinsbymultiviewsequencefeaturesviachousfivesteprule