Cargando…

iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features

DNA-binding proteins play a very important role in the structural composition of the DNA. In addition, they regulate and effect various cellular processes like transcription, DNA replication, DNA recombination, repair and modification. The experimental methods used to identify DNA-binding proteins a...

Descripción completa

Detalles Bibliográficos
Autores principales: Chowdhury, Shahana Yasmin, Shatabda, Swakkhar, Dehzangi, Abdollah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668250/
https://www.ncbi.nlm.nih.gov/pubmed/29097781
http://dx.doi.org/10.1038/s41598-017-14945-1
_version_ 1783275632408395776
author Chowdhury, Shahana Yasmin
Shatabda, Swakkhar
Dehzangi, Abdollah
author_facet Chowdhury, Shahana Yasmin
Shatabda, Swakkhar
Dehzangi, Abdollah
author_sort Chowdhury, Shahana Yasmin
collection PubMed
description DNA-binding proteins play a very important role in the structural composition of the DNA. In addition, they regulate and effect various cellular processes like transcription, DNA replication, DNA recombination, repair and modification. The experimental methods used to identify DNA-binding proteins are expensive and time consuming and thus attracted researchers from computational field to address the problem. In this paper, we present iDNAProt-ES, a DNA-binding protein prediction method that utilizes both sequence based evolutionary and structure based features of proteins to identify their DNA-binding functionality. We used recursive feature elimination to extract an optimal set of features and train them using Support Vector Machine (SVM) with linear kernel to select the final model. Our proposed method significantly outperforms the existing state-of-the-art predictors on standard benchmark dataset. The accuracy of the predictor is 90.18% using jack knife test and 88.87% using 10-fold cross validation on the benchmark dataset. The accuracy of the predictor on the independent dataset is 80.64% which is also significantly better than the state-of-the-art methods. iDNAProt-ES is a novel prediction method that uses evolutionary and structural based features. We believe the superior performance of iDNAProt-ES will motivate the researchers to use this method to identify DNA-binding proteins. iDNAProt-ES is publicly available as a web server at: http://brl.uiu.ac.bd/iDNAProt-ES/.
format Online
Article
Text
id pubmed-5668250
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-56682502017-11-08 iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features Chowdhury, Shahana Yasmin Shatabda, Swakkhar Dehzangi, Abdollah Sci Rep Article DNA-binding proteins play a very important role in the structural composition of the DNA. In addition, they regulate and effect various cellular processes like transcription, DNA replication, DNA recombination, repair and modification. The experimental methods used to identify DNA-binding proteins are expensive and time consuming and thus attracted researchers from computational field to address the problem. In this paper, we present iDNAProt-ES, a DNA-binding protein prediction method that utilizes both sequence based evolutionary and structure based features of proteins to identify their DNA-binding functionality. We used recursive feature elimination to extract an optimal set of features and train them using Support Vector Machine (SVM) with linear kernel to select the final model. Our proposed method significantly outperforms the existing state-of-the-art predictors on standard benchmark dataset. The accuracy of the predictor is 90.18% using jack knife test and 88.87% using 10-fold cross validation on the benchmark dataset. The accuracy of the predictor on the independent dataset is 80.64% which is also significantly better than the state-of-the-art methods. iDNAProt-ES is a novel prediction method that uses evolutionary and structural based features. We believe the superior performance of iDNAProt-ES will motivate the researchers to use this method to identify DNA-binding proteins. iDNAProt-ES is publicly available as a web server at: http://brl.uiu.ac.bd/iDNAProt-ES/. Nature Publishing Group UK 2017-11-02 /pmc/articles/PMC5668250/ /pubmed/29097781 http://dx.doi.org/10.1038/s41598-017-14945-1 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Chowdhury, Shahana Yasmin
Shatabda, Swakkhar
Dehzangi, Abdollah
iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
title iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
title_full iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
title_fullStr iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
title_full_unstemmed iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
title_short iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
title_sort idnaprot-es: identification of dna-binding proteins using evolutionary and structural features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668250/
https://www.ncbi.nlm.nih.gov/pubmed/29097781
http://dx.doi.org/10.1038/s41598-017-14945-1
work_keys_str_mv AT chowdhuryshahanayasmin idnaprotesidentificationofdnabindingproteinsusingevolutionaryandstructuralfeatures
AT shatabdaswakkhar idnaprotesidentificationofdnabindingproteinsusingevolutionaryandstructuralfeatures
AT dehzangiabdollah idnaprotesidentificationofdnabindingproteinsusingevolutionaryandstructuralfeatures