Cargando…
iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features
DNA-binding proteins play a very important role in the structural composition of the DNA. In addition, they regulate and effect various cellular processes like transcription, DNA replication, DNA recombination, repair and modification. The experimental methods used to identify DNA-binding proteins a...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668250/ https://www.ncbi.nlm.nih.gov/pubmed/29097781 http://dx.doi.org/10.1038/s41598-017-14945-1 |
_version_ | 1783275632408395776 |
---|---|
author | Chowdhury, Shahana Yasmin Shatabda, Swakkhar Dehzangi, Abdollah |
author_facet | Chowdhury, Shahana Yasmin Shatabda, Swakkhar Dehzangi, Abdollah |
author_sort | Chowdhury, Shahana Yasmin |
collection | PubMed |
description | DNA-binding proteins play a very important role in the structural composition of the DNA. In addition, they regulate and effect various cellular processes like transcription, DNA replication, DNA recombination, repair and modification. The experimental methods used to identify DNA-binding proteins are expensive and time consuming and thus attracted researchers from computational field to address the problem. In this paper, we present iDNAProt-ES, a DNA-binding protein prediction method that utilizes both sequence based evolutionary and structure based features of proteins to identify their DNA-binding functionality. We used recursive feature elimination to extract an optimal set of features and train them using Support Vector Machine (SVM) with linear kernel to select the final model. Our proposed method significantly outperforms the existing state-of-the-art predictors on standard benchmark dataset. The accuracy of the predictor is 90.18% using jack knife test and 88.87% using 10-fold cross validation on the benchmark dataset. The accuracy of the predictor on the independent dataset is 80.64% which is also significantly better than the state-of-the-art methods. iDNAProt-ES is a novel prediction method that uses evolutionary and structural based features. We believe the superior performance of iDNAProt-ES will motivate the researchers to use this method to identify DNA-binding proteins. iDNAProt-ES is publicly available as a web server at: http://brl.uiu.ac.bd/iDNAProt-ES/. |
format | Online Article Text |
id | pubmed-5668250 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-56682502017-11-08 iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features Chowdhury, Shahana Yasmin Shatabda, Swakkhar Dehzangi, Abdollah Sci Rep Article DNA-binding proteins play a very important role in the structural composition of the DNA. In addition, they regulate and effect various cellular processes like transcription, DNA replication, DNA recombination, repair and modification. The experimental methods used to identify DNA-binding proteins are expensive and time consuming and thus attracted researchers from computational field to address the problem. In this paper, we present iDNAProt-ES, a DNA-binding protein prediction method that utilizes both sequence based evolutionary and structure based features of proteins to identify their DNA-binding functionality. We used recursive feature elimination to extract an optimal set of features and train them using Support Vector Machine (SVM) with linear kernel to select the final model. Our proposed method significantly outperforms the existing state-of-the-art predictors on standard benchmark dataset. The accuracy of the predictor is 90.18% using jack knife test and 88.87% using 10-fold cross validation on the benchmark dataset. The accuracy of the predictor on the independent dataset is 80.64% which is also significantly better than the state-of-the-art methods. iDNAProt-ES is a novel prediction method that uses evolutionary and structural based features. We believe the superior performance of iDNAProt-ES will motivate the researchers to use this method to identify DNA-binding proteins. iDNAProt-ES is publicly available as a web server at: http://brl.uiu.ac.bd/iDNAProt-ES/. Nature Publishing Group UK 2017-11-02 /pmc/articles/PMC5668250/ /pubmed/29097781 http://dx.doi.org/10.1038/s41598-017-14945-1 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Chowdhury, Shahana Yasmin Shatabda, Swakkhar Dehzangi, Abdollah iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features |
title | iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features |
title_full | iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features |
title_fullStr | iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features |
title_full_unstemmed | iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features |
title_short | iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features |
title_sort | idnaprot-es: identification of dna-binding proteins using evolutionary and structural features |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668250/ https://www.ncbi.nlm.nih.gov/pubmed/29097781 http://dx.doi.org/10.1038/s41598-017-14945-1 |
work_keys_str_mv | AT chowdhuryshahanayasmin idnaprotesidentificationofdnabindingproteinsusingevolutionaryandstructuralfeatures AT shatabdaswakkhar idnaprotesidentificationofdnabindingproteinsusingevolutionaryandstructuralfeatures AT dehzangiabdollah idnaprotesidentificationofdnabindingproteinsusingevolutionaryandstructuralfeatures |