Cargando…
ATPsite: sequence-based prediction of ATP-binding residues
BACKGROUND: ATP is a ubiquitous nucleotide that provides energy for cellular activities, catalyzes chemical reactions, and is involved in cellular signalling. The knowledge of the ATP-protein interactions helps with annotation of protein functions and finds applications in drug design. The sequence...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289083/ https://www.ncbi.nlm.nih.gov/pubmed/22165846 http://dx.doi.org/10.1186/1477-5956-9-S1-S4 |
_version_ | 1782224847311470592 |
---|---|
author | Chen, Ke Mizianty, Marcin J Kurgan, Lukasz |
author_facet | Chen, Ke Mizianty, Marcin J Kurgan, Lukasz |
author_sort | Chen, Ke |
collection | PubMed |
description | BACKGROUND: ATP is a ubiquitous nucleotide that provides energy for cellular activities, catalyzes chemical reactions, and is involved in cellular signalling. The knowledge of the ATP-protein interactions helps with annotation of protein functions and finds applications in drug design. The sequence to structure annotation gap motivates development of high-throughput sequence-based predictors of the ATP-binding residues. Moreover, our empirical tests show that the only existing predictor, ATPint, is characterized by relatively low predictive quality. METHODS: We propose a novel, high-throughput machine learning-based predictor, ATPsite, which identifies ATP-binding residues from protein sequences. Our predictor utilizes Support Vector Machine classifier and a comprehensive set of input features that are based on the sequence, evolutionary profiles, and the sequence-predicted structural descriptors including secondary structure, solvent accessibility, and dihedral angles. RESULTS: The ATPsite achieves significantly higher Mathews Correlation Coefficient (MCC) and Area Under the ROC Curve (AUC) values when compared with the existing methods including the ATPint, conservation-based rate4site, and alignment-based BLAST predictors. We also assessed the effectiveness of individual input types. The PSSM profile, the conservation scores, and certain features based on amino acid groups are shown to be more effective in predicting the ATP-binding residues than the remaining feature groups. CONCLUSIONS: Statistical tests show that ATPsite significantly outperforms existing solutions. The consensus of the ATPsite with the sequence-alignment based predictor is shown to give further improvements. |
format | Online Article Text |
id | pubmed-3289083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32890832012-02-29 ATPsite: sequence-based prediction of ATP-binding residues Chen, Ke Mizianty, Marcin J Kurgan, Lukasz Proteome Sci Proceedings BACKGROUND: ATP is a ubiquitous nucleotide that provides energy for cellular activities, catalyzes chemical reactions, and is involved in cellular signalling. The knowledge of the ATP-protein interactions helps with annotation of protein functions and finds applications in drug design. The sequence to structure annotation gap motivates development of high-throughput sequence-based predictors of the ATP-binding residues. Moreover, our empirical tests show that the only existing predictor, ATPint, is characterized by relatively low predictive quality. METHODS: We propose a novel, high-throughput machine learning-based predictor, ATPsite, which identifies ATP-binding residues from protein sequences. Our predictor utilizes Support Vector Machine classifier and a comprehensive set of input features that are based on the sequence, evolutionary profiles, and the sequence-predicted structural descriptors including secondary structure, solvent accessibility, and dihedral angles. RESULTS: The ATPsite achieves significantly higher Mathews Correlation Coefficient (MCC) and Area Under the ROC Curve (AUC) values when compared with the existing methods including the ATPint, conservation-based rate4site, and alignment-based BLAST predictors. We also assessed the effectiveness of individual input types. The PSSM profile, the conservation scores, and certain features based on amino acid groups are shown to be more effective in predicting the ATP-binding residues than the remaining feature groups. CONCLUSIONS: Statistical tests show that ATPsite significantly outperforms existing solutions. The consensus of the ATPsite with the sequence-alignment based predictor is shown to give further improvements. BioMed Central 2011-10-14 /pmc/articles/PMC3289083/ /pubmed/22165846 http://dx.doi.org/10.1186/1477-5956-9-S1-S4 Text en Copyright ©2011 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Chen, Ke Mizianty, Marcin J Kurgan, Lukasz ATPsite: sequence-based prediction of ATP-binding residues |
title | ATPsite: sequence-based prediction of ATP-binding residues |
title_full | ATPsite: sequence-based prediction of ATP-binding residues |
title_fullStr | ATPsite: sequence-based prediction of ATP-binding residues |
title_full_unstemmed | ATPsite: sequence-based prediction of ATP-binding residues |
title_short | ATPsite: sequence-based prediction of ATP-binding residues |
title_sort | atpsite: sequence-based prediction of atp-binding residues |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289083/ https://www.ncbi.nlm.nih.gov/pubmed/22165846 http://dx.doi.org/10.1186/1477-5956-9-S1-S4 |
work_keys_str_mv | AT chenke atpsitesequencebasedpredictionofatpbindingresidues AT miziantymarcinj atpsitesequencebasedpredictionofatpbindingresidues AT kurganlukasz atpsitesequencebasedpredictionofatpbindingresidues |