Cargando…

BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features

BACKGROUND: Understanding how biomolecules interact is a major task of systems biology. To model protein-nucleic acid interactions, it is important to identify the DNA or RNA-binding residues in proteins. Protein sequence features, including the biochemical property of amino acids and evolutionary i...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Liangjiang, Huang, Caiyan, Yang, Mary Qu, Yang, Jack Y
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2880409/
https://www.ncbi.nlm.nih.gov/pubmed/20522253
http://dx.doi.org/10.1186/1752-0509-4-S1-S3
_version_ 1782182024396668928
author Wang, Liangjiang
Huang, Caiyan
Yang, Mary Qu
Yang, Jack Y
author_facet Wang, Liangjiang
Huang, Caiyan
Yang, Mary Qu
Yang, Jack Y
author_sort Wang, Liangjiang
collection PubMed
description BACKGROUND: Understanding how biomolecules interact is a major task of systems biology. To model protein-nucleic acid interactions, it is important to identify the DNA or RNA-binding residues in proteins. Protein sequence features, including the biochemical property of amino acids and evolutionary information in terms of position-specific scoring matrix (PSSM), have been used for DNA or RNA-binding site prediction. However, PSSM is rather designed for PSI-BLAST searches, and it may not contain all the evolutionary information for modelling DNA or RNA-binding sites in protein sequences. RESULTS: In the present study, several new descriptors of evolutionary information have been developed and evaluated for sequence-based prediction of DNA and RNA-binding residues using support vector machines (SVMs). The new descriptors were shown to improve classifier performance. Interestingly, the best classifiers were obtained by combining the new descriptors and PSSM, suggesting that they captured different aspects of evolutionary information for DNA and RNA-binding site prediction. The SVM classifiers achieved 77.3% sensitivity and 79.3% specificity for prediction of DNA-binding residues, and 71.6% sensitivity and 78.7% specificity for RNA-binding site prediction. CONCLUSIONS: Predictions at this level of accuracy may provide useful information for modelling protein-nucleic acid interactions in systems biology studies. We have thus developed a web-based tool called BindN+ (http://bioinfo.ggc.org/bindn+/) to make the SVM classifiers accessible to the research community.
format Text
id pubmed-2880409
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28804092010-06-04 BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features Wang, Liangjiang Huang, Caiyan Yang, Mary Qu Yang, Jack Y BMC Syst Biol Research BACKGROUND: Understanding how biomolecules interact is a major task of systems biology. To model protein-nucleic acid interactions, it is important to identify the DNA or RNA-binding residues in proteins. Protein sequence features, including the biochemical property of amino acids and evolutionary information in terms of position-specific scoring matrix (PSSM), have been used for DNA or RNA-binding site prediction. However, PSSM is rather designed for PSI-BLAST searches, and it may not contain all the evolutionary information for modelling DNA or RNA-binding sites in protein sequences. RESULTS: In the present study, several new descriptors of evolutionary information have been developed and evaluated for sequence-based prediction of DNA and RNA-binding residues using support vector machines (SVMs). The new descriptors were shown to improve classifier performance. Interestingly, the best classifiers were obtained by combining the new descriptors and PSSM, suggesting that they captured different aspects of evolutionary information for DNA and RNA-binding site prediction. The SVM classifiers achieved 77.3% sensitivity and 79.3% specificity for prediction of DNA-binding residues, and 71.6% sensitivity and 78.7% specificity for RNA-binding site prediction. CONCLUSIONS: Predictions at this level of accuracy may provide useful information for modelling protein-nucleic acid interactions in systems biology studies. We have thus developed a web-based tool called BindN+ (http://bioinfo.ggc.org/bindn+/) to make the SVM classifiers accessible to the research community. BioMed Central 2010-05-28 /pmc/articles/PMC2880409/ /pubmed/20522253 http://dx.doi.org/10.1186/1752-0509-4-S1-S3 Text en Copyright ©2010 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Wang, Liangjiang
Huang, Caiyan
Yang, Mary Qu
Yang, Jack Y
BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features
title BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features
title_full BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features
title_fullStr BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features
title_full_unstemmed BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features
title_short BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features
title_sort bindn+ for accurate prediction of dna and rna-binding residues from protein sequence features
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2880409/
https://www.ncbi.nlm.nih.gov/pubmed/20522253
http://dx.doi.org/10.1186/1752-0509-4-S1-S3
work_keys_str_mv AT wangliangjiang bindnforaccuratepredictionofdnaandrnabindingresiduesfromproteinsequencefeatures
AT huangcaiyan bindnforaccuratepredictionofdnaandrnabindingresiduesfromproteinsequencefeatures
AT yangmaryqu bindnforaccuratepredictionofdnaandrnabindingresiduesfromproteinsequencefeatures
AT yangjacky bindnforaccuratepredictionofdnaandrnabindingresiduesfromproteinsequencefeatures