Cargando…

A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs

Computational prediction of nucleic acid binding sites in proteins are necessary to disentangle functional mechanisms in most biological processes and to explore the binding mechanisms. Several strategies have been proposed, but the state-of-the-art approaches display a great diversity in i) the def...

Descripción completa

Detalles Bibliográficos
Autores principales: Miao, Zhichao, Westhof, Eric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4683125/
https://www.ncbi.nlm.nih.gov/pubmed/26681179
http://dx.doi.org/10.1371/journal.pcbi.1004639
_version_ 1782405977890357248
author Miao, Zhichao
Westhof, Eric
author_facet Miao, Zhichao
Westhof, Eric
author_sort Miao, Zhichao
collection PubMed
description Computational prediction of nucleic acid binding sites in proteins are necessary to disentangle functional mechanisms in most biological processes and to explore the binding mechanisms. Several strategies have been proposed, but the state-of-the-art approaches display a great diversity in i) the definition of nucleic acid binding sites; ii) the training and test datasets; iii) the algorithmic methods for the prediction strategies; iv) the performance measures and v) the distribution and availability of the prediction programs. Here we report a large-scale assessment of 19 web servers and 3 stand-alone programs on 41 datasets including more than 5000 proteins derived from 3D structures of protein-nucleic acid complexes. Well-defined binary assessment criteria (specificity, sensitivity, precision, accuracy…) are applied. We found that i) the tools have been greatly improved over the years; ii) some of the approaches suffer from theoretical defects and there is still room for sorting out the essential mechanisms of binding; iii) RNA binding and DNA binding appear to follow similar driving forces and iv) dataset bias may exist in some methods.
format Online
Article
Text
id pubmed-4683125
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46831252015-12-31 A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs Miao, Zhichao Westhof, Eric PLoS Comput Biol Research Article Computational prediction of nucleic acid binding sites in proteins are necessary to disentangle functional mechanisms in most biological processes and to explore the binding mechanisms. Several strategies have been proposed, but the state-of-the-art approaches display a great diversity in i) the definition of nucleic acid binding sites; ii) the training and test datasets; iii) the algorithmic methods for the prediction strategies; iv) the performance measures and v) the distribution and availability of the prediction programs. Here we report a large-scale assessment of 19 web servers and 3 stand-alone programs on 41 datasets including more than 5000 proteins derived from 3D structures of protein-nucleic acid complexes. Well-defined binary assessment criteria (specificity, sensitivity, precision, accuracy…) are applied. We found that i) the tools have been greatly improved over the years; ii) some of the approaches suffer from theoretical defects and there is still room for sorting out the essential mechanisms of binding; iii) RNA binding and DNA binding appear to follow similar driving forces and iv) dataset bias may exist in some methods. Public Library of Science 2015-12-17 /pmc/articles/PMC4683125/ /pubmed/26681179 http://dx.doi.org/10.1371/journal.pcbi.1004639 Text en © 2015 Miao, Westhof http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Miao, Zhichao
Westhof, Eric
A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs
title A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs
title_full A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs
title_fullStr A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs
title_full_unstemmed A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs
title_short A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs
title_sort large-scale assessment of nucleic acids binding site prediction programs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4683125/
https://www.ncbi.nlm.nih.gov/pubmed/26681179
http://dx.doi.org/10.1371/journal.pcbi.1004639
work_keys_str_mv AT miaozhichao alargescaleassessmentofnucleicacidsbindingsitepredictionprograms
AT westhoferic alargescaleassessmentofnucleicacidsbindingsitepredictionprograms
AT miaozhichao largescaleassessmentofnucleicacidsbindingsitepredictionprograms
AT westhoferic largescaleassessmentofnucleicacidsbindingsitepredictionprograms