Cargando…

Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures

BACKGROUND: RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. It is therefore necessary to identify the RNA interaction partners in order to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using...

Descripción completa

Detalles Bibliográficos
Autores principales: Livi, Carmen M, Blanzieri, Enrico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4098778/
https://www.ncbi.nlm.nih.gov/pubmed/24780077
http://dx.doi.org/10.1186/1471-2105-15-123
_version_ 1782326394278117376
author Livi, Carmen M
Blanzieri, Enrico
author_facet Livi, Carmen M
Blanzieri, Enrico
author_sort Livi, Carmen M
collection PubMed
description BACKGROUND: RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. It is therefore necessary to identify the RNA interaction partners in order to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using in vivo and in vitro experiments but these may not detect all binding partners. Therefore, computational methods that capture the protein-dependent nature of such binding interactions could help to predict potential binding partners in silico. RESULTS: We have developed three methods to predict whether an RNA can interact with a particular RNA-binding protein using support vector machines and different features based on the sequence (the Oli method), the motif score (the OliMo method) and the secondary structure (the OliMoSS method). We applied these approaches to different experimentally-derived datasets and compared the predictions with RNAcontext and RPISeq. Oli outperformed OliMoSS and RPISeq, confirming our protein-specific predictions and suggesting that tetranucleotide frequencies are appropriate discriminative features. Oli and RNAcontext were the most competitive methods in terms of the area under curve. A precision-recall curve analysis achieved higher precision values for Oli. On a second experimental dataset including real negative binding information, Oli outperformed RNAcontext with a precision of 0.73 vs. 0.59. CONCLUSIONS: Our experiments showed that features based on primary sequence information are sufficiently discriminating to predict specific RNA-protein interactions. Sequence motifs and secondary structure information were not necessary to improve these predictions. Finally we confirmed that protein-specific experimental data concerning RNA-protein interactions are valuable sources of information that can be used for the efficient training of models for in silico predictions. The scripts are available upon request to the corresponding author.
format Online
Article
Text
id pubmed-4098778
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40987782014-07-18 Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures Livi, Carmen M Blanzieri, Enrico BMC Bioinformatics Research Article BACKGROUND: RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. It is therefore necessary to identify the RNA interaction partners in order to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using in vivo and in vitro experiments but these may not detect all binding partners. Therefore, computational methods that capture the protein-dependent nature of such binding interactions could help to predict potential binding partners in silico. RESULTS: We have developed three methods to predict whether an RNA can interact with a particular RNA-binding protein using support vector machines and different features based on the sequence (the Oli method), the motif score (the OliMo method) and the secondary structure (the OliMoSS method). We applied these approaches to different experimentally-derived datasets and compared the predictions with RNAcontext and RPISeq. Oli outperformed OliMoSS and RPISeq, confirming our protein-specific predictions and suggesting that tetranucleotide frequencies are appropriate discriminative features. Oli and RNAcontext were the most competitive methods in terms of the area under curve. A precision-recall curve analysis achieved higher precision values for Oli. On a second experimental dataset including real negative binding information, Oli outperformed RNAcontext with a precision of 0.73 vs. 0.59. CONCLUSIONS: Our experiments showed that features based on primary sequence information are sufficiently discriminating to predict specific RNA-protein interactions. Sequence motifs and secondary structure information were not necessary to improve these predictions. Finally we confirmed that protein-specific experimental data concerning RNA-protein interactions are valuable sources of information that can be used for the efficient training of models for in silico predictions. The scripts are available upon request to the corresponding author. BioMed Central 2014-04-29 /pmc/articles/PMC4098778/ /pubmed/24780077 http://dx.doi.org/10.1186/1471-2105-15-123 Text en Copyright © 2014 Livi and Blanzieri; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Livi, Carmen M
Blanzieri, Enrico
Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures
title Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures
title_full Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures
title_fullStr Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures
title_full_unstemmed Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures
title_short Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures
title_sort protein-specific prediction of mrna binding using rna sequences, binding motifs and predicted secondary structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4098778/
https://www.ncbi.nlm.nih.gov/pubmed/24780077
http://dx.doi.org/10.1186/1471-2105-15-123
work_keys_str_mv AT livicarmenm proteinspecificpredictionofmrnabindingusingrnasequencesbindingmotifsandpredictedsecondarystructures
AT blanzierienrico proteinspecificpredictionofmrnabindingusingrnasequencesbindingmotifsandpredictedsecondarystructures