Cargando…
Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures
BACKGROUND: RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. It is therefore necessary to identify the RNA interaction partners in order to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4098778/ https://www.ncbi.nlm.nih.gov/pubmed/24780077 http://dx.doi.org/10.1186/1471-2105-15-123 |
_version_ | 1782326394278117376 |
---|---|
author | Livi, Carmen M Blanzieri, Enrico |
author_facet | Livi, Carmen M Blanzieri, Enrico |
author_sort | Livi, Carmen M |
collection | PubMed |
description | BACKGROUND: RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. It is therefore necessary to identify the RNA interaction partners in order to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using in vivo and in vitro experiments but these may not detect all binding partners. Therefore, computational methods that capture the protein-dependent nature of such binding interactions could help to predict potential binding partners in silico. RESULTS: We have developed three methods to predict whether an RNA can interact with a particular RNA-binding protein using support vector machines and different features based on the sequence (the Oli method), the motif score (the OliMo method) and the secondary structure (the OliMoSS method). We applied these approaches to different experimentally-derived datasets and compared the predictions with RNAcontext and RPISeq. Oli outperformed OliMoSS and RPISeq, confirming our protein-specific predictions and suggesting that tetranucleotide frequencies are appropriate discriminative features. Oli and RNAcontext were the most competitive methods in terms of the area under curve. A precision-recall curve analysis achieved higher precision values for Oli. On a second experimental dataset including real negative binding information, Oli outperformed RNAcontext with a precision of 0.73 vs. 0.59. CONCLUSIONS: Our experiments showed that features based on primary sequence information are sufficiently discriminating to predict specific RNA-protein interactions. Sequence motifs and secondary structure information were not necessary to improve these predictions. Finally we confirmed that protein-specific experimental data concerning RNA-protein interactions are valuable sources of information that can be used for the efficient training of models for in silico predictions. The scripts are available upon request to the corresponding author. |
format | Online Article Text |
id | pubmed-4098778 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40987782014-07-18 Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures Livi, Carmen M Blanzieri, Enrico BMC Bioinformatics Research Article BACKGROUND: RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. It is therefore necessary to identify the RNA interaction partners in order to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using in vivo and in vitro experiments but these may not detect all binding partners. Therefore, computational methods that capture the protein-dependent nature of such binding interactions could help to predict potential binding partners in silico. RESULTS: We have developed three methods to predict whether an RNA can interact with a particular RNA-binding protein using support vector machines and different features based on the sequence (the Oli method), the motif score (the OliMo method) and the secondary structure (the OliMoSS method). We applied these approaches to different experimentally-derived datasets and compared the predictions with RNAcontext and RPISeq. Oli outperformed OliMoSS and RPISeq, confirming our protein-specific predictions and suggesting that tetranucleotide frequencies are appropriate discriminative features. Oli and RNAcontext were the most competitive methods in terms of the area under curve. A precision-recall curve analysis achieved higher precision values for Oli. On a second experimental dataset including real negative binding information, Oli outperformed RNAcontext with a precision of 0.73 vs. 0.59. CONCLUSIONS: Our experiments showed that features based on primary sequence information are sufficiently discriminating to predict specific RNA-protein interactions. Sequence motifs and secondary structure information were not necessary to improve these predictions. Finally we confirmed that protein-specific experimental data concerning RNA-protein interactions are valuable sources of information that can be used for the efficient training of models for in silico predictions. The scripts are available upon request to the corresponding author. BioMed Central 2014-04-29 /pmc/articles/PMC4098778/ /pubmed/24780077 http://dx.doi.org/10.1186/1471-2105-15-123 Text en Copyright © 2014 Livi and Blanzieri; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Livi, Carmen M Blanzieri, Enrico Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures |
title | Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures |
title_full | Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures |
title_fullStr | Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures |
title_full_unstemmed | Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures |
title_short | Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures |
title_sort | protein-specific prediction of mrna binding using rna sequences, binding motifs and predicted secondary structures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4098778/ https://www.ncbi.nlm.nih.gov/pubmed/24780077 http://dx.doi.org/10.1186/1471-2105-15-123 |
work_keys_str_mv | AT livicarmenm proteinspecificpredictionofmrnabindingusingrnasequencesbindingmotifsandpredictedsecondarystructures AT blanzierienrico proteinspecificpredictionofmrnabindingusingrnasequencesbindingmotifsandpredictedsecondarystructures |