Cargando…

An accurate and interpretable model for siRNA efficacy prediction

BACKGROUND: The use of exogenous small interfering RNAs (siRNAs) for gene silencing has quickly become a widespread molecular tool providing a powerful means for gene functional study and new drug target identification. Although considerable progress has been made recently in understanding how the R...

Descripción completa

Detalles Bibliográficos
Autores principales: Vert, Jean-Philippe, Foveau, Nicolas, Lajaunie, Christian, Vandenbrouck, Yves
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1698581/
https://www.ncbi.nlm.nih.gov/pubmed/17137497
http://dx.doi.org/10.1186/1471-2105-7-520
_version_ 1782131238107086848
author Vert, Jean-Philippe
Foveau, Nicolas
Lajaunie, Christian
Vandenbrouck, Yves
author_facet Vert, Jean-Philippe
Foveau, Nicolas
Lajaunie, Christian
Vandenbrouck, Yves
author_sort Vert, Jean-Philippe
collection PubMed
description BACKGROUND: The use of exogenous small interfering RNAs (siRNAs) for gene silencing has quickly become a widespread molecular tool providing a powerful means for gene functional study and new drug target identification. Although considerable progress has been made recently in understanding how the RNAi pathway mediates gene silencing, the design of potent siRNAs remains challenging. RESULTS: We propose a simple linear model combining basic features of siRNA sequences for siRNA efficacy prediction. Trained and tested on a large dataset of siRNA sequences made recently available, it performs as well as more complex state-of-the-art models in terms of potency prediction accuracy, with the advantage of being directly interpretable. The analysis of this linear model allows us to detect and quantify the effect of nucleotide preferences at particular positions, including previously known and new observations. We also detect and quantify a strong propensity of potent siRNAs to contain short asymmetric motifs in their sequence, and show that, surprisingly, these motifs alone contain at least as much relevant information for potency prediction as the nucleotide preferences for particular positions. CONCLUSION: The model proposed for prediction of siRNA potency is as accurate as a state-of-the-art nonlinear model and is easily interpretable in terms of biological features. It is freely available on the web at
format Text
id pubmed-1698581
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16985812006-12-19 An accurate and interpretable model for siRNA efficacy prediction Vert, Jean-Philippe Foveau, Nicolas Lajaunie, Christian Vandenbrouck, Yves BMC Bioinformatics Methodology Article BACKGROUND: The use of exogenous small interfering RNAs (siRNAs) for gene silencing has quickly become a widespread molecular tool providing a powerful means for gene functional study and new drug target identification. Although considerable progress has been made recently in understanding how the RNAi pathway mediates gene silencing, the design of potent siRNAs remains challenging. RESULTS: We propose a simple linear model combining basic features of siRNA sequences for siRNA efficacy prediction. Trained and tested on a large dataset of siRNA sequences made recently available, it performs as well as more complex state-of-the-art models in terms of potency prediction accuracy, with the advantage of being directly interpretable. The analysis of this linear model allows us to detect and quantify the effect of nucleotide preferences at particular positions, including previously known and new observations. We also detect and quantify a strong propensity of potent siRNAs to contain short asymmetric motifs in their sequence, and show that, surprisingly, these motifs alone contain at least as much relevant information for potency prediction as the nucleotide preferences for particular positions. CONCLUSION: The model proposed for prediction of siRNA potency is as accurate as a state-of-the-art nonlinear model and is easily interpretable in terms of biological features. It is freely available on the web at BioMed Central 2006-11-30 /pmc/articles/PMC1698581/ /pubmed/17137497 http://dx.doi.org/10.1186/1471-2105-7-520 Text en Copyright © 2006 Vert et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Vert, Jean-Philippe
Foveau, Nicolas
Lajaunie, Christian
Vandenbrouck, Yves
An accurate and interpretable model for siRNA efficacy prediction
title An accurate and interpretable model for siRNA efficacy prediction
title_full An accurate and interpretable model for siRNA efficacy prediction
title_fullStr An accurate and interpretable model for siRNA efficacy prediction
title_full_unstemmed An accurate and interpretable model for siRNA efficacy prediction
title_short An accurate and interpretable model for siRNA efficacy prediction
title_sort accurate and interpretable model for sirna efficacy prediction
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1698581/
https://www.ncbi.nlm.nih.gov/pubmed/17137497
http://dx.doi.org/10.1186/1471-2105-7-520
work_keys_str_mv AT vertjeanphilippe anaccurateandinterpretablemodelforsirnaefficacyprediction
AT foveaunicolas anaccurateandinterpretablemodelforsirnaefficacyprediction
AT lajauniechristian anaccurateandinterpretablemodelforsirnaefficacyprediction
AT vandenbrouckyves anaccurateandinterpretablemodelforsirnaefficacyprediction
AT vertjeanphilippe accurateandinterpretablemodelforsirnaefficacyprediction
AT foveaunicolas accurateandinterpretablemodelforsirnaefficacyprediction
AT lajauniechristian accurateandinterpretablemodelforsirnaefficacyprediction
AT vandenbrouckyves accurateandinterpretablemodelforsirnaefficacyprediction