Cargando…

IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection

MOTIVATION: Non-coding RNAs (ncRNAs) play important roles in many biological processes and are involved in many diseases. Their identification is an important task, and many tools exist in the literature for this purpose. However, almost all of them are focused on the discrimination of coding and nc...

Descripción completa

Detalles Bibliográficos
Autores principales: Platon, Ludovic, Zehraoui, Farida, Bendahmane, Abdelhafid, Tahi, Fariza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129289/
https://www.ncbi.nlm.nih.gov/pubmed/30423081
http://dx.doi.org/10.1093/bioinformatics/bty572
_version_ 1783353774764457984
author Platon, Ludovic
Zehraoui, Farida
Bendahmane, Abdelhafid
Tahi, Fariza
author_facet Platon, Ludovic
Zehraoui, Farida
Bendahmane, Abdelhafid
Tahi, Fariza
author_sort Platon, Ludovic
collection PubMed
description MOTIVATION: Non-coding RNAs (ncRNAs) play important roles in many biological processes and are involved in many diseases. Their identification is an important task, and many tools exist in the literature for this purpose. However, almost all of them are focused on the discrimination of coding and ncRNAs without giving more biological insight. In this paper, we propose a new reliable method called IRSOM, based on a supervised Self-Organizing Map (SOM) with a rejection option, that overcomes these limitations. The rejection option in IRSOM improves the accuracy of the method and also allows identifing the ambiguous transcripts. Furthermore, with the visualization of the SOM, we analyze the rejected predictions and highlight the ambiguity of the transcripts. RESULTS: IRSOM was tested on datasets of several species from different reigns, and shown better results compared to state-of-art. The accuracy of IRSOM is always greater than 0.95 for all the species with an average specificity of 0.98 and an average sensitivity of 0.99. Besides, IRSOM is fast (it takes around 254 s to analyze a dataset of 147 000 transcripts) and is able to handle very large datasets. AVAILABILITY AND IMPLEMENTATION: IRSOM is implemented in Python and C++. It is available on our software platform EvryRNA (http://EvryRNA.ibisc.univ-evry.fr).
format Online
Article
Text
id pubmed-6129289
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61292892018-09-12 IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection Platon, Ludovic Zehraoui, Farida Bendahmane, Abdelhafid Tahi, Fariza Bioinformatics Eccb 2018: European Conference on Computational Biology Proceedings MOTIVATION: Non-coding RNAs (ncRNAs) play important roles in many biological processes and are involved in many diseases. Their identification is an important task, and many tools exist in the literature for this purpose. However, almost all of them are focused on the discrimination of coding and ncRNAs without giving more biological insight. In this paper, we propose a new reliable method called IRSOM, based on a supervised Self-Organizing Map (SOM) with a rejection option, that overcomes these limitations. The rejection option in IRSOM improves the accuracy of the method and also allows identifing the ambiguous transcripts. Furthermore, with the visualization of the SOM, we analyze the rejected predictions and highlight the ambiguity of the transcripts. RESULTS: IRSOM was tested on datasets of several species from different reigns, and shown better results compared to state-of-art. The accuracy of IRSOM is always greater than 0.95 for all the species with an average specificity of 0.98 and an average sensitivity of 0.99. Besides, IRSOM is fast (it takes around 254 s to analyze a dataset of 147 000 transcripts) and is able to handle very large datasets. AVAILABILITY AND IMPLEMENTATION: IRSOM is implemented in Python and C++. It is available on our software platform EvryRNA (http://EvryRNA.ibisc.univ-evry.fr). Oxford University Press 2018-09-01 2018-09-08 /pmc/articles/PMC6129289/ /pubmed/30423081 http://dx.doi.org/10.1093/bioinformatics/bty572 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2018: European Conference on Computational Biology Proceedings
Platon, Ludovic
Zehraoui, Farida
Bendahmane, Abdelhafid
Tahi, Fariza
IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection
title IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection
title_full IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection
title_fullStr IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection
title_full_unstemmed IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection
title_short IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection
title_sort irsom, a reliable identifier of ncrnas based on supervised self-organizing maps with rejection
topic Eccb 2018: European Conference on Computational Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129289/
https://www.ncbi.nlm.nih.gov/pubmed/30423081
http://dx.doi.org/10.1093/bioinformatics/bty572
work_keys_str_mv AT platonludovic irsomareliableidentifierofncrnasbasedonsupervisedselforganizingmapswithrejection
AT zehraouifarida irsomareliableidentifierofncrnasbasedonsupervisedselforganizingmapswithrejection
AT bendahmaneabdelhafid irsomareliableidentifierofncrnasbasedonsupervisedselforganizingmapswithrejection
AT tahifariza irsomareliableidentifierofncrnasbasedonsupervisedselforganizingmapswithrejection