Cargando…

Deep learning tools are top performers in long non-coding RNA prediction

The increasing amount of transcriptomic data has brought to light vast numbers of potential novel RNA transcripts. Accurately distinguishing novel long non-coding RNAs (lncRNAs) from protein-coding messenger RNAs (mRNAs) has challenged bioinformatic tool developers. Most recently, tools implementing...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ammunét, Tea, Wang, Ning, Khan, Sofia, Elo, Laura L
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2022
Materias:	Technology Review
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9123429/ https://www.ncbi.nlm.nih.gov/pubmed/35136929 http://dx.doi.org/10.1093/bfgp/elab045

_version_	1784711553265696768
author	Ammunét, Tea Wang, Ning Khan, Sofia Elo, Laura L
author_facet	Ammunét, Tea Wang, Ning Khan, Sofia Elo, Laura L
author_sort	Ammunét, Tea
collection	PubMed
description	The increasing amount of transcriptomic data has brought to light vast numbers of potential novel RNA transcripts. Accurately distinguishing novel long non-coding RNAs (lncRNAs) from protein-coding messenger RNAs (mRNAs) has challenged bioinformatic tool developers. Most recently, tools implementing deep learning architectures have been developed for this task, with the potential of discovering sequence features and their interactions still not surfaced in current knowledge. We compared the performance of deep learning tools with other predictive tools that are currently used in lncRNA coding potential prediction. A total of 15 tools representing the variety of available methods were investigated. In addition to known annotated transcripts, we also evaluated the use of the tools in actual studies with real-life data. The robustness and scalability of the tools’ performance was tested with varying sized test sets and test sets with different proportions of lncRNAs and mRNAs. In addition, the ease-of-use for each tested tool was scored. Deep learning tools were top performers in most metrics and labelled transcripts similarly with each other in the real-life dataset. However, the proportion of lncRNAs and mRNAs in the test sets affected the performance of all tools. Computational resources were utilized differently between the top-ranking tools, thus the nature of the study may affect the decision of choosing one well-performing tool over another. Nonetheless, the results suggest favouring the novel deep learning tools over other tools currently in broad use.
format	Online Article Text
id	pubmed-9123429
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-91234292022-05-23 Deep learning tools are top performers in long non-coding RNA prediction Ammunét, Tea Wang, Ning Khan, Sofia Elo, Laura L Brief Funct Genomics Technology Review The increasing amount of transcriptomic data has brought to light vast numbers of potential novel RNA transcripts. Accurately distinguishing novel long non-coding RNAs (lncRNAs) from protein-coding messenger RNAs (mRNAs) has challenged bioinformatic tool developers. Most recently, tools implementing deep learning architectures have been developed for this task, with the potential of discovering sequence features and their interactions still not surfaced in current knowledge. We compared the performance of deep learning tools with other predictive tools that are currently used in lncRNA coding potential prediction. A total of 15 tools representing the variety of available methods were investigated. In addition to known annotated transcripts, we also evaluated the use of the tools in actual studies with real-life data. The robustness and scalability of the tools’ performance was tested with varying sized test sets and test sets with different proportions of lncRNAs and mRNAs. In addition, the ease-of-use for each tested tool was scored. Deep learning tools were top performers in most metrics and labelled transcripts similarly with each other in the real-life dataset. However, the proportion of lncRNAs and mRNAs in the test sets affected the performance of all tools. Computational resources were utilized differently between the top-ranking tools, thus the nature of the study may affect the decision of choosing one well-performing tool over another. Nonetheless, the results suggest favouring the novel deep learning tools over other tools currently in broad use. Oxford University Press 2022-02-06 /pmc/articles/PMC9123429/ /pubmed/35136929 http://dx.doi.org/10.1093/bfgp/elab045 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Technology Review Ammunét, Tea Wang, Ning Khan, Sofia Elo, Laura L Deep learning tools are top performers in long non-coding RNA prediction
title	Deep learning tools are top performers in long non-coding RNA prediction
title_full	Deep learning tools are top performers in long non-coding RNA prediction
title_fullStr	Deep learning tools are top performers in long non-coding RNA prediction
title_full_unstemmed	Deep learning tools are top performers in long non-coding RNA prediction
title_short	Deep learning tools are top performers in long non-coding RNA prediction
title_sort	deep learning tools are top performers in long non-coding rna prediction
topic	Technology Review
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9123429/ https://www.ncbi.nlm.nih.gov/pubmed/35136929 http://dx.doi.org/10.1093/bfgp/elab045
work_keys_str_mv	AT ammunettea deeplearningtoolsaretopperformersinlongnoncodingrnaprediction AT wangning deeplearningtoolsaretopperformersinlongnoncodingrnaprediction AT khansofia deeplearningtoolsaretopperformersinlongnoncodingrnaprediction AT elolaural deeplearningtoolsaretopperformersinlongnoncodingrnaprediction

Deep learning tools are top performers in long non-coding RNA prediction

Ejemplares similares