Cargando…

Tailored machine learning models for functional RNA detection in genome-wide screens

The in silico prediction of non-coding and protein-coding genetic loci has received considerable attention in comparative genomics aiming in particular at the identification of properties of nucleotide sequences that are informative of their biological role in the cell. We present here a software fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Klapproth, Christopher, Zötzsche, Siegfried, Kühnl, Felix, Fallmann, Jörg, Stadler, Peter F, Findeiß, Sven
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10440787/
https://www.ncbi.nlm.nih.gov/pubmed/37608800
http://dx.doi.org/10.1093/nargab/lqad072
_version_ 1785093228125487104
author Klapproth, Christopher
Zötzsche, Siegfried
Kühnl, Felix
Fallmann, Jörg
Stadler, Peter F
Findeiß, Sven
author_facet Klapproth, Christopher
Zötzsche, Siegfried
Kühnl, Felix
Fallmann, Jörg
Stadler, Peter F
Findeiß, Sven
author_sort Klapproth, Christopher
collection PubMed
description The in silico prediction of non-coding and protein-coding genetic loci has received considerable attention in comparative genomics aiming in particular at the identification of properties of nucleotide sequences that are informative of their biological role in the cell. We present here a software framework for the alignment-based training, evaluation and application of machine learning models with user-defined parameters. Instead of focusing on the one-size-fits-all approach of pervasive in silico annotation pipelines, we offer a framework for the structured generation and evaluation of models based on arbitrary features and input data, focusing on stable and explainable results. Furthermore, we showcase the usage of our software package in a full-genome screen of Drosophila melanogaster and evaluate our results against the well-known but much less flexible program RNAz.
format Online
Article
Text
id pubmed-10440787
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104407872023-08-22 Tailored machine learning models for functional RNA detection in genome-wide screens Klapproth, Christopher Zötzsche, Siegfried Kühnl, Felix Fallmann, Jörg Stadler, Peter F Findeiß, Sven NAR Genom Bioinform Standard Article The in silico prediction of non-coding and protein-coding genetic loci has received considerable attention in comparative genomics aiming in particular at the identification of properties of nucleotide sequences that are informative of their biological role in the cell. We present here a software framework for the alignment-based training, evaluation and application of machine learning models with user-defined parameters. Instead of focusing on the one-size-fits-all approach of pervasive in silico annotation pipelines, we offer a framework for the structured generation and evaluation of models based on arbitrary features and input data, focusing on stable and explainable results. Furthermore, we showcase the usage of our software package in a full-genome screen of Drosophila melanogaster and evaluate our results against the well-known but much less flexible program RNAz. Oxford University Press 2023-08-21 /pmc/articles/PMC10440787/ /pubmed/37608800 http://dx.doi.org/10.1093/nargab/lqad072 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Klapproth, Christopher
Zötzsche, Siegfried
Kühnl, Felix
Fallmann, Jörg
Stadler, Peter F
Findeiß, Sven
Tailored machine learning models for functional RNA detection in genome-wide screens
title Tailored machine learning models for functional RNA detection in genome-wide screens
title_full Tailored machine learning models for functional RNA detection in genome-wide screens
title_fullStr Tailored machine learning models for functional RNA detection in genome-wide screens
title_full_unstemmed Tailored machine learning models for functional RNA detection in genome-wide screens
title_short Tailored machine learning models for functional RNA detection in genome-wide screens
title_sort tailored machine learning models for functional rna detection in genome-wide screens
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10440787/
https://www.ncbi.nlm.nih.gov/pubmed/37608800
http://dx.doi.org/10.1093/nargab/lqad072
work_keys_str_mv AT klapprothchristopher tailoredmachinelearningmodelsforfunctionalrnadetectioningenomewidescreens
AT zotzschesiegfried tailoredmachinelearningmodelsforfunctionalrnadetectioningenomewidescreens
AT kuhnlfelix tailoredmachinelearningmodelsforfunctionalrnadetectioningenomewidescreens
AT fallmannjorg tailoredmachinelearningmodelsforfunctionalrnadetectioningenomewidescreens
AT stadlerpeterf tailoredmachinelearningmodelsforfunctionalrnadetectioningenomewidescreens
AT findeißsven tailoredmachinelearningmodelsforfunctionalrnadetectioningenomewidescreens