Cargando…

A bioinformatic-assisted workflow for genome-wide identification of ncRNAs

With the upcoming of affordable Next-Generation Sequencing technologies, the number of known non-protein coding RNAs increased drastically in recent years. Different types of non-coding RNAs (ncRNAs) emerged as key players in the regulation of gene expression on the RNA–RNA, RNA–DNA as well as RNA–p...

Descripción completa

Detalles Bibliográficos
Autores principales: Schmal, Matthias, Girod, Crystal, Yaver, Debbie, Mach, Robert L, Mach-Aigner, Astrid R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9376865/
https://www.ncbi.nlm.nih.gov/pubmed/35979446
http://dx.doi.org/10.1093/nargab/lqac059
_version_ 1784768224596852736
author Schmal, Matthias
Girod, Crystal
Yaver, Debbie
Mach, Robert L
Mach-Aigner, Astrid R
author_facet Schmal, Matthias
Girod, Crystal
Yaver, Debbie
Mach, Robert L
Mach-Aigner, Astrid R
author_sort Schmal, Matthias
collection PubMed
description With the upcoming of affordable Next-Generation Sequencing technologies, the number of known non-protein coding RNAs increased drastically in recent years. Different types of non-coding RNAs (ncRNAs) emerged as key players in the regulation of gene expression on the RNA–RNA, RNA–DNA as well as RNA–protein level, ranging from involvement in chromatin remodeling and transcription regulation to post-transcriptional modifications. Prediction of ncRNAs involves the use of several bioinformatics tools and can be a daunting task for researchers. This led to the development of analysis pipelines such as UClncR and lncpipe. However, these pipelines are limited to datasets from human, mouse, zebrafish or fruit fly and are not able to analyze RNA sequencing data from other organisms. In this study, we developed the analysis pipeline Pinc (Pipeline for prediction of ncRNA) as an enhanced tool to predict ncRNAs based on sequencing data by removing transcripts that show protein-coding potential. Additionally, a feature for differential expression analysis of annotated genes as well as for identification of novel ncRNAs is implemented. Pinc uses Nextflow as a framework and is built with robust and well-established analysis tools. This will allow researchers to utilize sequencing data from every organism in order to reliably identify ncRNAs.
format Online
Article
Text
id pubmed-9376865
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93768652022-08-16 A bioinformatic-assisted workflow for genome-wide identification of ncRNAs Schmal, Matthias Girod, Crystal Yaver, Debbie Mach, Robert L Mach-Aigner, Astrid R NAR Genom Bioinform Methods Article With the upcoming of affordable Next-Generation Sequencing technologies, the number of known non-protein coding RNAs increased drastically in recent years. Different types of non-coding RNAs (ncRNAs) emerged as key players in the regulation of gene expression on the RNA–RNA, RNA–DNA as well as RNA–protein level, ranging from involvement in chromatin remodeling and transcription regulation to post-transcriptional modifications. Prediction of ncRNAs involves the use of several bioinformatics tools and can be a daunting task for researchers. This led to the development of analysis pipelines such as UClncR and lncpipe. However, these pipelines are limited to datasets from human, mouse, zebrafish or fruit fly and are not able to analyze RNA sequencing data from other organisms. In this study, we developed the analysis pipeline Pinc (Pipeline for prediction of ncRNA) as an enhanced tool to predict ncRNAs based on sequencing data by removing transcripts that show protein-coding potential. Additionally, a feature for differential expression analysis of annotated genes as well as for identification of novel ncRNAs is implemented. Pinc uses Nextflow as a framework and is built with robust and well-established analysis tools. This will allow researchers to utilize sequencing data from every organism in order to reliably identify ncRNAs. Oxford University Press 2022-08-15 /pmc/articles/PMC9376865/ /pubmed/35979446 http://dx.doi.org/10.1093/nargab/lqac059 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Article
Schmal, Matthias
Girod, Crystal
Yaver, Debbie
Mach, Robert L
Mach-Aigner, Astrid R
A bioinformatic-assisted workflow for genome-wide identification of ncRNAs
title A bioinformatic-assisted workflow for genome-wide identification of ncRNAs
title_full A bioinformatic-assisted workflow for genome-wide identification of ncRNAs
title_fullStr A bioinformatic-assisted workflow for genome-wide identification of ncRNAs
title_full_unstemmed A bioinformatic-assisted workflow for genome-wide identification of ncRNAs
title_short A bioinformatic-assisted workflow for genome-wide identification of ncRNAs
title_sort bioinformatic-assisted workflow for genome-wide identification of ncrnas
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9376865/
https://www.ncbi.nlm.nih.gov/pubmed/35979446
http://dx.doi.org/10.1093/nargab/lqac059
work_keys_str_mv AT schmalmatthias abioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT girodcrystal abioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT yaverdebbie abioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT machrobertl abioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT machaignerastridr abioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT schmalmatthias bioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT girodcrystal bioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT yaverdebbie bioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT machrobertl bioinformaticassistedworkflowforgenomewideidentificationofncrnas
AT machaignerastridr bioinformaticassistedworkflowforgenomewideidentificationofncrnas