Cargando…

Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish

SIMPLE SUMMARY: Noncoding RNAs (ncRNAs) regulate a variety of fundamental life processes such as development, physiology, metabolism and circadian rhythmicity. RNA-sequencing (RNA-seq) technology has facilitated the sequencing of the whole transcriptome, thereby capturing and quantifying the dynamis...

Descripción completa

Detalles Bibliográficos
Autores principales: Mishra, Shital Kumar, Wang, Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8145020/
https://www.ncbi.nlm.nih.gov/pubmed/33925925
http://dx.doi.org/10.3390/biology10050371
_version_ 1783697082717044736
author Mishra, Shital Kumar
Wang, Han
author_facet Mishra, Shital Kumar
Wang, Han
author_sort Mishra, Shital Kumar
collection PubMed
description SIMPLE SUMMARY: Noncoding RNAs (ncRNAs) regulate a variety of fundamental life processes such as development, physiology, metabolism and circadian rhythmicity. RNA-sequencing (RNA-seq) technology has facilitated the sequencing of the whole transcriptome, thereby capturing and quantifying the dynamism of transcriptome-wide RNA expression profiles. However, much remains unrevealed in the huge noncoding RNA datasets that require further bioinformatic analysis. In this study, we applied six bioinformatic tools to investigate coding potentials of approximately 21,000 lncRNAs. A total of 313 lncRNAs are predicted to be coded by all the six tools. Our findings provide insights into the regulatory roles of lncRNAs and set the stage for the functional investigation of these lncRNAs and their encoded micropeptides. ABSTRACT: Recent studies have demonstrated that numerous long noncoding RNAs (ncRNAs having more than 200 nucleotide base pairs (lncRNAs)) actually encode functional micropeptides, which likely represents the next regulatory biology frontier. Thus, identification of coding lncRNAs from ever-increasing lncRNA databases would be a bioinformatic challenge. Here we employed the Coding Potential Alignment Tool (CPAT), Coding Potential Calculator 2 (CPC2), LGC web server, Coding-Non-Coding Identifying Tool (CNIT), RNAsamba, and MicroPeptide identification tool (MiPepid) to analyze approximately 21,000 zebrafish lncRNAs and computationally to identify 2730–6676 zebrafish lncRNAs with high coding potentials, including 313 coding lncRNAs predicted by all the six bioinformatic tools. We also compared the sensitivity and specificity of these six bioinformatic tools for identifying lncRNAs with coding potentials and summarized their strengths and weaknesses. These predicted zebrafish coding lncRNAs set the stage for further experimental studies.
format Online
Article
Text
id pubmed-8145020
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81450202021-05-26 Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish Mishra, Shital Kumar Wang, Han Biology (Basel) Communication SIMPLE SUMMARY: Noncoding RNAs (ncRNAs) regulate a variety of fundamental life processes such as development, physiology, metabolism and circadian rhythmicity. RNA-sequencing (RNA-seq) technology has facilitated the sequencing of the whole transcriptome, thereby capturing and quantifying the dynamism of transcriptome-wide RNA expression profiles. However, much remains unrevealed in the huge noncoding RNA datasets that require further bioinformatic analysis. In this study, we applied six bioinformatic tools to investigate coding potentials of approximately 21,000 lncRNAs. A total of 313 lncRNAs are predicted to be coded by all the six tools. Our findings provide insights into the regulatory roles of lncRNAs and set the stage for the functional investigation of these lncRNAs and their encoded micropeptides. ABSTRACT: Recent studies have demonstrated that numerous long noncoding RNAs (ncRNAs having more than 200 nucleotide base pairs (lncRNAs)) actually encode functional micropeptides, which likely represents the next regulatory biology frontier. Thus, identification of coding lncRNAs from ever-increasing lncRNA databases would be a bioinformatic challenge. Here we employed the Coding Potential Alignment Tool (CPAT), Coding Potential Calculator 2 (CPC2), LGC web server, Coding-Non-Coding Identifying Tool (CNIT), RNAsamba, and MicroPeptide identification tool (MiPepid) to analyze approximately 21,000 zebrafish lncRNAs and computationally to identify 2730–6676 zebrafish lncRNAs with high coding potentials, including 313 coding lncRNAs predicted by all the six bioinformatic tools. We also compared the sensitivity and specificity of these six bioinformatic tools for identifying lncRNAs with coding potentials and summarized their strengths and weaknesses. These predicted zebrafish coding lncRNAs set the stage for further experimental studies. MDPI 2021-04-26 /pmc/articles/PMC8145020/ /pubmed/33925925 http://dx.doi.org/10.3390/biology10050371 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Mishra, Shital Kumar
Wang, Han
Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish
title Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish
title_full Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish
title_fullStr Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish
title_full_unstemmed Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish
title_short Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish
title_sort computational analysis predicts hundreds of coding lncrnas in zebrafish
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8145020/
https://www.ncbi.nlm.nih.gov/pubmed/33925925
http://dx.doi.org/10.3390/biology10050371
work_keys_str_mv AT mishrashitalkumar computationalanalysispredictshundredsofcodinglncrnasinzebrafish
AT wanghan computationalanalysispredictshundredsofcodinglncrnasinzebrafish