Cargando…

TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization

Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a b...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeon, Young-Jun, Hasan, Md Mehedi, Park, Hyun Woo, Lee, Ki Wook, Manavalan, Balachandran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9294414/
https://www.ncbi.nlm.nih.gov/pubmed/35753698
http://dx.doi.org/10.1093/bib/bbac243
_version_ 1784749848741806080
author Jeon, Young-Jun
Hasan, Md Mehedi
Park, Hyun Woo
Lee, Ki Wook
Manavalan, Balachandran
author_facet Jeon, Young-Jun
Hasan, Md Mehedi
Park, Hyun Woo
Lee, Ki Wook
Manavalan, Balachandran
author_sort Jeon, Young-Jun
collection PubMed
description Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a better understanding of their biological functions and mechanisms. In contrast to traditional experimental methods, bioinformatics or computational methods can be applied for the annotation of lncRNA subcellular locations in humans more effectively. In the past, several machine learning-based methods have been developed to identify lncRNA subcellular localization, but relevant work for identifying cell-specific localization of human lncRNA remains limited. In this study, we present the first application of the tree-based stacking approach, TACOS, which allows users to identify the subcellular localization of human lncRNA in 10 different cell types. Specifically, we conducted comprehensive evaluations of six tree-based classifiers with 10 different feature descriptors, using a newly constructed balanced training dataset for each cell type. Subsequently, the strengths of the AdaBoost baseline models were integrated via a stacking approach, with an appropriate tree-based classifier for the final prediction. TACOS displayed consistent performance in both the cross-validation and independent assessments compared with the other two approaches employed in this study. The user-friendly online TACOS web server can be accessed at https://balalab-skku.org/TACOS.
format Online
Article
Text
id pubmed-9294414
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92944142022-07-20 TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization Jeon, Young-Jun Hasan, Md Mehedi Park, Hyun Woo Lee, Ki Wook Manavalan, Balachandran Brief Bioinform Problem Solving Protocol Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a better understanding of their biological functions and mechanisms. In contrast to traditional experimental methods, bioinformatics or computational methods can be applied for the annotation of lncRNA subcellular locations in humans more effectively. In the past, several machine learning-based methods have been developed to identify lncRNA subcellular localization, but relevant work for identifying cell-specific localization of human lncRNA remains limited. In this study, we present the first application of the tree-based stacking approach, TACOS, which allows users to identify the subcellular localization of human lncRNA in 10 different cell types. Specifically, we conducted comprehensive evaluations of six tree-based classifiers with 10 different feature descriptors, using a newly constructed balanced training dataset for each cell type. Subsequently, the strengths of the AdaBoost baseline models were integrated via a stacking approach, with an appropriate tree-based classifier for the final prediction. TACOS displayed consistent performance in both the cross-validation and independent assessments compared with the other two approaches employed in this study. The user-friendly online TACOS web server can be accessed at https://balalab-skku.org/TACOS. Oxford University Press 2022-06-27 /pmc/articles/PMC9294414/ /pubmed/35753698 http://dx.doi.org/10.1093/bib/bbac243 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Jeon, Young-Jun
Hasan, Md Mehedi
Park, Hyun Woo
Lee, Ki Wook
Manavalan, Balachandran
TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization
title TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization
title_full TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization
title_fullStr TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization
title_full_unstemmed TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization
title_short TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization
title_sort tacos: a novel approach for accurate prediction of cell-specific long noncoding rnas subcellular localization
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9294414/
https://www.ncbi.nlm.nih.gov/pubmed/35753698
http://dx.doi.org/10.1093/bib/bbac243
work_keys_str_mv AT jeonyoungjun tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization
AT hasanmdmehedi tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization
AT parkhyunwoo tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization
AT leekiwook tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization
AT manavalanbalachandran tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization