Cargando…
TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization
Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a b...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9294414/ https://www.ncbi.nlm.nih.gov/pubmed/35753698 http://dx.doi.org/10.1093/bib/bbac243 |
_version_ | 1784749848741806080 |
---|---|
author | Jeon, Young-Jun Hasan, Md Mehedi Park, Hyun Woo Lee, Ki Wook Manavalan, Balachandran |
author_facet | Jeon, Young-Jun Hasan, Md Mehedi Park, Hyun Woo Lee, Ki Wook Manavalan, Balachandran |
author_sort | Jeon, Young-Jun |
collection | PubMed |
description | Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a better understanding of their biological functions and mechanisms. In contrast to traditional experimental methods, bioinformatics or computational methods can be applied for the annotation of lncRNA subcellular locations in humans more effectively. In the past, several machine learning-based methods have been developed to identify lncRNA subcellular localization, but relevant work for identifying cell-specific localization of human lncRNA remains limited. In this study, we present the first application of the tree-based stacking approach, TACOS, which allows users to identify the subcellular localization of human lncRNA in 10 different cell types. Specifically, we conducted comprehensive evaluations of six tree-based classifiers with 10 different feature descriptors, using a newly constructed balanced training dataset for each cell type. Subsequently, the strengths of the AdaBoost baseline models were integrated via a stacking approach, with an appropriate tree-based classifier for the final prediction. TACOS displayed consistent performance in both the cross-validation and independent assessments compared with the other two approaches employed in this study. The user-friendly online TACOS web server can be accessed at https://balalab-skku.org/TACOS. |
format | Online Article Text |
id | pubmed-9294414 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-92944142022-07-20 TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization Jeon, Young-Jun Hasan, Md Mehedi Park, Hyun Woo Lee, Ki Wook Manavalan, Balachandran Brief Bioinform Problem Solving Protocol Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a better understanding of their biological functions and mechanisms. In contrast to traditional experimental methods, bioinformatics or computational methods can be applied for the annotation of lncRNA subcellular locations in humans more effectively. In the past, several machine learning-based methods have been developed to identify lncRNA subcellular localization, but relevant work for identifying cell-specific localization of human lncRNA remains limited. In this study, we present the first application of the tree-based stacking approach, TACOS, which allows users to identify the subcellular localization of human lncRNA in 10 different cell types. Specifically, we conducted comprehensive evaluations of six tree-based classifiers with 10 different feature descriptors, using a newly constructed balanced training dataset for each cell type. Subsequently, the strengths of the AdaBoost baseline models were integrated via a stacking approach, with an appropriate tree-based classifier for the final prediction. TACOS displayed consistent performance in both the cross-validation and independent assessments compared with the other two approaches employed in this study. The user-friendly online TACOS web server can be accessed at https://balalab-skku.org/TACOS. Oxford University Press 2022-06-27 /pmc/articles/PMC9294414/ /pubmed/35753698 http://dx.doi.org/10.1093/bib/bbac243 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Problem Solving Protocol Jeon, Young-Jun Hasan, Md Mehedi Park, Hyun Woo Lee, Ki Wook Manavalan, Balachandran TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization |
title | TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization |
title_full | TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization |
title_fullStr | TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization |
title_full_unstemmed | TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization |
title_short | TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization |
title_sort | tacos: a novel approach for accurate prediction of cell-specific long noncoding rnas subcellular localization |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9294414/ https://www.ncbi.nlm.nih.gov/pubmed/35753698 http://dx.doi.org/10.1093/bib/bbac243 |
work_keys_str_mv | AT jeonyoungjun tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization AT hasanmdmehedi tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization AT parkhyunwoo tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization AT leekiwook tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization AT manavalanbalachandran tacosanovelapproachforaccuratepredictionofcellspecificlongnoncodingrnassubcellularlocalization |