Cargando…

iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models

BACKGROUND: Dihydrouridine (D) is one of the most significant uridine modifications that have a prominent occurrence in eukaryotes. The folding and conformational flexibility of transfer RNA (tRNA) can be attained through this modification. OBJECTIVE: The modification also triggers lung cancer in hu...

Descripción completa

Detalles Bibliográficos
Autores principales: Suleman, Muhammad Taseer, Alturise, Fahad, Alkhalifah, Tamim, Khan, Yaser Daanial
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10064468/
https://www.ncbi.nlm.nih.gov/pubmed/37009307
http://dx.doi.org/10.1177/20552076231165963
_version_ 1785017902131314688
author Suleman, Muhammad Taseer
Alturise, Fahad
Alkhalifah, Tamim
Khan, Yaser Daanial
author_facet Suleman, Muhammad Taseer
Alturise, Fahad
Alkhalifah, Tamim
Khan, Yaser Daanial
author_sort Suleman, Muhammad Taseer
collection PubMed
description BACKGROUND: Dihydrouridine (D) is one of the most significant uridine modifications that have a prominent occurrence in eukaryotes. The folding and conformational flexibility of transfer RNA (tRNA) can be attained through this modification. OBJECTIVE: The modification also triggers lung cancer in humans. The identification of D sites was carried out through conventional laboratory methods; however, those were costly and time-consuming. The readiness of RNA sequences helps in the identification of D sites through computationally intelligent models. However, the most challenging part is turning these biological sequences into distinct vectors. METHODS: The current research proposed novel feature extraction mechanisms and the identification of D sites in tRNA sequences using ensemble models. The ensemble models were then subjected to evaluation using k-fold cross-validation and independent testing. RESULTS: The results revealed that the stacking ensemble model outperformed all the ensemble models by revealing 0.98 accuracy, 0.98 specificity, 0.97 sensitivity, and 0.92 Matthews Correlation Coefficient. The proposed model, iDHU-Ensem, was also compared with pre-existing predictors using an independent test. The accuracy scores have shown that the proposed model in this research study performed better than the available predictors. CONCLUSION: The current research contributed towards the enhancement of D site identification capabilities through computationally intelligent methods. A web-based server, iDHU-Ensem, was also made available for the researchers at https://taseersuleman-idhu-ensem-idhu-ensem.streamlit.app/.
format Online
Article
Text
id pubmed-10064468
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-100644682023-04-01 iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models Suleman, Muhammad Taseer Alturise, Fahad Alkhalifah, Tamim Khan, Yaser Daanial Digit Health Original Research BACKGROUND: Dihydrouridine (D) is one of the most significant uridine modifications that have a prominent occurrence in eukaryotes. The folding and conformational flexibility of transfer RNA (tRNA) can be attained through this modification. OBJECTIVE: The modification also triggers lung cancer in humans. The identification of D sites was carried out through conventional laboratory methods; however, those were costly and time-consuming. The readiness of RNA sequences helps in the identification of D sites through computationally intelligent models. However, the most challenging part is turning these biological sequences into distinct vectors. METHODS: The current research proposed novel feature extraction mechanisms and the identification of D sites in tRNA sequences using ensemble models. The ensemble models were then subjected to evaluation using k-fold cross-validation and independent testing. RESULTS: The results revealed that the stacking ensemble model outperformed all the ensemble models by revealing 0.98 accuracy, 0.98 specificity, 0.97 sensitivity, and 0.92 Matthews Correlation Coefficient. The proposed model, iDHU-Ensem, was also compared with pre-existing predictors using an independent test. The accuracy scores have shown that the proposed model in this research study performed better than the available predictors. CONCLUSION: The current research contributed towards the enhancement of D site identification capabilities through computationally intelligent methods. A web-based server, iDHU-Ensem, was also made available for the researchers at https://taseersuleman-idhu-ensem-idhu-ensem.streamlit.app/. SAGE Publications 2023-03-29 /pmc/articles/PMC10064468/ /pubmed/37009307 http://dx.doi.org/10.1177/20552076231165963 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Suleman, Muhammad Taseer
Alturise, Fahad
Alkhalifah, Tamim
Khan, Yaser Daanial
iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models
title iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models
title_full iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models
title_fullStr iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models
title_full_unstemmed iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models
title_short iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models
title_sort idhu-ensem: identification of dihydrouridine sites through ensemble learning models
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10064468/
https://www.ncbi.nlm.nih.gov/pubmed/37009307
http://dx.doi.org/10.1177/20552076231165963
work_keys_str_mv AT sulemanmuhammadtaseer idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels
AT alturisefahad idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels
AT alkhalifahtamim idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels
AT khanyaserdaanial idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels