Cargando…
iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models
BACKGROUND: Dihydrouridine (D) is one of the most significant uridine modifications that have a prominent occurrence in eukaryotes. The folding and conformational flexibility of transfer RNA (tRNA) can be attained through this modification. OBJECTIVE: The modification also triggers lung cancer in hu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10064468/ https://www.ncbi.nlm.nih.gov/pubmed/37009307 http://dx.doi.org/10.1177/20552076231165963 |
_version_ | 1785017902131314688 |
---|---|
author | Suleman, Muhammad Taseer Alturise, Fahad Alkhalifah, Tamim Khan, Yaser Daanial |
author_facet | Suleman, Muhammad Taseer Alturise, Fahad Alkhalifah, Tamim Khan, Yaser Daanial |
author_sort | Suleman, Muhammad Taseer |
collection | PubMed |
description | BACKGROUND: Dihydrouridine (D) is one of the most significant uridine modifications that have a prominent occurrence in eukaryotes. The folding and conformational flexibility of transfer RNA (tRNA) can be attained through this modification. OBJECTIVE: The modification also triggers lung cancer in humans. The identification of D sites was carried out through conventional laboratory methods; however, those were costly and time-consuming. The readiness of RNA sequences helps in the identification of D sites through computationally intelligent models. However, the most challenging part is turning these biological sequences into distinct vectors. METHODS: The current research proposed novel feature extraction mechanisms and the identification of D sites in tRNA sequences using ensemble models. The ensemble models were then subjected to evaluation using k-fold cross-validation and independent testing. RESULTS: The results revealed that the stacking ensemble model outperformed all the ensemble models by revealing 0.98 accuracy, 0.98 specificity, 0.97 sensitivity, and 0.92 Matthews Correlation Coefficient. The proposed model, iDHU-Ensem, was also compared with pre-existing predictors using an independent test. The accuracy scores have shown that the proposed model in this research study performed better than the available predictors. CONCLUSION: The current research contributed towards the enhancement of D site identification capabilities through computationally intelligent methods. A web-based server, iDHU-Ensem, was also made available for the researchers at https://taseersuleman-idhu-ensem-idhu-ensem.streamlit.app/. |
format | Online Article Text |
id | pubmed-10064468 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-100644682023-04-01 iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models Suleman, Muhammad Taseer Alturise, Fahad Alkhalifah, Tamim Khan, Yaser Daanial Digit Health Original Research BACKGROUND: Dihydrouridine (D) is one of the most significant uridine modifications that have a prominent occurrence in eukaryotes. The folding and conformational flexibility of transfer RNA (tRNA) can be attained through this modification. OBJECTIVE: The modification also triggers lung cancer in humans. The identification of D sites was carried out through conventional laboratory methods; however, those were costly and time-consuming. The readiness of RNA sequences helps in the identification of D sites through computationally intelligent models. However, the most challenging part is turning these biological sequences into distinct vectors. METHODS: The current research proposed novel feature extraction mechanisms and the identification of D sites in tRNA sequences using ensemble models. The ensemble models were then subjected to evaluation using k-fold cross-validation and independent testing. RESULTS: The results revealed that the stacking ensemble model outperformed all the ensemble models by revealing 0.98 accuracy, 0.98 specificity, 0.97 sensitivity, and 0.92 Matthews Correlation Coefficient. The proposed model, iDHU-Ensem, was also compared with pre-existing predictors using an independent test. The accuracy scores have shown that the proposed model in this research study performed better than the available predictors. CONCLUSION: The current research contributed towards the enhancement of D site identification capabilities through computationally intelligent methods. A web-based server, iDHU-Ensem, was also made available for the researchers at https://taseersuleman-idhu-ensem-idhu-ensem.streamlit.app/. SAGE Publications 2023-03-29 /pmc/articles/PMC10064468/ /pubmed/37009307 http://dx.doi.org/10.1177/20552076231165963 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Suleman, Muhammad Taseer Alturise, Fahad Alkhalifah, Tamim Khan, Yaser Daanial iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models |
title | iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models |
title_full | iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models |
title_fullStr | iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models |
title_full_unstemmed | iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models |
title_short | iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models |
title_sort | idhu-ensem: identification of dihydrouridine sites through ensemble learning models |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10064468/ https://www.ncbi.nlm.nih.gov/pubmed/37009307 http://dx.doi.org/10.1177/20552076231165963 |
work_keys_str_mv | AT sulemanmuhammadtaseer idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels AT alturisefahad idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels AT alkhalifahtamim idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels AT khanyaserdaanial idhuensemidentificationofdihydrouridinesitesthroughensemblelearningmodels |