Cargando…

ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion

MOTIVATION: The origins of replication sites (ORIs) are precise regions inside the DNA sequence where the replication process begins. These locations are critical for preserving the genome’s integrity during cell division and guaranteeing the faithful transfer of genetic data from generation to gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Abbas, Zeeshan, Rehman, Mobeen Ur, Tayara, Hilal, Chong, Kil To
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10639035/
https://www.ncbi.nlm.nih.gov/pubmed/37929975
http://dx.doi.org/10.1093/bioinformatics/btad664
_version_ 1785133711459614720
author Abbas, Zeeshan
Rehman, Mobeen Ur
Tayara, Hilal
Chong, Kil To
author_facet Abbas, Zeeshan
Rehman, Mobeen Ur
Tayara, Hilal
Chong, Kil To
author_sort Abbas, Zeeshan
collection PubMed
description MOTIVATION: The origins of replication sites (ORIs) are precise regions inside the DNA sequence where the replication process begins. These locations are critical for preserving the genome’s integrity during cell division and guaranteeing the faithful transfer of genetic data from generation to generation. The advent of experimental techniques has aided in the discovery of ORIs in many species. Experimentation, on the other hand, is often more time-consuming and pricey than computational approaches, and it necessitates specific equipment and knowledge. Recently, ORI sites have been predicted using computational techniques like motif-based searches and artificial intelligence algorithms based on sequence characteristics and chromatin states. RESULTS: In this article, we developed ORI-Explorer, a unique artificial intelligence-based technique that combines multiple feature engineering techniques to train CatBoost Classifier for recognizing ORIs from four distinct eukaryotic species. ORI-Explorer was created by utilizing a unique combination of three traditional feature-encoding techniques and a feature set obtained from a deep-learning neural network model. The ORI-Explorer has significantly outperformed current predictors on the testing dataset. Furthermore, by employing the sophisticated SHapley Additive exPlanation method, we give crucial insights that aid in comprehending model success, highlighting the most relevant features vital for forecasting cell-specific ORIs. ORI-Explorer is also intended to aid community-wide attempts in discovering potential ORIs and developing innovative verifiable biological hypotheses. AVAILABILITY AND IMPLEMENTATION: The used datasets along with the source code are made available through https://github.com/Z-Abbas/ORI-Explorer and https://zenodo.org/record/8358679.
format Online
Article
Text
id pubmed-10639035
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106390352023-11-11 ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion Abbas, Zeeshan Rehman, Mobeen Ur Tayara, Hilal Chong, Kil To Bioinformatics Original Paper MOTIVATION: The origins of replication sites (ORIs) are precise regions inside the DNA sequence where the replication process begins. These locations are critical for preserving the genome’s integrity during cell division and guaranteeing the faithful transfer of genetic data from generation to generation. The advent of experimental techniques has aided in the discovery of ORIs in many species. Experimentation, on the other hand, is often more time-consuming and pricey than computational approaches, and it necessitates specific equipment and knowledge. Recently, ORI sites have been predicted using computational techniques like motif-based searches and artificial intelligence algorithms based on sequence characteristics and chromatin states. RESULTS: In this article, we developed ORI-Explorer, a unique artificial intelligence-based technique that combines multiple feature engineering techniques to train CatBoost Classifier for recognizing ORIs from four distinct eukaryotic species. ORI-Explorer was created by utilizing a unique combination of three traditional feature-encoding techniques and a feature set obtained from a deep-learning neural network model. The ORI-Explorer has significantly outperformed current predictors on the testing dataset. Furthermore, by employing the sophisticated SHapley Additive exPlanation method, we give crucial insights that aid in comprehending model success, highlighting the most relevant features vital for forecasting cell-specific ORIs. ORI-Explorer is also intended to aid community-wide attempts in discovering potential ORIs and developing innovative verifiable biological hypotheses. AVAILABILITY AND IMPLEMENTATION: The used datasets along with the source code are made available through https://github.com/Z-Abbas/ORI-Explorer and https://zenodo.org/record/8358679. Oxford University Press 2023-10-31 /pmc/articles/PMC10639035/ /pubmed/37929975 http://dx.doi.org/10.1093/bioinformatics/btad664 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Abbas, Zeeshan
Rehman, Mobeen Ur
Tayara, Hilal
Chong, Kil To
ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion
title ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion
title_full ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion
title_fullStr ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion
title_full_unstemmed ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion
title_short ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion
title_sort ori-explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10639035/
https://www.ncbi.nlm.nih.gov/pubmed/37929975
http://dx.doi.org/10.1093/bioinformatics/btad664
work_keys_str_mv AT abbaszeeshan oriexploreraunifiedcellspecifictoolfororiginofreplicationsitespredictionbyfeaturefusion
AT rehmanmobeenur oriexploreraunifiedcellspecifictoolfororiginofreplicationsitespredictionbyfeaturefusion
AT tayarahilal oriexploreraunifiedcellspecifictoolfororiginofreplicationsitespredictionbyfeaturefusion
AT chongkilto oriexploreraunifiedcellspecifictoolfororiginofreplicationsitespredictionbyfeaturefusion