Cargando…

Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis

Accurate identification of replication origins (ORIs) is crucial for a comprehensive investigation into the progression of human cell growth and cancer therapy. Here, we proposed a computational approach Ori-FinderH, which can efficiently and precisely predict the human ORIs of various lengths by co...

Descripción completa

Detalles Bibliográficos
Autores principales: Yin, Zhen-Ning, Lai, Fei-Liao, Gao, Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10676776/
https://www.ncbi.nlm.nih.gov/pubmed/38008420
http://dx.doi.org/10.1093/bib/bbad432
_version_ 1785149976637079552
author Yin, Zhen-Ning
Lai, Fei-Liao
Gao, Feng
author_facet Yin, Zhen-Ning
Lai, Fei-Liao
Gao, Feng
author_sort Yin, Zhen-Ning
collection PubMed
description Accurate identification of replication origins (ORIs) is crucial for a comprehensive investigation into the progression of human cell growth and cancer therapy. Here, we proposed a computational approach Ori-FinderH, which can efficiently and precisely predict the human ORIs of various lengths by combining the Z-curve method with deep learning approach. Compared with existing methods, Ori-FinderH exhibits superior performance, achieving an area under the receiver operating characteristic curve (AUC) of 0.9616 for K562 cell line in 10-fold cross-validation. In addition, we also established a cross-cell-line predictive model, which yielded a further improved AUC of 0.9706. The model was subsequently employed as a fitness function to support genetic algorithm for generating artificial ORIs. Sequence analysis through iORI-Euk revealed that a vast majority of the created sequences, specifically 98% or more, incorporate at least one ORI for three cell lines (Hela, MCF7 and K562). This innovative approach could provide more efficient, accurate and comprehensive information for experimental investigation, thereby further advancing the development of this field.
format Online
Article
Text
id pubmed-10676776
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106767762023-11-25 Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis Yin, Zhen-Ning Lai, Fei-Liao Gao, Feng Brief Bioinform Problem Solving Protocol Accurate identification of replication origins (ORIs) is crucial for a comprehensive investigation into the progression of human cell growth and cancer therapy. Here, we proposed a computational approach Ori-FinderH, which can efficiently and precisely predict the human ORIs of various lengths by combining the Z-curve method with deep learning approach. Compared with existing methods, Ori-FinderH exhibits superior performance, achieving an area under the receiver operating characteristic curve (AUC) of 0.9616 for K562 cell line in 10-fold cross-validation. In addition, we also established a cross-cell-line predictive model, which yielded a further improved AUC of 0.9706. The model was subsequently employed as a fitness function to support genetic algorithm for generating artificial ORIs. Sequence analysis through iORI-Euk revealed that a vast majority of the created sequences, specifically 98% or more, incorporate at least one ORI for three cell lines (Hela, MCF7 and K562). This innovative approach could provide more efficient, accurate and comprehensive information for experimental investigation, thereby further advancing the development of this field. Oxford University Press 2023-11-25 /pmc/articles/PMC10676776/ /pubmed/38008420 http://dx.doi.org/10.1093/bib/bbad432 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Yin, Zhen-Ning
Lai, Fei-Liao
Gao, Feng
Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
title Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
title_full Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
title_fullStr Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
title_full_unstemmed Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
title_short Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
title_sort unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10676776/
https://www.ncbi.nlm.nih.gov/pubmed/38008420
http://dx.doi.org/10.1093/bib/bbad432
work_keys_str_mv AT yinzhenning unveilinghumanoriginsofreplicationusingdeeplearningaccuratepredictionandcomprehensiveanalysis
AT laifeiliao unveilinghumanoriginsofreplicationusingdeeplearningaccuratepredictionandcomprehensiveanalysis
AT gaofeng unveilinghumanoriginsofreplicationusingdeeplearningaccuratepredictionandcomprehensiveanalysis