
G4Beacon: An In Vivo G4 Prediction Method Using Chromatin and Sequence Information

G-quadruplex (G4) structures are critical epigenetic regulatory elements, which usually form in guanine-rich regions in DNA. However, predicting the formation of G4 structures within living cells remains a challenge. Here, we present an ultra-robust machine learning method, G4Beacon, which utilizes...


Bibliographic Details
Main authors: Zhang, Zhuofan; Zhang, Rongxin; Xiao, Ke; Sun, Xiao
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953394/
https://www.ncbi.nlm.nih.gov/pubmed/36830661
http://dx.doi.org/10.3390/biom13020292
author Zhang, Zhuofan
Zhang, Rongxin
Xiao, Ke
Sun, Xiao
collection PubMed
description G-quadruplex (G4) structures are critical epigenetic regulatory elements, which usually form in guanine-rich regions in DNA. However, predicting the formation of G4 structures within living cells remains a challenge. Here, we present an ultra-robust machine learning method, G4Beacon, which utilizes the Gradient-Boosting Decision Tree (GBDT) algorithm, coupled with the ATAC-seq data and the surrounding sequences of in vitro G4s, to accurately predict the formation ability of these in vitro G4s in different cell types. As a result, our model achieved excellent performance even when the test set was extremely skewed. Besides this, G4Beacon can also identify the in vivo G4s of other cell lines precisely with the model built on a special cell line, regardless of the experimental techniques or platforms. Altogether, G4Beacon is an accurate, reliable, and easy-to-use method for the prediction of in vivo G4s of various cell lines.
format Online
Article
Text
id pubmed-9953394
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9953394 2023-02-25 G4Beacon: An In Vivo G4 Prediction Method Using Chromatin and Sequence Information Zhang, Zhuofan Zhang, Rongxin Xiao, Ke Sun, Xiao Biomolecules Article G-quadruplex (G4) structures are critical epigenetic regulatory elements, which usually form in guanine-rich regions in DNA. However, predicting the formation of G4 structures within living cells remains a challenge. Here, we present an ultra-robust machine learning method, G4Beacon, which utilizes the Gradient-Boosting Decision Tree (GBDT) algorithm, coupled with the ATAC-seq data and the surrounding sequences of in vitro G4s, to accurately predict the formation ability of these in vitro G4s in different cell types. As a result, our model achieved excellent performance even when the test set was extremely skewed. Besides this, G4Beacon can also identify the in vivo G4s of other cell lines precisely with the model built on a special cell line, regardless of the experimental techniques or platforms. Altogether, G4Beacon is an accurate, reliable, and easy-to-use method for the prediction of in vivo G4s of various cell lines. MDPI 2023-02-03 /pmc/articles/PMC9953394/ /pubmed/36830661 http://dx.doi.org/10.3390/biom13020292 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title G4Beacon: An In Vivo G4 Prediction Method Using Chromatin and Sequence Information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953394/
https://www.ncbi.nlm.nih.gov/pubmed/36830661
http://dx.doi.org/10.3390/biom13020292