Cargando…

Prediction of G4 formation in live cells with epigenetic data: a deep learning approach

G-quadruplexes (G4s) are secondary structures abundant in DNA that may play regulatory roles in cells. Despite the ubiquity of the putative G-quadruplex-forming sequences (PQS) in the human genome, only a small fraction forms G4 structures in cells. Folded G4, histone methylation and chromatin acces...

Descripción completa

Detalles Bibliográficos
Autores principales: Korsakova, Anna, Phan, Anh Tuân
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10448861/
https://www.ncbi.nlm.nih.gov/pubmed/37636021
http://dx.doi.org/10.1093/nargab/lqad071
_version_ 1785094825564962816
author Korsakova, Anna
Phan, Anh Tuân
author_facet Korsakova, Anna
Phan, Anh Tuân
author_sort Korsakova, Anna
collection PubMed
description G-quadruplexes (G4s) are secondary structures abundant in DNA that may play regulatory roles in cells. Despite the ubiquity of the putative G-quadruplex-forming sequences (PQS) in the human genome, only a small fraction forms G4 structures in cells. Folded G4, histone methylation and chromatin accessibility are all parts of the complex cis regulatory landscape. We propose an approach for prediction of G4 formation in cells that incorporates epigenetic and chromatin accessibility data. The novel approach termed epiG4NN efficiently predicts cell-specific G4 formation in live cells based on a local epigenomic snapshot. Our results confirm the close relationship between H3K4me3 histone methylation, chromatin accessibility and G4 structure formation. Trained on A549 cell data, epiG4NN was then able to predict G4 formation in HEK293T and K562 cell lines. We observe the dependency of model performance with different epigenetic features on the underlying experimental condition of G4 detection. We expect that this approach will contribute to the systematic understanding of correlations between structural and epigenomic feature landscape.
format Online
Article
Text
id pubmed-10448861
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104488612023-08-25 Prediction of G4 formation in live cells with epigenetic data: a deep learning approach Korsakova, Anna Phan, Anh Tuân NAR Genom Bioinform Standard Article G-quadruplexes (G4s) are secondary structures abundant in DNA that may play regulatory roles in cells. Despite the ubiquity of the putative G-quadruplex-forming sequences (PQS) in the human genome, only a small fraction forms G4 structures in cells. Folded G4, histone methylation and chromatin accessibility are all parts of the complex cis regulatory landscape. We propose an approach for prediction of G4 formation in cells that incorporates epigenetic and chromatin accessibility data. The novel approach termed epiG4NN efficiently predicts cell-specific G4 formation in live cells based on a local epigenomic snapshot. Our results confirm the close relationship between H3K4me3 histone methylation, chromatin accessibility and G4 structure formation. Trained on A549 cell data, epiG4NN was then able to predict G4 formation in HEK293T and K562 cell lines. We observe the dependency of model performance with different epigenetic features on the underlying experimental condition of G4 detection. We expect that this approach will contribute to the systematic understanding of correlations between structural and epigenomic feature landscape. Oxford University Press 2023-08-24 /pmc/articles/PMC10448861/ /pubmed/37636021 http://dx.doi.org/10.1093/nargab/lqad071 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Korsakova, Anna
Phan, Anh Tuân
Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_full Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_fullStr Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_full_unstemmed Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_short Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_sort prediction of g4 formation in live cells with epigenetic data: a deep learning approach
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10448861/
https://www.ncbi.nlm.nih.gov/pubmed/37636021
http://dx.doi.org/10.1093/nargab/lqad071
work_keys_str_mv AT korsakovaanna predictionofg4formationinlivecellswithepigeneticdataadeeplearningapproach
AT phananhtuan predictionofg4formationinlivecellswithepigeneticdataadeeplearningapproach