UNADON: transformer-based model to predict genome-wide chromosome spatial position

MOTIVATION: The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well u...

Bibliographic Details
Main Authors: Yang, Muyu; Ma, Jian
Format: Online Article Text
Language: English
Published: Oxford University Press 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311299/
https://www.ncbi.nlm.nih.gov/pubmed/37387176
http://dx.doi.org/10.1093/bioinformatics/btad246
author Yang, Muyu
Ma, Jian
collection PubMed
description MOTIVATION: The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well understood. RESULTS: Here, we develop a new transformer-based deep learning model called UNADON, which predicts the genome-wide cytological distance to a specific type of nuclear body, as measured by TSA-seq, using both sequence features and epigenomic signals. Evaluations of UNADON in four cell lines (K562, H1, HFFc6, HCT116) show high accuracy in predicting chromatin spatial positioning to nuclear bodies when trained on a single cell line. UNADON also performed well in an unseen cell type. Importantly, we reveal potential sequence and epigenomic factors that affect large-scale chromatin compartmentalization in nuclear bodies. Together, UNADON provides new insights into the principles between sequence features and large-scale chromatin spatial localization, which has important implications for understanding nuclear structure and function. AVAILABILITY AND IMPLEMENTATION: The source code of UNADON can be found at https://github.com/ma-compbio/UNADON.
format Online
Article
Text
id pubmed-10311299
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-10311299 2023-07-01 UNADON: transformer-based model to predict genome-wide chromosome spatial position Yang, Muyu Ma, Jian Bioinformatics General Computational Biology MOTIVATION: The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well understood. RESULTS: Here, we develop a new transformer-based deep learning model called UNADON, which predicts the genome-wide cytological distance to a specific type of nuclear body, as measured by TSA-seq, using both sequence features and epigenomic signals. Evaluations of UNADON in four cell lines (K562, H1, HFFc6, HCT116) show high accuracy in predicting chromatin spatial positioning to nuclear bodies when trained on a single cell line. UNADON also performed well in an unseen cell type. Importantly, we reveal potential sequence and epigenomic factors that affect large-scale chromatin compartmentalization in nuclear bodies. Together, UNADON provides new insights into the principles between sequence features and large-scale chromatin spatial localization, which has important implications for understanding nuclear structure and function. AVAILABILITY AND IMPLEMENTATION: The source code of UNADON can be found at https://github.com/ma-compbio/UNADON. Oxford University Press 2023-06-30 /pmc/articles/PMC10311299/ /pubmed/37387176 http://dx.doi.org/10.1093/bioinformatics/btad246 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title UNADON: transformer-based model to predict genome-wide chromosome spatial position
topic General Computational Biology