Cargando…

Deep learning for MYC binding site recognition

Motivation: The definition of the genome distribution of the Myc transcription factor is extremely important since it may help predict its transcriptional activity particularly in the context of cancer. Myc is among the most powerful oncogenes involved in the occurrence and development of more than...

Descripción completa

Detalles Bibliográficos
Autores principales: Fioresi, R., Demurtas, P., Perini, G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760990/
https://www.ncbi.nlm.nih.gov/pubmed/36544623
http://dx.doi.org/10.3389/fbinf.2022.1015993
_version_ 1784852607590727680
author Fioresi, R.
Demurtas, P.
Perini, G.
author_facet Fioresi, R.
Demurtas, P.
Perini, G.
author_sort Fioresi, R.
collection PubMed
description Motivation: The definition of the genome distribution of the Myc transcription factor is extremely important since it may help predict its transcriptional activity particularly in the context of cancer. Myc is among the most powerful oncogenes involved in the occurrence and development of more than 80% of different types of pediatric and adult cancers. Myc regulates thousands of genes which can be in part different, depending on the type of tissues and tumours. Myc distribution along the genome has been determined experimentally through chromatin immunoprecipitation This approach, although powerful, is very time consuming and cannot be routinely applied to tumours of individual patients. Thus, it becomes of paramount importance to develop in silico tools that can effectively and rapidly predict its distribution on a given cell genome. New advanced computational tools (DeeperBind) can then be successfully employed to determine the function of Myc in a specific tumour, and may help to devise new directions and approaches to experiments first and personalized and more effective therapeutic treatments for a single patient later on. Results: The use of DeeperBind with DeepRAM on Colab platform (Google) can effectively predict the binding sites for the MYC factor with an accuracy above 0.96 AUC, when trained with multiple cell lines. The analysis of the filters in DeeperBind trained models shows, besides the consensus sequence CACGTG classically associated to the MYC factor, also the other consensus sequences G/C box or TGGGA, respectively bound by the SP1 and MIZ-1 transcription factors, which are known to mediate the MYC repressive response. Overall, our findings suggest a stronger synergy between the machine learning tools as DeeperBind and biological experiments, which may reduce the time consuming experiments by providing a direction to guide them.
format Online
Article
Text
id pubmed-9760990
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97609902022-12-20 Deep learning for MYC binding site recognition Fioresi, R. Demurtas, P. Perini, G. Front Bioinform Bioinformatics Motivation: The definition of the genome distribution of the Myc transcription factor is extremely important since it may help predict its transcriptional activity particularly in the context of cancer. Myc is among the most powerful oncogenes involved in the occurrence and development of more than 80% of different types of pediatric and adult cancers. Myc regulates thousands of genes which can be in part different, depending on the type of tissues and tumours. Myc distribution along the genome has been determined experimentally through chromatin immunoprecipitation This approach, although powerful, is very time consuming and cannot be routinely applied to tumours of individual patients. Thus, it becomes of paramount importance to develop in silico tools that can effectively and rapidly predict its distribution on a given cell genome. New advanced computational tools (DeeperBind) can then be successfully employed to determine the function of Myc in a specific tumour, and may help to devise new directions and approaches to experiments first and personalized and more effective therapeutic treatments for a single patient later on. Results: The use of DeeperBind with DeepRAM on Colab platform (Google) can effectively predict the binding sites for the MYC factor with an accuracy above 0.96 AUC, when trained with multiple cell lines. The analysis of the filters in DeeperBind trained models shows, besides the consensus sequence CACGTG classically associated to the MYC factor, also the other consensus sequences G/C box or TGGGA, respectively bound by the SP1 and MIZ-1 transcription factors, which are known to mediate the MYC repressive response. Overall, our findings suggest a stronger synergy between the machine learning tools as DeeperBind and biological experiments, which may reduce the time consuming experiments by providing a direction to guide them. Frontiers Media S.A. 2022-12-05 /pmc/articles/PMC9760990/ /pubmed/36544623 http://dx.doi.org/10.3389/fbinf.2022.1015993 Text en Copyright © 2022 Fioresi, Demurtas and Perini. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Fioresi, R.
Demurtas, P.
Perini, G.
Deep learning for MYC binding site recognition
title Deep learning for MYC binding site recognition
title_full Deep learning for MYC binding site recognition
title_fullStr Deep learning for MYC binding site recognition
title_full_unstemmed Deep learning for MYC binding site recognition
title_short Deep learning for MYC binding site recognition
title_sort deep learning for myc binding site recognition
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760990/
https://www.ncbi.nlm.nih.gov/pubmed/36544623
http://dx.doi.org/10.3389/fbinf.2022.1015993
work_keys_str_mv AT fioresir deeplearningformycbindingsiterecognition
AT demurtasp deeplearningformycbindingsiterecognition
AT perinig deeplearningformycbindingsiterecognition