Cargando…

Predicting regional somatic mutation rates using DNA motifs

How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Cong, Wang, Zengmiao, Wang, Jun, Liu, Chengyu, Wang, Mengchi, Ngo, Vu, Wang, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569533/
https://www.ncbi.nlm.nih.gov/pubmed/37782656
http://dx.doi.org/10.1371/journal.pcbi.1011536
_version_ 1785119565796081664
author Liu, Cong
Wang, Zengmiao
Wang, Jun
Liu, Chengyu
Wang, Mengchi
Ngo, Vu
Wang, Wei
author_facet Liu, Cong
Wang, Zengmiao
Wang, Jun
Liu, Chengyu
Wang, Mengchi
Ngo, Vu
Wang, Wei
author_sort Liu, Cong
collection PubMed
description How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predict biological outputs depending on local epigenetic state such as somatic mutation rates would confirm their functionality. Here, we used DNA motifs including known TF motifs and epi-motifs as a surrogate of epigenetic signals to predict somatic mutation rates in 13 cancers at an average 23kbp resolution. We implemented an interpretable neural network model, called contextual regression, to successfully learn the universal relationship between mutations and DNA motifs, and uncovered motifs that are most impactful on the regional mutation rates such as TP53 and epi-motifs associated with H3K9me3. Furthermore, we identified genomic regions with significantly higher mutation rates than the expected values in each individual tumor and demonstrated that such cancer-related regions can accurately predict cancer types. Interestingly, we found that the same mutation signatures often have different contributions to cancer-related and cancer-independent regions, and we also identified the motifs with the most contribution to each mutation signature.
format Online
Article
Text
id pubmed-10569533
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105695332023-10-13 Predicting regional somatic mutation rates using DNA motifs Liu, Cong Wang, Zengmiao Wang, Jun Liu, Chengyu Wang, Mengchi Ngo, Vu Wang, Wei PLoS Comput Biol Research Article How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predict biological outputs depending on local epigenetic state such as somatic mutation rates would confirm their functionality. Here, we used DNA motifs including known TF motifs and epi-motifs as a surrogate of epigenetic signals to predict somatic mutation rates in 13 cancers at an average 23kbp resolution. We implemented an interpretable neural network model, called contextual regression, to successfully learn the universal relationship between mutations and DNA motifs, and uncovered motifs that are most impactful on the regional mutation rates such as TP53 and epi-motifs associated with H3K9me3. Furthermore, we identified genomic regions with significantly higher mutation rates than the expected values in each individual tumor and demonstrated that such cancer-related regions can accurately predict cancer types. Interestingly, we found that the same mutation signatures often have different contributions to cancer-related and cancer-independent regions, and we also identified the motifs with the most contribution to each mutation signature. Public Library of Science 2023-10-02 /pmc/articles/PMC10569533/ /pubmed/37782656 http://dx.doi.org/10.1371/journal.pcbi.1011536 Text en © 2023 Liu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Liu, Cong
Wang, Zengmiao
Wang, Jun
Liu, Chengyu
Wang, Mengchi
Ngo, Vu
Wang, Wei
Predicting regional somatic mutation rates using DNA motifs
title Predicting regional somatic mutation rates using DNA motifs
title_full Predicting regional somatic mutation rates using DNA motifs
title_fullStr Predicting regional somatic mutation rates using DNA motifs
title_full_unstemmed Predicting regional somatic mutation rates using DNA motifs
title_short Predicting regional somatic mutation rates using DNA motifs
title_sort predicting regional somatic mutation rates using dna motifs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569533/
https://www.ncbi.nlm.nih.gov/pubmed/37782656
http://dx.doi.org/10.1371/journal.pcbi.1011536
work_keys_str_mv AT liucong predictingregionalsomaticmutationratesusingdnamotifs
AT wangzengmiao predictingregionalsomaticmutationratesusingdnamotifs
AT wangjun predictingregionalsomaticmutationratesusingdnamotifs
AT liuchengyu predictingregionalsomaticmutationratesusingdnamotifs
AT wangmengchi predictingregionalsomaticmutationratesusingdnamotifs
AT ngovu predictingregionalsomaticmutationratesusingdnamotifs
AT wangwei predictingregionalsomaticmutationratesusingdnamotifs