Cargando…
Predicting regional somatic mutation rates using DNA motifs
How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predi...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569533/ https://www.ncbi.nlm.nih.gov/pubmed/37782656 http://dx.doi.org/10.1371/journal.pcbi.1011536 |
_version_ | 1785119565796081664 |
---|---|
author | Liu, Cong Wang, Zengmiao Wang, Jun Liu, Chengyu Wang, Mengchi Ngo, Vu Wang, Wei |
author_facet | Liu, Cong Wang, Zengmiao Wang, Jun Liu, Chengyu Wang, Mengchi Ngo, Vu Wang, Wei |
author_sort | Liu, Cong |
collection | PubMed |
description | How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predict biological outputs depending on local epigenetic state such as somatic mutation rates would confirm their functionality. Here, we used DNA motifs including known TF motifs and epi-motifs as a surrogate of epigenetic signals to predict somatic mutation rates in 13 cancers at an average 23kbp resolution. We implemented an interpretable neural network model, called contextual regression, to successfully learn the universal relationship between mutations and DNA motifs, and uncovered motifs that are most impactful on the regional mutation rates such as TP53 and epi-motifs associated with H3K9me3. Furthermore, we identified genomic regions with significantly higher mutation rates than the expected values in each individual tumor and demonstrated that such cancer-related regions can accurately predict cancer types. Interestingly, we found that the same mutation signatures often have different contributions to cancer-related and cancer-independent regions, and we also identified the motifs with the most contribution to each mutation signature. |
format | Online Article Text |
id | pubmed-10569533 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-105695332023-10-13 Predicting regional somatic mutation rates using DNA motifs Liu, Cong Wang, Zengmiao Wang, Jun Liu, Chengyu Wang, Mengchi Ngo, Vu Wang, Wei PLoS Comput Biol Research Article How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predict biological outputs depending on local epigenetic state such as somatic mutation rates would confirm their functionality. Here, we used DNA motifs including known TF motifs and epi-motifs as a surrogate of epigenetic signals to predict somatic mutation rates in 13 cancers at an average 23kbp resolution. We implemented an interpretable neural network model, called contextual regression, to successfully learn the universal relationship between mutations and DNA motifs, and uncovered motifs that are most impactful on the regional mutation rates such as TP53 and epi-motifs associated with H3K9me3. Furthermore, we identified genomic regions with significantly higher mutation rates than the expected values in each individual tumor and demonstrated that such cancer-related regions can accurately predict cancer types. Interestingly, we found that the same mutation signatures often have different contributions to cancer-related and cancer-independent regions, and we also identified the motifs with the most contribution to each mutation signature. Public Library of Science 2023-10-02 /pmc/articles/PMC10569533/ /pubmed/37782656 http://dx.doi.org/10.1371/journal.pcbi.1011536 Text en © 2023 Liu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Liu, Cong Wang, Zengmiao Wang, Jun Liu, Chengyu Wang, Mengchi Ngo, Vu Wang, Wei Predicting regional somatic mutation rates using DNA motifs |
title | Predicting regional somatic mutation rates using DNA motifs |
title_full | Predicting regional somatic mutation rates using DNA motifs |
title_fullStr | Predicting regional somatic mutation rates using DNA motifs |
title_full_unstemmed | Predicting regional somatic mutation rates using DNA motifs |
title_short | Predicting regional somatic mutation rates using DNA motifs |
title_sort | predicting regional somatic mutation rates using dna motifs |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569533/ https://www.ncbi.nlm.nih.gov/pubmed/37782656 http://dx.doi.org/10.1371/journal.pcbi.1011536 |
work_keys_str_mv | AT liucong predictingregionalsomaticmutationratesusingdnamotifs AT wangzengmiao predictingregionalsomaticmutationratesusingdnamotifs AT wangjun predictingregionalsomaticmutationratesusingdnamotifs AT liuchengyu predictingregionalsomaticmutationratesusingdnamotifs AT wangmengchi predictingregionalsomaticmutationratesusingdnamotifs AT ngovu predictingregionalsomaticmutationratesusingdnamotifs AT wangwei predictingregionalsomaticmutationratesusingdnamotifs |