Cargando…

Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting

B cells undergo somatic hypermutation (SHM) of the Immunoglobulin (Ig) variable region to generate high-affinity antibodies. SHM relies on the activity of activation-induced deaminase (AID), which mutates C>U preferentially targeting WRC (W=A/T, R=A/G) hotspots. Downstream mutations at WA Polymer...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Catherine, Krantsevich, Artem, MacCarthy, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8749460/
https://www.ncbi.nlm.nih.gov/pubmed/35036866
http://dx.doi.org/10.1016/j.isci.2021.103668
_version_ 1784631234051178496
author Tang, Catherine
Krantsevich, Artem
MacCarthy, Thomas
author_facet Tang, Catherine
Krantsevich, Artem
MacCarthy, Thomas
author_sort Tang, Catherine
collection PubMed
description B cells undergo somatic hypermutation (SHM) of the Immunoglobulin (Ig) variable region to generate high-affinity antibodies. SHM relies on the activity of activation-induced deaminase (AID), which mutates C>U preferentially targeting WRC (W=A/T, R=A/G) hotspots. Downstream mutations at WA Polymerase η hotspots contribute further mutations. Computational models of SHM can describe the probability of mutations essential for vaccine responses. Previous studies using short subsequences (k-mers) failed to explain divergent mutability for the same k-mer. We developed the DeepSHM (Deep learning on SHM) model using k-mers of size 5–21, improving accuracy over previous models. Interpretation of DeepSHM identified an extended WWRCT motif with particularly high mutability. Increased mutability was further associated with lower surrounding G content. Our model also discovered a conserved AGYCTGGGGG (Y=C/T) motif within FW1 of IGHV3 family genes with unusually high T>G substitution rates. Thus, a wider sequence context increases predictive power and identifies features that drive mutational targeting.
format Online
Article
Text
id pubmed-8749460
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-87494602022-01-14 Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting Tang, Catherine Krantsevich, Artem MacCarthy, Thomas iScience Article B cells undergo somatic hypermutation (SHM) of the Immunoglobulin (Ig) variable region to generate high-affinity antibodies. SHM relies on the activity of activation-induced deaminase (AID), which mutates C>U preferentially targeting WRC (W=A/T, R=A/G) hotspots. Downstream mutations at WA Polymerase η hotspots contribute further mutations. Computational models of SHM can describe the probability of mutations essential for vaccine responses. Previous studies using short subsequences (k-mers) failed to explain divergent mutability for the same k-mer. We developed the DeepSHM (Deep learning on SHM) model using k-mers of size 5–21, improving accuracy over previous models. Interpretation of DeepSHM identified an extended WWRCT motif with particularly high mutability. Increased mutability was further associated with lower surrounding G content. Our model also discovered a conserved AGYCTGGGGG (Y=C/T) motif within FW1 of IGHV3 family genes with unusually high T>G substitution rates. Thus, a wider sequence context increases predictive power and identifies features that drive mutational targeting. Elsevier 2021-12-20 /pmc/articles/PMC8749460/ /pubmed/35036866 http://dx.doi.org/10.1016/j.isci.2021.103668 Text en © 2022 The Authors. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Tang, Catherine
Krantsevich, Artem
MacCarthy, Thomas
Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting
title Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting
title_full Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting
title_fullStr Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting
title_full_unstemmed Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting
title_short Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting
title_sort deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8749460/
https://www.ncbi.nlm.nih.gov/pubmed/35036866
http://dx.doi.org/10.1016/j.isci.2021.103668
work_keys_str_mv AT tangcatherine deeplearningmodelofsomatichypermutationrevealsimportanceofsequencecontextbeyondhotspottargeting
AT krantsevichartem deeplearningmodelofsomatichypermutationrevealsimportanceofsequencecontextbeyondhotspottargeting
AT maccarthythomas deeplearningmodelofsomatichypermutationrevealsimportanceofsequencecontextbeyondhotspottargeting