Cargando…

Examining clustered somatic mutations with SigProfilerClusters

MOTIVATION: Clustered mutations are found in the human germline as well as in the genomes of cancer and normal somatic cells. Clustered events can be imprinted by a multitude of mutational processes, and they have been implicated in both cancer evolution and development disorders. Existing tools for...

Descripción completa

Detalles Bibliográficos
Autores principales: Bergstrom, Erik N, Kundu, Mousumy, Tbeileh, Noura, Alexandrov, Ludmil B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237733/
https://www.ncbi.nlm.nih.gov/pubmed/35595234
http://dx.doi.org/10.1093/bioinformatics/btac335
_version_ 1784736862790746112
author Bergstrom, Erik N
Kundu, Mousumy
Tbeileh, Noura
Alexandrov, Ludmil B
author_facet Bergstrom, Erik N
Kundu, Mousumy
Tbeileh, Noura
Alexandrov, Ludmil B
author_sort Bergstrom, Erik N
collection PubMed
description MOTIVATION: Clustered mutations are found in the human germline as well as in the genomes of cancer and normal somatic cells. Clustered events can be imprinted by a multitude of mutational processes, and they have been implicated in both cancer evolution and development disorders. Existing tools for identifying clustered mutations have been optimized for a particular subtype of clustered event and, in most cases, relied on a predefined inter-mutational distance (IMD) cutoff combined with a piecewise linear regression analysis. RESULTS: Here, we present SigProfilerClusters, an automated tool for detecting all types of clustered mutations by calculating a sample-dependent IMD threshold using a simulated background model that takes into account extended sequence context, transcriptional strand asymmetries and regional mutation densities. SigProfilerClusters disentangles all types of clustered events from non-clustered mutations and annotates each clustered event into an established subclass, including the widely used classes of doublet-base substitutions, multi-base substitutions, omikli and kataegis. SigProfilerClusters outputs non-clustered mutations and clustered events using standard data formats as well as provides multiple visualizations for exploring the distributions and patterns of clustered mutations across the genome. AVAILABILITY AND IMPLEMENTATION: SigProfilerClusters is supported across most operating systems and made freely available at https://github.com/AlexandrovLab/SigProfilerClusters with an extensive documentation located at https://osf.io/qpmzw/wiki/home/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9237733
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92377332022-06-29 Examining clustered somatic mutations with SigProfilerClusters Bergstrom, Erik N Kundu, Mousumy Tbeileh, Noura Alexandrov, Ludmil B Bioinformatics Applications Note MOTIVATION: Clustered mutations are found in the human germline as well as in the genomes of cancer and normal somatic cells. Clustered events can be imprinted by a multitude of mutational processes, and they have been implicated in both cancer evolution and development disorders. Existing tools for identifying clustered mutations have been optimized for a particular subtype of clustered event and, in most cases, relied on a predefined inter-mutational distance (IMD) cutoff combined with a piecewise linear regression analysis. RESULTS: Here, we present SigProfilerClusters, an automated tool for detecting all types of clustered mutations by calculating a sample-dependent IMD threshold using a simulated background model that takes into account extended sequence context, transcriptional strand asymmetries and regional mutation densities. SigProfilerClusters disentangles all types of clustered events from non-clustered mutations and annotates each clustered event into an established subclass, including the widely used classes of doublet-base substitutions, multi-base substitutions, omikli and kataegis. SigProfilerClusters outputs non-clustered mutations and clustered events using standard data formats as well as provides multiple visualizations for exploring the distributions and patterns of clustered mutations across the genome. AVAILABILITY AND IMPLEMENTATION: SigProfilerClusters is supported across most operating systems and made freely available at https://github.com/AlexandrovLab/SigProfilerClusters with an extensive documentation located at https://osf.io/qpmzw/wiki/home/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-05-20 /pmc/articles/PMC9237733/ /pubmed/35595234 http://dx.doi.org/10.1093/bioinformatics/btac335 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Bergstrom, Erik N
Kundu, Mousumy
Tbeileh, Noura
Alexandrov, Ludmil B
Examining clustered somatic mutations with SigProfilerClusters
title Examining clustered somatic mutations with SigProfilerClusters
title_full Examining clustered somatic mutations with SigProfilerClusters
title_fullStr Examining clustered somatic mutations with SigProfilerClusters
title_full_unstemmed Examining clustered somatic mutations with SigProfilerClusters
title_short Examining clustered somatic mutations with SigProfilerClusters
title_sort examining clustered somatic mutations with sigprofilerclusters
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237733/
https://www.ncbi.nlm.nih.gov/pubmed/35595234
http://dx.doi.org/10.1093/bioinformatics/btac335
work_keys_str_mv AT bergstromerikn examiningclusteredsomaticmutationswithsigprofilerclusters
AT kundumousumy examiningclusteredsomaticmutationswithsigprofilerclusters
AT tbeilehnoura examiningclusteredsomaticmutationswithsigprofilerclusters
AT alexandrovludmilb examiningclusteredsomaticmutationswithsigprofilerclusters