Cargando…
Examining clustered somatic mutations with SigProfilerClusters
MOTIVATION: Clustered mutations are found in the human germline as well as in the genomes of cancer and normal somatic cells. Clustered events can be imprinted by a multitude of mutational processes, and they have been implicated in both cancer evolution and development disorders. Existing tools for...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237733/ https://www.ncbi.nlm.nih.gov/pubmed/35595234 http://dx.doi.org/10.1093/bioinformatics/btac335 |
_version_ | 1784736862790746112 |
---|---|
author | Bergstrom, Erik N Kundu, Mousumy Tbeileh, Noura Alexandrov, Ludmil B |
author_facet | Bergstrom, Erik N Kundu, Mousumy Tbeileh, Noura Alexandrov, Ludmil B |
author_sort | Bergstrom, Erik N |
collection | PubMed |
description | MOTIVATION: Clustered mutations are found in the human germline as well as in the genomes of cancer and normal somatic cells. Clustered events can be imprinted by a multitude of mutational processes, and they have been implicated in both cancer evolution and development disorders. Existing tools for identifying clustered mutations have been optimized for a particular subtype of clustered event and, in most cases, relied on a predefined inter-mutational distance (IMD) cutoff combined with a piecewise linear regression analysis. RESULTS: Here, we present SigProfilerClusters, an automated tool for detecting all types of clustered mutations by calculating a sample-dependent IMD threshold using a simulated background model that takes into account extended sequence context, transcriptional strand asymmetries and regional mutation densities. SigProfilerClusters disentangles all types of clustered events from non-clustered mutations and annotates each clustered event into an established subclass, including the widely used classes of doublet-base substitutions, multi-base substitutions, omikli and kataegis. SigProfilerClusters outputs non-clustered mutations and clustered events using standard data formats as well as provides multiple visualizations for exploring the distributions and patterns of clustered mutations across the genome. AVAILABILITY AND IMPLEMENTATION: SigProfilerClusters is supported across most operating systems and made freely available at https://github.com/AlexandrovLab/SigProfilerClusters with an extensive documentation located at https://osf.io/qpmzw/wiki/home/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9237733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-92377332022-06-29 Examining clustered somatic mutations with SigProfilerClusters Bergstrom, Erik N Kundu, Mousumy Tbeileh, Noura Alexandrov, Ludmil B Bioinformatics Applications Note MOTIVATION: Clustered mutations are found in the human germline as well as in the genomes of cancer and normal somatic cells. Clustered events can be imprinted by a multitude of mutational processes, and they have been implicated in both cancer evolution and development disorders. Existing tools for identifying clustered mutations have been optimized for a particular subtype of clustered event and, in most cases, relied on a predefined inter-mutational distance (IMD) cutoff combined with a piecewise linear regression analysis. RESULTS: Here, we present SigProfilerClusters, an automated tool for detecting all types of clustered mutations by calculating a sample-dependent IMD threshold using a simulated background model that takes into account extended sequence context, transcriptional strand asymmetries and regional mutation densities. SigProfilerClusters disentangles all types of clustered events from non-clustered mutations and annotates each clustered event into an established subclass, including the widely used classes of doublet-base substitutions, multi-base substitutions, omikli and kataegis. SigProfilerClusters outputs non-clustered mutations and clustered events using standard data formats as well as provides multiple visualizations for exploring the distributions and patterns of clustered mutations across the genome. AVAILABILITY AND IMPLEMENTATION: SigProfilerClusters is supported across most operating systems and made freely available at https://github.com/AlexandrovLab/SigProfilerClusters with an extensive documentation located at https://osf.io/qpmzw/wiki/home/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-05-20 /pmc/articles/PMC9237733/ /pubmed/35595234 http://dx.doi.org/10.1093/bioinformatics/btac335 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Bergstrom, Erik N Kundu, Mousumy Tbeileh, Noura Alexandrov, Ludmil B Examining clustered somatic mutations with SigProfilerClusters |
title | Examining clustered somatic mutations with SigProfilerClusters |
title_full | Examining clustered somatic mutations with SigProfilerClusters |
title_fullStr | Examining clustered somatic mutations with SigProfilerClusters |
title_full_unstemmed | Examining clustered somatic mutations with SigProfilerClusters |
title_short | Examining clustered somatic mutations with SigProfilerClusters |
title_sort | examining clustered somatic mutations with sigprofilerclusters |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237733/ https://www.ncbi.nlm.nih.gov/pubmed/35595234 http://dx.doi.org/10.1093/bioinformatics/btac335 |
work_keys_str_mv | AT bergstromerikn examiningclusteredsomaticmutationswithsigprofilerclusters AT kundumousumy examiningclusteredsomaticmutationswithsigprofilerclusters AT tbeilehnoura examiningclusteredsomaticmutationswithsigprofilerclusters AT alexandrovludmilb examiningclusteredsomaticmutationswithsigprofilerclusters |