Cargando…
Unravelling the instability of mutational signatures extraction via archetypal analysis
The high cosine similarity between some single-base substitution mutational signatures and their characteristic flat profiles could suggest the presence of overfitting and mathematical artefacts. The newest version (v3.3) of the signature database available in the Catalogue Of Somatic Mutations In C...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9846778/ https://www.ncbi.nlm.nih.gov/pubmed/36685831 http://dx.doi.org/10.3389/fgene.2022.1049501 |
_version_ | 1784871270451511296 |
---|---|
author | Pancotti, Corrado Rollo, Cesare Birolo, Giovanni Benevenuta, Silvia Fariselli, Piero Sanavia, Tiziana |
author_facet | Pancotti, Corrado Rollo, Cesare Birolo, Giovanni Benevenuta, Silvia Fariselli, Piero Sanavia, Tiziana |
author_sort | Pancotti, Corrado |
collection | PubMed |
description | The high cosine similarity between some single-base substitution mutational signatures and their characteristic flat profiles could suggest the presence of overfitting and mathematical artefacts. The newest version (v3.3) of the signature database available in the Catalogue Of Somatic Mutations In Cancer (COSMIC) provides a collection of 79 mutational signatures, which has more than doubled with respect to previous version (30 profiles available in COSMIC signatures v2), making more critical the associations between signatures and specific mutagenic processes. This study both provides a systematic assessment of the de novo extraction task through simulation scenarios based on the latest version of the COSMIC signatures and highlights, through a novel approach using archetypal analysis, which COSMIC signatures are redundant and more likely to be considered as mathematical artefacts. 29 archetypes were able to reconstruct the profile of all the COSMIC signatures with cosine similarity [Formula: see text] 0.8. Interestingly, these archetypes tend to group similar original signatures sharing either the same aetiology or similar biological processes. We believe that these findings will be useful to encourage the development of new de novo extraction methods avoiding the redundancy of information among the signatures while preserving the biological interpretation. |
format | Online Article Text |
id | pubmed-9846778 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98467782023-01-19 Unravelling the instability of mutational signatures extraction via archetypal analysis Pancotti, Corrado Rollo, Cesare Birolo, Giovanni Benevenuta, Silvia Fariselli, Piero Sanavia, Tiziana Front Genet Genetics The high cosine similarity between some single-base substitution mutational signatures and their characteristic flat profiles could suggest the presence of overfitting and mathematical artefacts. The newest version (v3.3) of the signature database available in the Catalogue Of Somatic Mutations In Cancer (COSMIC) provides a collection of 79 mutational signatures, which has more than doubled with respect to previous version (30 profiles available in COSMIC signatures v2), making more critical the associations between signatures and specific mutagenic processes. This study both provides a systematic assessment of the de novo extraction task through simulation scenarios based on the latest version of the COSMIC signatures and highlights, through a novel approach using archetypal analysis, which COSMIC signatures are redundant and more likely to be considered as mathematical artefacts. 29 archetypes were able to reconstruct the profile of all the COSMIC signatures with cosine similarity [Formula: see text] 0.8. Interestingly, these archetypes tend to group similar original signatures sharing either the same aetiology or similar biological processes. We believe that these findings will be useful to encourage the development of new de novo extraction methods avoiding the redundancy of information among the signatures while preserving the biological interpretation. Frontiers Media S.A. 2023-01-04 /pmc/articles/PMC9846778/ /pubmed/36685831 http://dx.doi.org/10.3389/fgene.2022.1049501 Text en Copyright © 2023 Pancotti, Rollo, Birolo, Benevenuta, Fariselli and Sanavia. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Pancotti, Corrado Rollo, Cesare Birolo, Giovanni Benevenuta, Silvia Fariselli, Piero Sanavia, Tiziana Unravelling the instability of mutational signatures extraction via archetypal analysis |
title | Unravelling the instability of mutational signatures extraction via archetypal analysis |
title_full | Unravelling the instability of mutational signatures extraction via archetypal analysis |
title_fullStr | Unravelling the instability of mutational signatures extraction via archetypal analysis |
title_full_unstemmed | Unravelling the instability of mutational signatures extraction via archetypal analysis |
title_short | Unravelling the instability of mutational signatures extraction via archetypal analysis |
title_sort | unravelling the instability of mutational signatures extraction via archetypal analysis |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9846778/ https://www.ncbi.nlm.nih.gov/pubmed/36685831 http://dx.doi.org/10.3389/fgene.2022.1049501 |
work_keys_str_mv | AT pancotticorrado unravellingtheinstabilityofmutationalsignaturesextractionviaarchetypalanalysis AT rollocesare unravellingtheinstabilityofmutationalsignaturesextractionviaarchetypalanalysis AT birologiovanni unravellingtheinstabilityofmutationalsignaturesextractionviaarchetypalanalysis AT benevenutasilvia unravellingtheinstabilityofmutationalsignaturesextractionviaarchetypalanalysis AT farisellipiero unravellingtheinstabilityofmutationalsignaturesextractionviaarchetypalanalysis AT sanaviatiziana unravellingtheinstabilityofmutationalsignaturesextractionviaarchetypalanalysis |