Cargando…

Accuracy of mutational signature software on correlated signatures

Mutational signatures are characteristic patterns of mutations generated by exogenous mutagens or by endogenous mutational processes. Mutational signatures are important for research into DNA damage and repair, aging, cancer biology, genetic toxicology, and epidemiology. Unsupervised learning can in...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Yang, Chua, Ellora Hui Zhen, Ng, Alvin Wei Tian, Boot, Arnoud, Rozen, Steven G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8748538/
https://www.ncbi.nlm.nih.gov/pubmed/35013428
http://dx.doi.org/10.1038/s41598-021-04207-6
_version_ 1784631031088807936
author Wu, Yang
Chua, Ellora Hui Zhen
Ng, Alvin Wei Tian
Boot, Arnoud
Rozen, Steven G.
author_facet Wu, Yang
Chua, Ellora Hui Zhen
Ng, Alvin Wei Tian
Boot, Arnoud
Rozen, Steven G.
author_sort Wu, Yang
collection PubMed
description Mutational signatures are characteristic patterns of mutations generated by exogenous mutagens or by endogenous mutational processes. Mutational signatures are important for research into DNA damage and repair, aging, cancer biology, genetic toxicology, and epidemiology. Unsupervised learning can infer mutational signatures from the somatic mutations in large numbers of tumors, and separating correlated signatures is a notable challenge for this task. To investigate which methods can best meet this challenge, we assessed 18 computational methods for inferring mutational signatures on 20 synthetic data sets that incorporated varying degrees of correlated activity of two common mutational signatures. Performance varied widely, and four methods noticeably outperformed the others: hdp (based on hierarchical Dirichlet processes), SigProExtractor (based on multiple non-negative matrix factorizations over resampled data), TCSM (based on an approach used in document topic analysis), and mutSpec.NMF (also based on non-negative matrix factorization). The results underscored the complexities of mutational signature extraction, including the importance and difficulty of determining the correct number of signatures and the importance of hyperparameters. Our findings indicate directions for improvement of the software and show a need for care when interpreting results from any of these methods, including the need for assessing sensitivity of the results to input parameters.
format Online
Article
Text
id pubmed-8748538
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-87485382022-01-11 Accuracy of mutational signature software on correlated signatures Wu, Yang Chua, Ellora Hui Zhen Ng, Alvin Wei Tian Boot, Arnoud Rozen, Steven G. Sci Rep Article Mutational signatures are characteristic patterns of mutations generated by exogenous mutagens or by endogenous mutational processes. Mutational signatures are important for research into DNA damage and repair, aging, cancer biology, genetic toxicology, and epidemiology. Unsupervised learning can infer mutational signatures from the somatic mutations in large numbers of tumors, and separating correlated signatures is a notable challenge for this task. To investigate which methods can best meet this challenge, we assessed 18 computational methods for inferring mutational signatures on 20 synthetic data sets that incorporated varying degrees of correlated activity of two common mutational signatures. Performance varied widely, and four methods noticeably outperformed the others: hdp (based on hierarchical Dirichlet processes), SigProExtractor (based on multiple non-negative matrix factorizations over resampled data), TCSM (based on an approach used in document topic analysis), and mutSpec.NMF (also based on non-negative matrix factorization). The results underscored the complexities of mutational signature extraction, including the importance and difficulty of determining the correct number of signatures and the importance of hyperparameters. Our findings indicate directions for improvement of the software and show a need for care when interpreting results from any of these methods, including the need for assessing sensitivity of the results to input parameters. Nature Publishing Group UK 2022-01-10 /pmc/articles/PMC8748538/ /pubmed/35013428 http://dx.doi.org/10.1038/s41598-021-04207-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wu, Yang
Chua, Ellora Hui Zhen
Ng, Alvin Wei Tian
Boot, Arnoud
Rozen, Steven G.
Accuracy of mutational signature software on correlated signatures
title Accuracy of mutational signature software on correlated signatures
title_full Accuracy of mutational signature software on correlated signatures
title_fullStr Accuracy of mutational signature software on correlated signatures
title_full_unstemmed Accuracy of mutational signature software on correlated signatures
title_short Accuracy of mutational signature software on correlated signatures
title_sort accuracy of mutational signature software on correlated signatures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8748538/
https://www.ncbi.nlm.nih.gov/pubmed/35013428
http://dx.doi.org/10.1038/s41598-021-04207-6
work_keys_str_mv AT wuyang accuracyofmutationalsignaturesoftwareoncorrelatedsignatures
AT chuaellorahuizhen accuracyofmutationalsignaturesoftwareoncorrelatedsignatures
AT ngalvinweitian accuracyofmutationalsignaturesoftwareoncorrelatedsignatures
AT bootarnoud accuracyofmutationalsignaturesoftwareoncorrelatedsignatures
AT rozensteveng accuracyofmutationalsignaturesoftwareoncorrelatedsignatures