Cargando…

Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood

Multiple mutational processes drive carcinogenesis, leaving characteristic signatures in tumor genomes. Determining the active signatures from a full repertoire of potential ones helps elucidate mechanisms of cancer development. This involves optimally decomposing the counts of cancer mutations, tab...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Shantao, Crawford, Forrest W., Gerstein, Mark B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7368050/
https://www.ncbi.nlm.nih.gov/pubmed/32681003
http://dx.doi.org/10.1038/s41467-020-17388-x
_version_ 1783560538926612480
author Li, Shantao
Crawford, Forrest W.
Gerstein, Mark B.
author_facet Li, Shantao
Crawford, Forrest W.
Gerstein, Mark B.
author_sort Li, Shantao
collection PubMed
description Multiple mutational processes drive carcinogenesis, leaving characteristic signatures in tumor genomes. Determining the active signatures from a full repertoire of potential ones helps elucidate mechanisms of cancer development. This involves optimally decomposing the counts of cancer mutations, tabulated according to their trinucleotide context, into a linear combination of known signatures. Here, we develop sigLASSO (a software tool at github.com/gersteinlab/siglasso) to carry out this optimization efficiently. sigLASSO has four key aspects: (1) It jointly optimizes the likelihood of sampling and signature fitting, by explicitly factoring multinomial sampling into the objective function. This is particularly important when mutation counts are low and sampling variance is high (e.g., in exome sequencing). (2) sigLASSO uses L1 regularization to parsimoniously assign signatures, leading to sparse and interpretable solutions. (3) It fine-tunes model complexity, informed by data scale and biological priors. (4) Consequently, sigLASSO can assess model uncertainty and abstain from making assignments in low-confidence contexts.
format Online
Article
Text
id pubmed-7368050
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73680502020-07-21 Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood Li, Shantao Crawford, Forrest W. Gerstein, Mark B. Nat Commun Article Multiple mutational processes drive carcinogenesis, leaving characteristic signatures in tumor genomes. Determining the active signatures from a full repertoire of potential ones helps elucidate mechanisms of cancer development. This involves optimally decomposing the counts of cancer mutations, tabulated according to their trinucleotide context, into a linear combination of known signatures. Here, we develop sigLASSO (a software tool at github.com/gersteinlab/siglasso) to carry out this optimization efficiently. sigLASSO has four key aspects: (1) It jointly optimizes the likelihood of sampling and signature fitting, by explicitly factoring multinomial sampling into the objective function. This is particularly important when mutation counts are low and sampling variance is high (e.g., in exome sequencing). (2) sigLASSO uses L1 regularization to parsimoniously assign signatures, leading to sparse and interpretable solutions. (3) It fine-tunes model complexity, informed by data scale and biological priors. (4) Consequently, sigLASSO can assess model uncertainty and abstain from making assignments in low-confidence contexts. Nature Publishing Group UK 2020-07-17 /pmc/articles/PMC7368050/ /pubmed/32681003 http://dx.doi.org/10.1038/s41467-020-17388-x Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Li, Shantao
Crawford, Forrest W.
Gerstein, Mark B.
Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood
title Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood
title_full Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood
title_fullStr Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood
title_full_unstemmed Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood
title_short Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood
title_sort using siglasso to optimize cancer mutation signatures jointly with sampling likelihood
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7368050/
https://www.ncbi.nlm.nih.gov/pubmed/32681003
http://dx.doi.org/10.1038/s41467-020-17388-x
work_keys_str_mv AT lishantao usingsiglassotooptimizecancermutationsignaturesjointlywithsamplinglikelihood
AT crawfordforrestw usingsiglassotooptimizecancermutationsignaturesjointlywithsamplinglikelihood
AT gersteinmarkb usingsiglassotooptimizecancermutationsignaturesjointlywithsamplinglikelihood