Cargando…
Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood
Multiple mutational processes drive carcinogenesis, leaving characteristic signatures in tumor genomes. Determining the active signatures from a full repertoire of potential ones helps elucidate mechanisms of cancer development. This involves optimally decomposing the counts of cancer mutations, tab...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7368050/ https://www.ncbi.nlm.nih.gov/pubmed/32681003 http://dx.doi.org/10.1038/s41467-020-17388-x |
_version_ | 1783560538926612480 |
---|---|
author | Li, Shantao Crawford, Forrest W. Gerstein, Mark B. |
author_facet | Li, Shantao Crawford, Forrest W. Gerstein, Mark B. |
author_sort | Li, Shantao |
collection | PubMed |
description | Multiple mutational processes drive carcinogenesis, leaving characteristic signatures in tumor genomes. Determining the active signatures from a full repertoire of potential ones helps elucidate mechanisms of cancer development. This involves optimally decomposing the counts of cancer mutations, tabulated according to their trinucleotide context, into a linear combination of known signatures. Here, we develop sigLASSO (a software tool at github.com/gersteinlab/siglasso) to carry out this optimization efficiently. sigLASSO has four key aspects: (1) It jointly optimizes the likelihood of sampling and signature fitting, by explicitly factoring multinomial sampling into the objective function. This is particularly important when mutation counts are low and sampling variance is high (e.g., in exome sequencing). (2) sigLASSO uses L1 regularization to parsimoniously assign signatures, leading to sparse and interpretable solutions. (3) It fine-tunes model complexity, informed by data scale and biological priors. (4) Consequently, sigLASSO can assess model uncertainty and abstain from making assignments in low-confidence contexts. |
format | Online Article Text |
id | pubmed-7368050 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-73680502020-07-21 Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood Li, Shantao Crawford, Forrest W. Gerstein, Mark B. Nat Commun Article Multiple mutational processes drive carcinogenesis, leaving characteristic signatures in tumor genomes. Determining the active signatures from a full repertoire of potential ones helps elucidate mechanisms of cancer development. This involves optimally decomposing the counts of cancer mutations, tabulated according to their trinucleotide context, into a linear combination of known signatures. Here, we develop sigLASSO (a software tool at github.com/gersteinlab/siglasso) to carry out this optimization efficiently. sigLASSO has four key aspects: (1) It jointly optimizes the likelihood of sampling and signature fitting, by explicitly factoring multinomial sampling into the objective function. This is particularly important when mutation counts are low and sampling variance is high (e.g., in exome sequencing). (2) sigLASSO uses L1 regularization to parsimoniously assign signatures, leading to sparse and interpretable solutions. (3) It fine-tunes model complexity, informed by data scale and biological priors. (4) Consequently, sigLASSO can assess model uncertainty and abstain from making assignments in low-confidence contexts. Nature Publishing Group UK 2020-07-17 /pmc/articles/PMC7368050/ /pubmed/32681003 http://dx.doi.org/10.1038/s41467-020-17388-x Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Li, Shantao Crawford, Forrest W. Gerstein, Mark B. Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood |
title | Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood |
title_full | Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood |
title_fullStr | Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood |
title_full_unstemmed | Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood |
title_short | Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood |
title_sort | using siglasso to optimize cancer mutation signatures jointly with sampling likelihood |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7368050/ https://www.ncbi.nlm.nih.gov/pubmed/32681003 http://dx.doi.org/10.1038/s41467-020-17388-x |
work_keys_str_mv | AT lishantao usingsiglassotooptimizecancermutationsignaturesjointlywithsamplinglikelihood AT crawfordforrestw usingsiglassotooptimizecancermutationsignaturesjointlywithsamplinglikelihood AT gersteinmarkb usingsiglassotooptimizecancermutationsignaturesjointlywithsamplinglikelihood |