Cargando…

Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes

Currently, the population dynamics of preclonal cancer cells before clonal expansion of tumors has not been sufficiently addressed thus far. By focusing on preclonal cancer cell population as a Darwinian evolutionary system, we formulated and analyzed the observed mutation frequency among tumors (MF...

Descripción completa

Detalles Bibliográficos
Autores principales: Tokutomi, Natsuki, Nakai, Kenta, Sugano, Sumio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8382180/
https://www.ncbi.nlm.nih.gov/pubmed/34424899
http://dx.doi.org/10.1371/journal.pone.0243595
_version_ 1783741501147185152
author Tokutomi, Natsuki
Nakai, Kenta
Sugano, Sumio
author_facet Tokutomi, Natsuki
Nakai, Kenta
Sugano, Sumio
author_sort Tokutomi, Natsuki
collection PubMed
description Currently, the population dynamics of preclonal cancer cells before clonal expansion of tumors has not been sufficiently addressed thus far. By focusing on preclonal cancer cell population as a Darwinian evolutionary system, we formulated and analyzed the observed mutation frequency among tumors (MFaT) as a proxy for the hypothesized sequence read frequency and beneficial fitness effect of a cancer driver mutation. Analogous to intestinal crypts, we assumed that sample donor patients are separate culture tanks where proliferating cells follow certain population dynamics described by extreme value theory (EVT). To validate this, we analyzed three large-scale cancer genome datasets, each harboring > 10000 tumor samples and in total involving > 177898 observed mutation sites. We clarified the necessary premises for the application of EVT in the strong selection and weak mutation (SSWM) regime in relation to cancer genome sequences at scale. We also confirmed that the stochastic distribution of MFaT is likely of the Fréchet type, which challenges the well-known Gumbel hypothesis of beneficial fitness effects. Based on statistical data analysis, we demonstrated the potential of EVT as a population genetics framework to understand and explain the stochastic behavior of driver-mutation frequency in cancer genomes as well as its applicability in real cancer genome sequence data.
format Online
Article
Text
id pubmed-8382180
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83821802021-08-24 Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes Tokutomi, Natsuki Nakai, Kenta Sugano, Sumio PLoS One Research Article Currently, the population dynamics of preclonal cancer cells before clonal expansion of tumors has not been sufficiently addressed thus far. By focusing on preclonal cancer cell population as a Darwinian evolutionary system, we formulated and analyzed the observed mutation frequency among tumors (MFaT) as a proxy for the hypothesized sequence read frequency and beneficial fitness effect of a cancer driver mutation. Analogous to intestinal crypts, we assumed that sample donor patients are separate culture tanks where proliferating cells follow certain population dynamics described by extreme value theory (EVT). To validate this, we analyzed three large-scale cancer genome datasets, each harboring > 10000 tumor samples and in total involving > 177898 observed mutation sites. We clarified the necessary premises for the application of EVT in the strong selection and weak mutation (SSWM) regime in relation to cancer genome sequences at scale. We also confirmed that the stochastic distribution of MFaT is likely of the Fréchet type, which challenges the well-known Gumbel hypothesis of beneficial fitness effects. Based on statistical data analysis, we demonstrated the potential of EVT as a population genetics framework to understand and explain the stochastic behavior of driver-mutation frequency in cancer genomes as well as its applicability in real cancer genome sequence data. Public Library of Science 2021-08-23 /pmc/articles/PMC8382180/ /pubmed/34424899 http://dx.doi.org/10.1371/journal.pone.0243595 Text en © 2021 Tokutomi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tokutomi, Natsuki
Nakai, Kenta
Sugano, Sumio
Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes
title Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes
title_full Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes
title_fullStr Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes
title_full_unstemmed Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes
title_short Extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes
title_sort extreme value theory as a framework for understanding mutation frequency distribution in cancer genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8382180/
https://www.ncbi.nlm.nih.gov/pubmed/34424899
http://dx.doi.org/10.1371/journal.pone.0243595
work_keys_str_mv AT tokutominatsuki extremevaluetheoryasaframeworkforunderstandingmutationfrequencydistributionincancergenomes
AT nakaikenta extremevaluetheoryasaframeworkforunderstandingmutationfrequencydistributionincancergenomes
AT suganosumio extremevaluetheoryasaframeworkforunderstandingmutationfrequencydistributionincancergenomes