Cargando…
Cancer progression models and fitness landscapes: a many-to-many relationship
MOTIVATION: The identification of constraints, due to gene interactions, in the order of accumulation of mutations during cancer progression can allow us to single out therapeutic targets. Cancer progression models (CPMs) use genotype frequency data from cross-sectional samples to identify these con...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6031050/ https://www.ncbi.nlm.nih.gov/pubmed/29048486 http://dx.doi.org/10.1093/bioinformatics/btx663 |
_version_ | 1783337243105034240 |
---|---|
author | Diaz-Uriarte, Ramon |
author_facet | Diaz-Uriarte, Ramon |
author_sort | Diaz-Uriarte, Ramon |
collection | PubMed |
description | MOTIVATION: The identification of constraints, due to gene interactions, in the order of accumulation of mutations during cancer progression can allow us to single out therapeutic targets. Cancer progression models (CPMs) use genotype frequency data from cross-sectional samples to identify these constraints, and return Directed Acyclic Graphs (DAGs) of restrictions where arrows indicate dependencies or constraints. On the other hand, fitness landscapes, which map genotypes to fitness, contain all possible paths of tumor progression. Thus, we expect a correspondence between DAGs from CPMs and the fitness landscapes where evolution happened. But many fitness landscapes—e.g. those with reciprocal sign epistasis—cannot be represented by CPMs. RESULTS: Using simulated data under 500 fitness landscapes, I show that CPMs’ performance (prediction of genotypes that can exist) degrades with reciprocal sign epistasis. There is large variability in the DAGs inferred from each landscape, which is also affected by mutation rate, detection regime and fitness landscape features, in ways that depend on CPM method. Using three cancer datasets, I show that these problems strongly affect the analysis of empirical data: fitness landscapes that are widely different from each other produce data similar to the empirically observed ones and lead to DAGs that infer very different restrictions. Because reciprocal sign epistasis can be common in cancer, these results question the use and interpretation of CPMs. AVAILABILITY AND IMPLEMENTATION: Code available from Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6031050 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-60310502018-07-10 Cancer progression models and fitness landscapes: a many-to-many relationship Diaz-Uriarte, Ramon Bioinformatics Original Papers MOTIVATION: The identification of constraints, due to gene interactions, in the order of accumulation of mutations during cancer progression can allow us to single out therapeutic targets. Cancer progression models (CPMs) use genotype frequency data from cross-sectional samples to identify these constraints, and return Directed Acyclic Graphs (DAGs) of restrictions where arrows indicate dependencies or constraints. On the other hand, fitness landscapes, which map genotypes to fitness, contain all possible paths of tumor progression. Thus, we expect a correspondence between DAGs from CPMs and the fitness landscapes where evolution happened. But many fitness landscapes—e.g. those with reciprocal sign epistasis—cannot be represented by CPMs. RESULTS: Using simulated data under 500 fitness landscapes, I show that CPMs’ performance (prediction of genotypes that can exist) degrades with reciprocal sign epistasis. There is large variability in the DAGs inferred from each landscape, which is also affected by mutation rate, detection regime and fitness landscape features, in ways that depend on CPM method. Using three cancer datasets, I show that these problems strongly affect the analysis of empirical data: fitness landscapes that are widely different from each other produce data similar to the empirically observed ones and lead to DAGs that infer very different restrictions. Because reciprocal sign epistasis can be common in cancer, these results question the use and interpretation of CPMs. AVAILABILITY AND IMPLEMENTATION: Code available from Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-03-01 2017-10-18 /pmc/articles/PMC6031050/ /pubmed/29048486 http://dx.doi.org/10.1093/bioinformatics/btx663 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Diaz-Uriarte, Ramon Cancer progression models and fitness landscapes: a many-to-many relationship |
title | Cancer progression models and fitness landscapes: a many-to-many relationship |
title_full | Cancer progression models and fitness landscapes: a many-to-many relationship |
title_fullStr | Cancer progression models and fitness landscapes: a many-to-many relationship |
title_full_unstemmed | Cancer progression models and fitness landscapes: a many-to-many relationship |
title_short | Cancer progression models and fitness landscapes: a many-to-many relationship |
title_sort | cancer progression models and fitness landscapes: a many-to-many relationship |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6031050/ https://www.ncbi.nlm.nih.gov/pubmed/29048486 http://dx.doi.org/10.1093/bioinformatics/btx663 |
work_keys_str_mv | AT diazuriarteramon cancerprogressionmodelsandfitnesslandscapesamanytomanyrelationship |