Cargando…

Cancer progression models and fitness landscapes: a many-to-many relationship

MOTIVATION: The identification of constraints, due to gene interactions, in the order of accumulation of mutations during cancer progression can allow us to single out therapeutic targets. Cancer progression models (CPMs) use genotype frequency data from cross-sectional samples to identify these con...

Descripción completa

Detalles Bibliográficos
Autor principal: Diaz-Uriarte, Ramon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6031050/
https://www.ncbi.nlm.nih.gov/pubmed/29048486
http://dx.doi.org/10.1093/bioinformatics/btx663
_version_ 1783337243105034240
author Diaz-Uriarte, Ramon
author_facet Diaz-Uriarte, Ramon
author_sort Diaz-Uriarte, Ramon
collection PubMed
description MOTIVATION: The identification of constraints, due to gene interactions, in the order of accumulation of mutations during cancer progression can allow us to single out therapeutic targets. Cancer progression models (CPMs) use genotype frequency data from cross-sectional samples to identify these constraints, and return Directed Acyclic Graphs (DAGs) of restrictions where arrows indicate dependencies or constraints. On the other hand, fitness landscapes, which map genotypes to fitness, contain all possible paths of tumor progression. Thus, we expect a correspondence between DAGs from CPMs and the fitness landscapes where evolution happened. But many fitness landscapes—e.g. those with reciprocal sign epistasis—cannot be represented by CPMs. RESULTS: Using simulated data under 500 fitness landscapes, I show that CPMs’ performance (prediction of genotypes that can exist) degrades with reciprocal sign epistasis. There is large variability in the DAGs inferred from each landscape, which is also affected by mutation rate, detection regime and fitness landscape features, in ways that depend on CPM method. Using three cancer datasets, I show that these problems strongly affect the analysis of empirical data: fitness landscapes that are widely different from each other produce data similar to the empirically observed ones and lead to DAGs that infer very different restrictions. Because reciprocal sign epistasis can be common in cancer, these results question the use and interpretation of CPMs. AVAILABILITY AND IMPLEMENTATION: Code available from Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6031050
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60310502018-07-10 Cancer progression models and fitness landscapes: a many-to-many relationship Diaz-Uriarte, Ramon Bioinformatics Original Papers MOTIVATION: The identification of constraints, due to gene interactions, in the order of accumulation of mutations during cancer progression can allow us to single out therapeutic targets. Cancer progression models (CPMs) use genotype frequency data from cross-sectional samples to identify these constraints, and return Directed Acyclic Graphs (DAGs) of restrictions where arrows indicate dependencies or constraints. On the other hand, fitness landscapes, which map genotypes to fitness, contain all possible paths of tumor progression. Thus, we expect a correspondence between DAGs from CPMs and the fitness landscapes where evolution happened. But many fitness landscapes—e.g. those with reciprocal sign epistasis—cannot be represented by CPMs. RESULTS: Using simulated data under 500 fitness landscapes, I show that CPMs’ performance (prediction of genotypes that can exist) degrades with reciprocal sign epistasis. There is large variability in the DAGs inferred from each landscape, which is also affected by mutation rate, detection regime and fitness landscape features, in ways that depend on CPM method. Using three cancer datasets, I show that these problems strongly affect the analysis of empirical data: fitness landscapes that are widely different from each other produce data similar to the empirically observed ones and lead to DAGs that infer very different restrictions. Because reciprocal sign epistasis can be common in cancer, these results question the use and interpretation of CPMs. AVAILABILITY AND IMPLEMENTATION: Code available from Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-03-01 2017-10-18 /pmc/articles/PMC6031050/ /pubmed/29048486 http://dx.doi.org/10.1093/bioinformatics/btx663 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Diaz-Uriarte, Ramon
Cancer progression models and fitness landscapes: a many-to-many relationship
title Cancer progression models and fitness landscapes: a many-to-many relationship
title_full Cancer progression models and fitness landscapes: a many-to-many relationship
title_fullStr Cancer progression models and fitness landscapes: a many-to-many relationship
title_full_unstemmed Cancer progression models and fitness landscapes: a many-to-many relationship
title_short Cancer progression models and fitness landscapes: a many-to-many relationship
title_sort cancer progression models and fitness landscapes: a many-to-many relationship
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6031050/
https://www.ncbi.nlm.nih.gov/pubmed/29048486
http://dx.doi.org/10.1093/bioinformatics/btx663
work_keys_str_mv AT diazuriarteramon cancerprogressionmodelsandfitnesslandscapesamanytomanyrelationship