Cargando…

Inferring tumor progression in large datasets

Identification of mutations of the genes that give cancer a selective advantage is an important step towards research and clinical objectives. As such, there has been a growing interest in developing methods for identification of driver genes and their temporal order within a single patient (intra-t...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohaghegh Neyshabouri, Mohammadreza, Jun, Seong-Hwan, Lagergren, Jens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7577444/
https://www.ncbi.nlm.nih.gov/pubmed/33035204
http://dx.doi.org/10.1371/journal.pcbi.1008183
_version_ 1783598190548746240
author Mohaghegh Neyshabouri, Mohammadreza
Jun, Seong-Hwan
Lagergren, Jens
author_facet Mohaghegh Neyshabouri, Mohammadreza
Jun, Seong-Hwan
Lagergren, Jens
author_sort Mohaghegh Neyshabouri, Mohammadreza
collection PubMed
description Identification of mutations of the genes that give cancer a selective advantage is an important step towards research and clinical objectives. As such, there has been a growing interest in developing methods for identification of driver genes and their temporal order within a single patient (intra-tumor) as well as across a cohort of patients (inter-tumor). In this paper, we develop a probabilistic model for tumor progression, in which the driver genes are clustered into several ordered driver pathways. We develop an efficient inference algorithm that exhibits favorable scalability to the number of genes and samples compared to a previously introduced ILP-based method. Adopting a probabilistic approach also allows principled approaches to model selection and uncertainty quantification. Using a large set of experiments on synthetic datasets, we demonstrate our superior performance compared to the ILP-based method. We also analyze two biological datasets of colorectal and glioblastoma cancers. We emphasize that while the ILP-based method puts many seemingly passenger genes in the driver pathways, our algorithm keeps focused on truly driver genes and outputs more accurate models for cancer progression.
format Online
Article
Text
id pubmed-7577444
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75774442020-10-26 Inferring tumor progression in large datasets Mohaghegh Neyshabouri, Mohammadreza Jun, Seong-Hwan Lagergren, Jens PLoS Comput Biol Research Article Identification of mutations of the genes that give cancer a selective advantage is an important step towards research and clinical objectives. As such, there has been a growing interest in developing methods for identification of driver genes and their temporal order within a single patient (intra-tumor) as well as across a cohort of patients (inter-tumor). In this paper, we develop a probabilistic model for tumor progression, in which the driver genes are clustered into several ordered driver pathways. We develop an efficient inference algorithm that exhibits favorable scalability to the number of genes and samples compared to a previously introduced ILP-based method. Adopting a probabilistic approach also allows principled approaches to model selection and uncertainty quantification. Using a large set of experiments on synthetic datasets, we demonstrate our superior performance compared to the ILP-based method. We also analyze two biological datasets of colorectal and glioblastoma cancers. We emphasize that while the ILP-based method puts many seemingly passenger genes in the driver pathways, our algorithm keeps focused on truly driver genes and outputs more accurate models for cancer progression. Public Library of Science 2020-10-09 /pmc/articles/PMC7577444/ /pubmed/33035204 http://dx.doi.org/10.1371/journal.pcbi.1008183 Text en © 2020 Mohaghegh Neyshabouri et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mohaghegh Neyshabouri, Mohammadreza
Jun, Seong-Hwan
Lagergren, Jens
Inferring tumor progression in large datasets
title Inferring tumor progression in large datasets
title_full Inferring tumor progression in large datasets
title_fullStr Inferring tumor progression in large datasets
title_full_unstemmed Inferring tumor progression in large datasets
title_short Inferring tumor progression in large datasets
title_sort inferring tumor progression in large datasets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7577444/
https://www.ncbi.nlm.nih.gov/pubmed/33035204
http://dx.doi.org/10.1371/journal.pcbi.1008183
work_keys_str_mv AT mohagheghneyshabourimohammadreza inferringtumorprogressioninlargedatasets
AT junseonghwan inferringtumorprogressioninlargedatasets
AT lagergrenjens inferringtumorprogressioninlargedatasets