Cargando…
Inferring tumor progression in large datasets
Identification of mutations of the genes that give cancer a selective advantage is an important step towards research and clinical objectives. As such, there has been a growing interest in developing methods for identification of driver genes and their temporal order within a single patient (intra-t...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7577444/ https://www.ncbi.nlm.nih.gov/pubmed/33035204 http://dx.doi.org/10.1371/journal.pcbi.1008183 |
_version_ | 1783598190548746240 |
---|---|
author | Mohaghegh Neyshabouri, Mohammadreza Jun, Seong-Hwan Lagergren, Jens |
author_facet | Mohaghegh Neyshabouri, Mohammadreza Jun, Seong-Hwan Lagergren, Jens |
author_sort | Mohaghegh Neyshabouri, Mohammadreza |
collection | PubMed |
description | Identification of mutations of the genes that give cancer a selective advantage is an important step towards research and clinical objectives. As such, there has been a growing interest in developing methods for identification of driver genes and their temporal order within a single patient (intra-tumor) as well as across a cohort of patients (inter-tumor). In this paper, we develop a probabilistic model for tumor progression, in which the driver genes are clustered into several ordered driver pathways. We develop an efficient inference algorithm that exhibits favorable scalability to the number of genes and samples compared to a previously introduced ILP-based method. Adopting a probabilistic approach also allows principled approaches to model selection and uncertainty quantification. Using a large set of experiments on synthetic datasets, we demonstrate our superior performance compared to the ILP-based method. We also analyze two biological datasets of colorectal and glioblastoma cancers. We emphasize that while the ILP-based method puts many seemingly passenger genes in the driver pathways, our algorithm keeps focused on truly driver genes and outputs more accurate models for cancer progression. |
format | Online Article Text |
id | pubmed-7577444 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-75774442020-10-26 Inferring tumor progression in large datasets Mohaghegh Neyshabouri, Mohammadreza Jun, Seong-Hwan Lagergren, Jens PLoS Comput Biol Research Article Identification of mutations of the genes that give cancer a selective advantage is an important step towards research and clinical objectives. As such, there has been a growing interest in developing methods for identification of driver genes and their temporal order within a single patient (intra-tumor) as well as across a cohort of patients (inter-tumor). In this paper, we develop a probabilistic model for tumor progression, in which the driver genes are clustered into several ordered driver pathways. We develop an efficient inference algorithm that exhibits favorable scalability to the number of genes and samples compared to a previously introduced ILP-based method. Adopting a probabilistic approach also allows principled approaches to model selection and uncertainty quantification. Using a large set of experiments on synthetic datasets, we demonstrate our superior performance compared to the ILP-based method. We also analyze two biological datasets of colorectal and glioblastoma cancers. We emphasize that while the ILP-based method puts many seemingly passenger genes in the driver pathways, our algorithm keeps focused on truly driver genes and outputs more accurate models for cancer progression. Public Library of Science 2020-10-09 /pmc/articles/PMC7577444/ /pubmed/33035204 http://dx.doi.org/10.1371/journal.pcbi.1008183 Text en © 2020 Mohaghegh Neyshabouri et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Mohaghegh Neyshabouri, Mohammadreza Jun, Seong-Hwan Lagergren, Jens Inferring tumor progression in large datasets |
title | Inferring tumor progression in large datasets |
title_full | Inferring tumor progression in large datasets |
title_fullStr | Inferring tumor progression in large datasets |
title_full_unstemmed | Inferring tumor progression in large datasets |
title_short | Inferring tumor progression in large datasets |
title_sort | inferring tumor progression in large datasets |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7577444/ https://www.ncbi.nlm.nih.gov/pubmed/33035204 http://dx.doi.org/10.1371/journal.pcbi.1008183 |
work_keys_str_mv | AT mohagheghneyshabourimohammadreza inferringtumorprogressioninlargedatasets AT junseonghwan inferringtumorprogressioninlargedatasets AT lagergrenjens inferringtumorprogressioninlargedatasets |