Cargando…

Subclonal reconstruction of tumors using machine learning and population genetics

The majority of cancer genomic data are generated from bulk samples composed of mixtures of cancer subpopulations, as well as normal cells. Subclonal reconstruction approaches based on machine learning aim to separate those subpopulations in a sample and reconstruct their evolutionary history. Howev...

Descripción completa

Detalles Bibliográficos
Autores principales: Caravagna, Giulio, Heide, Timon, Williams, Marc J., Zapata, Luis, Nichol, Daniel, Chkhaidze, Ketevan, Cross, William, Cresswell, George D., Werner, Benjamin, Acar, Ahmet, Chesler, Louis, Barnes, Chris P., Sanguinetti, Guido, Graham, Trevor A., Sottoriva, Andrea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7610388/
https://www.ncbi.nlm.nih.gov/pubmed/32879509
http://dx.doi.org/10.1038/s41588-020-0675-5
_version_ 1783605184264404992
author Caravagna, Giulio
Heide, Timon
Williams, Marc J.
Zapata, Luis
Nichol, Daniel
Chkhaidze, Ketevan
Cross, William
Cresswell, George D.
Werner, Benjamin
Acar, Ahmet
Chesler, Louis
Barnes, Chris P.
Sanguinetti, Guido
Graham, Trevor A.
Sottoriva, Andrea
author_facet Caravagna, Giulio
Heide, Timon
Williams, Marc J.
Zapata, Luis
Nichol, Daniel
Chkhaidze, Ketevan
Cross, William
Cresswell, George D.
Werner, Benjamin
Acar, Ahmet
Chesler, Louis
Barnes, Chris P.
Sanguinetti, Guido
Graham, Trevor A.
Sottoriva, Andrea
author_sort Caravagna, Giulio
collection PubMed
description The majority of cancer genomic data are generated from bulk samples composed of mixtures of cancer subpopulations, as well as normal cells. Subclonal reconstruction approaches based on machine learning aim to separate those subpopulations in a sample and reconstruct their evolutionary history. However, current approaches are entirely data-driven and agnostic to evolutionary theory. We demonstrate that systematic errors occur in the analysis if evolution is not accounted for, and this is exacerbated by multi-sampling of the same tumor. We present a novel approach for model-based tumor subclonal reconstruction (MOBSTER) that combines machine learning with theoretical population genetics. Using public whole-genome sequencing data from 2,606 samples from different cohorts, new data and synthetic validation, we show this method is more robust and accurate than current techniques in single sample, multi-region and longitudinal data. This approach minimizes the confounding factors of non-evolutionary methods, leading to more accurate recovery of the evolutionary history of human cancers.
format Online
Article
Text
id pubmed-7610388
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-76103882021-03-23 Subclonal reconstruction of tumors using machine learning and population genetics Caravagna, Giulio Heide, Timon Williams, Marc J. Zapata, Luis Nichol, Daniel Chkhaidze, Ketevan Cross, William Cresswell, George D. Werner, Benjamin Acar, Ahmet Chesler, Louis Barnes, Chris P. Sanguinetti, Guido Graham, Trevor A. Sottoriva, Andrea Nat Genet Article The majority of cancer genomic data are generated from bulk samples composed of mixtures of cancer subpopulations, as well as normal cells. Subclonal reconstruction approaches based on machine learning aim to separate those subpopulations in a sample and reconstruct their evolutionary history. However, current approaches are entirely data-driven and agnostic to evolutionary theory. We demonstrate that systematic errors occur in the analysis if evolution is not accounted for, and this is exacerbated by multi-sampling of the same tumor. We present a novel approach for model-based tumor subclonal reconstruction (MOBSTER) that combines machine learning with theoretical population genetics. Using public whole-genome sequencing data from 2,606 samples from different cohorts, new data and synthetic validation, we show this method is more robust and accurate than current techniques in single sample, multi-region and longitudinal data. This approach minimizes the confounding factors of non-evolutionary methods, leading to more accurate recovery of the evolutionary history of human cancers. 2020-09-01 2020-09-02 /pmc/articles/PMC7610388/ /pubmed/32879509 http://dx.doi.org/10.1038/s41588-020-0675-5 Text en http://www.nature.com/authors/editorial_policies/license.html#termsUsers may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Caravagna, Giulio
Heide, Timon
Williams, Marc J.
Zapata, Luis
Nichol, Daniel
Chkhaidze, Ketevan
Cross, William
Cresswell, George D.
Werner, Benjamin
Acar, Ahmet
Chesler, Louis
Barnes, Chris P.
Sanguinetti, Guido
Graham, Trevor A.
Sottoriva, Andrea
Subclonal reconstruction of tumors using machine learning and population genetics
title Subclonal reconstruction of tumors using machine learning and population genetics
title_full Subclonal reconstruction of tumors using machine learning and population genetics
title_fullStr Subclonal reconstruction of tumors using machine learning and population genetics
title_full_unstemmed Subclonal reconstruction of tumors using machine learning and population genetics
title_short Subclonal reconstruction of tumors using machine learning and population genetics
title_sort subclonal reconstruction of tumors using machine learning and population genetics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7610388/
https://www.ncbi.nlm.nih.gov/pubmed/32879509
http://dx.doi.org/10.1038/s41588-020-0675-5
work_keys_str_mv AT caravagnagiulio subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT heidetimon subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT williamsmarcj subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT zapataluis subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT nicholdaniel subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT chkhaidzeketevan subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT crosswilliam subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT cresswellgeorged subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT wernerbenjamin subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT acarahmet subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT cheslerlouis subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT barneschrisp subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT sanguinettiguido subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT grahamtrevora subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics
AT sottorivaandrea subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics