Cargando…
Subclonal reconstruction of tumors using machine learning and population genetics
The majority of cancer genomic data are generated from bulk samples composed of mixtures of cancer subpopulations, as well as normal cells. Subclonal reconstruction approaches based on machine learning aim to separate those subpopulations in a sample and reconstruct their evolutionary history. Howev...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7610388/ https://www.ncbi.nlm.nih.gov/pubmed/32879509 http://dx.doi.org/10.1038/s41588-020-0675-5 |
_version_ | 1783605184264404992 |
---|---|
author | Caravagna, Giulio Heide, Timon Williams, Marc J. Zapata, Luis Nichol, Daniel Chkhaidze, Ketevan Cross, William Cresswell, George D. Werner, Benjamin Acar, Ahmet Chesler, Louis Barnes, Chris P. Sanguinetti, Guido Graham, Trevor A. Sottoriva, Andrea |
author_facet | Caravagna, Giulio Heide, Timon Williams, Marc J. Zapata, Luis Nichol, Daniel Chkhaidze, Ketevan Cross, William Cresswell, George D. Werner, Benjamin Acar, Ahmet Chesler, Louis Barnes, Chris P. Sanguinetti, Guido Graham, Trevor A. Sottoriva, Andrea |
author_sort | Caravagna, Giulio |
collection | PubMed |
description | The majority of cancer genomic data are generated from bulk samples composed of mixtures of cancer subpopulations, as well as normal cells. Subclonal reconstruction approaches based on machine learning aim to separate those subpopulations in a sample and reconstruct their evolutionary history. However, current approaches are entirely data-driven and agnostic to evolutionary theory. We demonstrate that systematic errors occur in the analysis if evolution is not accounted for, and this is exacerbated by multi-sampling of the same tumor. We present a novel approach for model-based tumor subclonal reconstruction (MOBSTER) that combines machine learning with theoretical population genetics. Using public whole-genome sequencing data from 2,606 samples from different cohorts, new data and synthetic validation, we show this method is more robust and accurate than current techniques in single sample, multi-region and longitudinal data. This approach minimizes the confounding factors of non-evolutionary methods, leading to more accurate recovery of the evolutionary history of human cancers. |
format | Online Article Text |
id | pubmed-7610388 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-76103882021-03-23 Subclonal reconstruction of tumors using machine learning and population genetics Caravagna, Giulio Heide, Timon Williams, Marc J. Zapata, Luis Nichol, Daniel Chkhaidze, Ketevan Cross, William Cresswell, George D. Werner, Benjamin Acar, Ahmet Chesler, Louis Barnes, Chris P. Sanguinetti, Guido Graham, Trevor A. Sottoriva, Andrea Nat Genet Article The majority of cancer genomic data are generated from bulk samples composed of mixtures of cancer subpopulations, as well as normal cells. Subclonal reconstruction approaches based on machine learning aim to separate those subpopulations in a sample and reconstruct their evolutionary history. However, current approaches are entirely data-driven and agnostic to evolutionary theory. We demonstrate that systematic errors occur in the analysis if evolution is not accounted for, and this is exacerbated by multi-sampling of the same tumor. We present a novel approach for model-based tumor subclonal reconstruction (MOBSTER) that combines machine learning with theoretical population genetics. Using public whole-genome sequencing data from 2,606 samples from different cohorts, new data and synthetic validation, we show this method is more robust and accurate than current techniques in single sample, multi-region and longitudinal data. This approach minimizes the confounding factors of non-evolutionary methods, leading to more accurate recovery of the evolutionary history of human cancers. 2020-09-01 2020-09-02 /pmc/articles/PMC7610388/ /pubmed/32879509 http://dx.doi.org/10.1038/s41588-020-0675-5 Text en http://www.nature.com/authors/editorial_policies/license.html#termsUsers may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Caravagna, Giulio Heide, Timon Williams, Marc J. Zapata, Luis Nichol, Daniel Chkhaidze, Ketevan Cross, William Cresswell, George D. Werner, Benjamin Acar, Ahmet Chesler, Louis Barnes, Chris P. Sanguinetti, Guido Graham, Trevor A. Sottoriva, Andrea Subclonal reconstruction of tumors using machine learning and population genetics |
title | Subclonal reconstruction of tumors using machine learning and population genetics |
title_full | Subclonal reconstruction of tumors using machine learning and population genetics |
title_fullStr | Subclonal reconstruction of tumors using machine learning and population genetics |
title_full_unstemmed | Subclonal reconstruction of tumors using machine learning and population genetics |
title_short | Subclonal reconstruction of tumors using machine learning and population genetics |
title_sort | subclonal reconstruction of tumors using machine learning and population genetics |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7610388/ https://www.ncbi.nlm.nih.gov/pubmed/32879509 http://dx.doi.org/10.1038/s41588-020-0675-5 |
work_keys_str_mv | AT caravagnagiulio subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT heidetimon subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT williamsmarcj subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT zapataluis subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT nicholdaniel subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT chkhaidzeketevan subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT crosswilliam subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT cresswellgeorged subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT wernerbenjamin subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT acarahmet subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT cheslerlouis subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT barneschrisp subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT sanguinettiguido subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT grahamtrevora subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics AT sottorivaandrea subclonalreconstructionoftumorsusingmachinelearningandpopulationgenetics |