Cargando…

Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data

BACKGROUND: Genetic heterogeneity of a cancer tumor that develops during clonal evolution is one of the reasons for cancer treatment failure, by increasing the chance of drug resistance. Clones are cell populations with different genotypes, resulting from differences in somatic mutations that occur...

Descripción completa

Detalles Bibliográficos
Autores principales: Baghaarabani, Leila, Goliaei, Sama, Foroughmand-Araabi, Mohammad-Hadi, Shariatpanahi, Seyed Peyman, Goliaei, Bahram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404257/
https://www.ncbi.nlm.nih.gov/pubmed/34461827
http://dx.doi.org/10.1186/s12859-021-04338-7
_version_ 1783746134217326592
author Baghaarabani, Leila
Goliaei, Sama
Foroughmand-Araabi, Mohammad-Hadi
Shariatpanahi, Seyed Peyman
Goliaei, Bahram
author_facet Baghaarabani, Leila
Goliaei, Sama
Foroughmand-Araabi, Mohammad-Hadi
Shariatpanahi, Seyed Peyman
Goliaei, Bahram
author_sort Baghaarabani, Leila
collection PubMed
description BACKGROUND: Genetic heterogeneity of a cancer tumor that develops during clonal evolution is one of the reasons for cancer treatment failure, by increasing the chance of drug resistance. Clones are cell populations with different genotypes, resulting from differences in somatic mutations that occur and accumulate during cancer development. An appropriate approach for identifying clones is determining the variant allele frequency of mutations that occurred in the tumor. Although bulk sequencing data can be used to provide that information, the frequencies are not informative enough for identifying different clones with the same prevalence and their evolutionary relationships. On the other hand, single-cell sequencing data provides valuable information about branching events in the evolution of a cancerous tumor. However, the temporal order of mutations may be determined with ambiguities using only single-cell data, while variant allele frequencies from bulk sequencing data can provide beneficial information for inferring the temporal order of mutations with fewer ambiguities. RESULT: In this study, a new method called Conifer (ClONal tree Inference For hEterogeneity of tumoR) is proposed which combines aggregated variant allele frequency from bulk sequencing data with branching event information from single-cell sequencing data to more accurately identify clones and their evolutionary relationships. It is proven that the accuracy of clone identification and clonal tree inference is increased by using Conifer compared to other existing methods on various sets of simulated data. In addition, it is discussed that the evolutionary tree provided by Conifer on real cancer data sets is highly consistent with information in both bulk and single-cell data. CONCLUSIONS: In this study, we have provided an accurate and robust method to identify clones of tumor heterogeneity and their evolutionary history by combining single-cell and bulk sequencing data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04338-7.
format Online
Article
Text
id pubmed-8404257
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-84042572021-08-30 Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data Baghaarabani, Leila Goliaei, Sama Foroughmand-Araabi, Mohammad-Hadi Shariatpanahi, Seyed Peyman Goliaei, Bahram BMC Bioinformatics Research BACKGROUND: Genetic heterogeneity of a cancer tumor that develops during clonal evolution is one of the reasons for cancer treatment failure, by increasing the chance of drug resistance. Clones are cell populations with different genotypes, resulting from differences in somatic mutations that occur and accumulate during cancer development. An appropriate approach for identifying clones is determining the variant allele frequency of mutations that occurred in the tumor. Although bulk sequencing data can be used to provide that information, the frequencies are not informative enough for identifying different clones with the same prevalence and their evolutionary relationships. On the other hand, single-cell sequencing data provides valuable information about branching events in the evolution of a cancerous tumor. However, the temporal order of mutations may be determined with ambiguities using only single-cell data, while variant allele frequencies from bulk sequencing data can provide beneficial information for inferring the temporal order of mutations with fewer ambiguities. RESULT: In this study, a new method called Conifer (ClONal tree Inference For hEterogeneity of tumoR) is proposed which combines aggregated variant allele frequency from bulk sequencing data with branching event information from single-cell sequencing data to more accurately identify clones and their evolutionary relationships. It is proven that the accuracy of clone identification and clonal tree inference is increased by using Conifer compared to other existing methods on various sets of simulated data. In addition, it is discussed that the evolutionary tree provided by Conifer on real cancer data sets is highly consistent with information in both bulk and single-cell data. CONCLUSIONS: In this study, we have provided an accurate and robust method to identify clones of tumor heterogeneity and their evolutionary history by combining single-cell and bulk sequencing data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04338-7. BioMed Central 2021-08-30 /pmc/articles/PMC8404257/ /pubmed/34461827 http://dx.doi.org/10.1186/s12859-021-04338-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Baghaarabani, Leila
Goliaei, Sama
Foroughmand-Araabi, Mohammad-Hadi
Shariatpanahi, Seyed Peyman
Goliaei, Bahram
Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
title Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
title_full Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
title_fullStr Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
title_full_unstemmed Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
title_short Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
title_sort conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404257/
https://www.ncbi.nlm.nih.gov/pubmed/34461827
http://dx.doi.org/10.1186/s12859-021-04338-7
work_keys_str_mv AT baghaarabanileila coniferclonaltreeinferencefortumorheterogeneitywithsinglecellandbulksequencingdata
AT goliaeisama coniferclonaltreeinferencefortumorheterogeneitywithsinglecellandbulksequencingdata
AT foroughmandaraabimohammadhadi coniferclonaltreeinferencefortumorheterogeneitywithsinglecellandbulksequencingdata
AT shariatpanahiseyedpeyman coniferclonaltreeinferencefortumorheterogeneitywithsinglecellandbulksequencingdata
AT goliaeibahram coniferclonaltreeinferencefortumorheterogeneitywithsinglecellandbulksequencingdata