Cargando…

Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm

Single-cell genome sequencing provides a highly granular view of biological systems but is affected by high error rates, allelic amplification bias, and uneven genome coverage. This creates a need for data-specific computational methods, for purposes such as for cell lineage tree inference. The obje...

Descripción completa

Detalles Bibliográficos
Autores principales: Foroughmand-Araabi, Mohammad-Hadi, Goliaei, Sama, McHardy, Alice C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426887/
https://www.ncbi.nlm.nih.gov/pubmed/35951662
http://dx.doi.org/10.1371/journal.pcbi.1009100
_version_ 1784778778674724864
author Foroughmand-Araabi, Mohammad-Hadi
Goliaei, Sama
McHardy, Alice C.
author_facet Foroughmand-Araabi, Mohammad-Hadi
Goliaei, Sama
McHardy, Alice C.
author_sort Foroughmand-Araabi, Mohammad-Hadi
collection PubMed
description Single-cell genome sequencing provides a highly granular view of biological systems but is affected by high error rates, allelic amplification bias, and uneven genome coverage. This creates a need for data-specific computational methods, for purposes such as for cell lineage tree inference. The objective of cell lineage tree reconstruction is to infer the evolutionary process that generated a set of observed cell genomes. Lineage trees may enable a better understanding of tumor formation and growth, as well as of organ development for healthy body cells. We describe a method, Scelestial, for lineage tree reconstruction from single-cell data, which is based on an approximation algorithm for the Steiner tree problem and is a generalization of the neighbor-joining method. We adapt the algorithm to efficiently select a limited subset of potential sequences as internal nodes, in the presence of missing values, and to minimize cost by lineage tree-based missing value imputation. In a comparison against seven state-of-the-art single-cell lineage tree reconstruction algorithms—BitPhylogeny, OncoNEM, SCITE, SiFit, SASC, SCIPhI, and SiCloneFit—on simulated and real single-cell tumor samples, Scelestial performed best at reconstructing trees in terms of accuracy and run time. Scelestial has been implemented in C++. It is also available as an R package named RScelestial.
format Online
Article
Text
id pubmed-9426887
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-94268872022-08-31 Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm Foroughmand-Araabi, Mohammad-Hadi Goliaei, Sama McHardy, Alice C. PLoS Comput Biol Research Article Single-cell genome sequencing provides a highly granular view of biological systems but is affected by high error rates, allelic amplification bias, and uneven genome coverage. This creates a need for data-specific computational methods, for purposes such as for cell lineage tree inference. The objective of cell lineage tree reconstruction is to infer the evolutionary process that generated a set of observed cell genomes. Lineage trees may enable a better understanding of tumor formation and growth, as well as of organ development for healthy body cells. We describe a method, Scelestial, for lineage tree reconstruction from single-cell data, which is based on an approximation algorithm for the Steiner tree problem and is a generalization of the neighbor-joining method. We adapt the algorithm to efficiently select a limited subset of potential sequences as internal nodes, in the presence of missing values, and to minimize cost by lineage tree-based missing value imputation. In a comparison against seven state-of-the-art single-cell lineage tree reconstruction algorithms—BitPhylogeny, OncoNEM, SCITE, SiFit, SASC, SCIPhI, and SiCloneFit—on simulated and real single-cell tumor samples, Scelestial performed best at reconstructing trees in terms of accuracy and run time. Scelestial has been implemented in C++. It is also available as an R package named RScelestial. Public Library of Science 2022-08-11 /pmc/articles/PMC9426887/ /pubmed/35951662 http://dx.doi.org/10.1371/journal.pcbi.1009100 Text en © 2022 Foroughmand-Araabi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Foroughmand-Araabi, Mohammad-Hadi
Goliaei, Sama
McHardy, Alice C.
Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm
title Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm
title_full Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm
title_fullStr Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm
title_full_unstemmed Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm
title_short Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm
title_sort scelestial: fast and accurate single-cell lineage tree inference based on a steiner tree approximation algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426887/
https://www.ncbi.nlm.nih.gov/pubmed/35951662
http://dx.doi.org/10.1371/journal.pcbi.1009100
work_keys_str_mv AT foroughmandaraabimohammadhadi scelestialfastandaccuratesinglecelllineagetreeinferencebasedonasteinertreeapproximationalgorithm
AT goliaeisama scelestialfastandaccuratesinglecelllineagetreeinferencebasedonasteinertreeapproximationalgorithm
AT mchardyalicec scelestialfastandaccuratesinglecelllineagetreeinferencebasedonasteinertreeapproximationalgorithm