Cargando…

RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data

Robust Principal Component Analysis (RPCA) offers a powerful tool for recovering a low-rank matrix from highly corrupted data, with growing applications in computational biology. Biological processes commonly form intrinsic hierarchical structures, such as tree structures of cell development traject...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Ziwei, Zhang, Bingwei, Gong, Fuzhou, Wan, Lin, Ma, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10030613/
https://www.ncbi.nlm.nih.gov/pubmed/36968591
http://dx.doi.org/10.3389/fgene.2023.1110899
_version_ 1784910416175955968
author Chen, Ziwei
Zhang, Bingwei
Gong, Fuzhou
Wan, Lin
Ma, Liang
author_facet Chen, Ziwei
Zhang, Bingwei
Gong, Fuzhou
Wan, Lin
Ma, Liang
author_sort Chen, Ziwei
collection PubMed
description Robust Principal Component Analysis (RPCA) offers a powerful tool for recovering a low-rank matrix from highly corrupted data, with growing applications in computational biology. Biological processes commonly form intrinsic hierarchical structures, such as tree structures of cell development trajectories and tumor evolutionary history. The rapid development of single-cell sequencing (SCS) technology calls for the recovery of embedded tree structures from noisy and heterogeneous SCS data. In this study, we propose RobustTree, a unified framework to reconstruct the inherent topological structure underlying high-dimensional data with noise. By extending RPCA to handle tree structure optimization, RobustTree leverages data denoising, clustering, and tree structure reconstruction. It solves the tree optimization problem with an adaptive parameter selection scheme that we proposed. In addition to recovering real datasets, RobustTree can reconstruct continuous topological structure and discrete-state topological structure of underlying SCS data. We apply RobustTree on multiple synthetic and real datasets and demonstrate its high accuracy and robustness when analyzing high-noise SCS data with embedded complex structures. The code is available at https://github.com/ucasdp/RobustTree.
format Online
Article
Text
id pubmed-10030613
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100306132023-03-23 RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data Chen, Ziwei Zhang, Bingwei Gong, Fuzhou Wan, Lin Ma, Liang Front Genet Genetics Robust Principal Component Analysis (RPCA) offers a powerful tool for recovering a low-rank matrix from highly corrupted data, with growing applications in computational biology. Biological processes commonly form intrinsic hierarchical structures, such as tree structures of cell development trajectories and tumor evolutionary history. The rapid development of single-cell sequencing (SCS) technology calls for the recovery of embedded tree structures from noisy and heterogeneous SCS data. In this study, we propose RobustTree, a unified framework to reconstruct the inherent topological structure underlying high-dimensional data with noise. By extending RPCA to handle tree structure optimization, RobustTree leverages data denoising, clustering, and tree structure reconstruction. It solves the tree optimization problem with an adaptive parameter selection scheme that we proposed. In addition to recovering real datasets, RobustTree can reconstruct continuous topological structure and discrete-state topological structure of underlying SCS data. We apply RobustTree on multiple synthetic and real datasets and demonstrate its high accuracy and robustness when analyzing high-noise SCS data with embedded complex structures. The code is available at https://github.com/ucasdp/RobustTree. Frontiers Media S.A. 2023-03-08 /pmc/articles/PMC10030613/ /pubmed/36968591 http://dx.doi.org/10.3389/fgene.2023.1110899 Text en Copyright © 2023 Chen, Zhang, Gong, Wan and Ma. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Chen, Ziwei
Zhang, Bingwei
Gong, Fuzhou
Wan, Lin
Ma, Liang
RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data
title RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data
title_full RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data
title_fullStr RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data
title_full_unstemmed RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data
title_short RobustTree: An adaptive, robust PCA algorithm for embedded tree structure recovery from single-cell sequencing data
title_sort robusttree: an adaptive, robust pca algorithm for embedded tree structure recovery from single-cell sequencing data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10030613/
https://www.ncbi.nlm.nih.gov/pubmed/36968591
http://dx.doi.org/10.3389/fgene.2023.1110899
work_keys_str_mv AT chenziwei robusttreeanadaptiverobustpcaalgorithmforembeddedtreestructurerecoveryfromsinglecellsequencingdata
AT zhangbingwei robusttreeanadaptiverobustpcaalgorithmforembeddedtreestructurerecoveryfromsinglecellsequencingdata
AT gongfuzhou robusttreeanadaptiverobustpcaalgorithmforembeddedtreestructurerecoveryfromsinglecellsequencingdata
AT wanlin robusttreeanadaptiverobustpcaalgorithmforembeddedtreestructurerecoveryfromsinglecellsequencingdata
AT maliang robusttreeanadaptiverobustpcaalgorithmforembeddedtreestructurerecoveryfromsinglecellsequencingdata