Cargando…

Feature selection for preserving biological trajectories in single-cell data

Single-cell technologies can readily measure the expression of thousands of molecular features from individual cells undergoing dynamic biological processes, such as cellular differentiation, immune response, and disease progression. While examining cells along a computationally ordered pseudotime o...

Descripción completa

Detalles Bibliográficos
Autores principales: Ranek, Jolene S., Stallaert, Wayne, Milner, Justin, Stanley, Natalie, Purvis, Jeremy E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10197710/
https://www.ncbi.nlm.nih.gov/pubmed/37214963
http://dx.doi.org/10.1101/2023.05.09.540043
_version_ 1785044598303752192
author Ranek, Jolene S.
Stallaert, Wayne
Milner, Justin
Stanley, Natalie
Purvis, Jeremy E.
author_facet Ranek, Jolene S.
Stallaert, Wayne
Milner, Justin
Stanley, Natalie
Purvis, Jeremy E.
author_sort Ranek, Jolene S.
collection PubMed
description Single-cell technologies can readily measure the expression of thousands of molecular features from individual cells undergoing dynamic biological processes, such as cellular differentiation, immune response, and disease progression. While examining cells along a computationally ordered pseudotime offers the potential to study how subtle changes in gene or protein expression impact cell fate decision-making, identifying characteristic features that drive continuous biological processes remains difficult to detect from unenriched and noisy single-cell data. Given that all profiled sources of feature variation contribute to the cell-to-cell distances that define an inferred cellular trajectory, including confounding sources of biological variation (e.g. cell cycle or metabolic state) or noisy and irrelevant features (e.g. measurements with low signal-to-noise ratio) can mask the underlying trajectory of study and hinder inference. Here, we present DELVE (dynamic selection of locally covarying features), an unsupervised feature selection method for identifying a representative subset of dynamically-expressed molecular features that recapitulates cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effect of unwanted sources of variation confounding inference, and instead models cell states from dynamic feature modules that constitute core regulatory complexes. Using simulations, single-cell RNA sequencing data, and iterative immunofluorescence imaging data in the context of the cell cycle and cellular differentiation, we demonstrate that DELVE selects features that more accurately characterize cell populations and improve the recovery of cell type transitions. This feature selection framework provides an alternative approach for improving trajectory inference and uncovering co-variation amongst features along a biological trajectory. DELVE is implemented as an open-source python package and is publicly available at: https://github.com/jranek/delve.
format Online
Article
Text
id pubmed-10197710
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-101977102023-05-20 Feature selection for preserving biological trajectories in single-cell data Ranek, Jolene S. Stallaert, Wayne Milner, Justin Stanley, Natalie Purvis, Jeremy E. bioRxiv Article Single-cell technologies can readily measure the expression of thousands of molecular features from individual cells undergoing dynamic biological processes, such as cellular differentiation, immune response, and disease progression. While examining cells along a computationally ordered pseudotime offers the potential to study how subtle changes in gene or protein expression impact cell fate decision-making, identifying characteristic features that drive continuous biological processes remains difficult to detect from unenriched and noisy single-cell data. Given that all profiled sources of feature variation contribute to the cell-to-cell distances that define an inferred cellular trajectory, including confounding sources of biological variation (e.g. cell cycle or metabolic state) or noisy and irrelevant features (e.g. measurements with low signal-to-noise ratio) can mask the underlying trajectory of study and hinder inference. Here, we present DELVE (dynamic selection of locally covarying features), an unsupervised feature selection method for identifying a representative subset of dynamically-expressed molecular features that recapitulates cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effect of unwanted sources of variation confounding inference, and instead models cell states from dynamic feature modules that constitute core regulatory complexes. Using simulations, single-cell RNA sequencing data, and iterative immunofluorescence imaging data in the context of the cell cycle and cellular differentiation, we demonstrate that DELVE selects features that more accurately characterize cell populations and improve the recovery of cell type transitions. This feature selection framework provides an alternative approach for improving trajectory inference and uncovering co-variation amongst features along a biological trajectory. DELVE is implemented as an open-source python package and is publicly available at: https://github.com/jranek/delve. Cold Spring Harbor Laboratory 2023-05-12 /pmc/articles/PMC10197710/ /pubmed/37214963 http://dx.doi.org/10.1101/2023.05.09.540043 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Ranek, Jolene S.
Stallaert, Wayne
Milner, Justin
Stanley, Natalie
Purvis, Jeremy E.
Feature selection for preserving biological trajectories in single-cell data
title Feature selection for preserving biological trajectories in single-cell data
title_full Feature selection for preserving biological trajectories in single-cell data
title_fullStr Feature selection for preserving biological trajectories in single-cell data
title_full_unstemmed Feature selection for preserving biological trajectories in single-cell data
title_short Feature selection for preserving biological trajectories in single-cell data
title_sort feature selection for preserving biological trajectories in single-cell data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10197710/
https://www.ncbi.nlm.nih.gov/pubmed/37214963
http://dx.doi.org/10.1101/2023.05.09.540043
work_keys_str_mv AT ranekjolenes featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata
AT stallaertwayne featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata
AT milnerjustin featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata
AT stanleynatalie featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata
AT purvisjeremye featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata