Cargando…
Feature selection for preserving biological trajectories in single-cell data
Single-cell technologies can readily measure the expression of thousands of molecular features from individual cells undergoing dynamic biological processes, such as cellular differentiation, immune response, and disease progression. While examining cells along a computationally ordered pseudotime o...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10197710/ https://www.ncbi.nlm.nih.gov/pubmed/37214963 http://dx.doi.org/10.1101/2023.05.09.540043 |
_version_ | 1785044598303752192 |
---|---|
author | Ranek, Jolene S. Stallaert, Wayne Milner, Justin Stanley, Natalie Purvis, Jeremy E. |
author_facet | Ranek, Jolene S. Stallaert, Wayne Milner, Justin Stanley, Natalie Purvis, Jeremy E. |
author_sort | Ranek, Jolene S. |
collection | PubMed |
description | Single-cell technologies can readily measure the expression of thousands of molecular features from individual cells undergoing dynamic biological processes, such as cellular differentiation, immune response, and disease progression. While examining cells along a computationally ordered pseudotime offers the potential to study how subtle changes in gene or protein expression impact cell fate decision-making, identifying characteristic features that drive continuous biological processes remains difficult to detect from unenriched and noisy single-cell data. Given that all profiled sources of feature variation contribute to the cell-to-cell distances that define an inferred cellular trajectory, including confounding sources of biological variation (e.g. cell cycle or metabolic state) or noisy and irrelevant features (e.g. measurements with low signal-to-noise ratio) can mask the underlying trajectory of study and hinder inference. Here, we present DELVE (dynamic selection of locally covarying features), an unsupervised feature selection method for identifying a representative subset of dynamically-expressed molecular features that recapitulates cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effect of unwanted sources of variation confounding inference, and instead models cell states from dynamic feature modules that constitute core regulatory complexes. Using simulations, single-cell RNA sequencing data, and iterative immunofluorescence imaging data in the context of the cell cycle and cellular differentiation, we demonstrate that DELVE selects features that more accurately characterize cell populations and improve the recovery of cell type transitions. This feature selection framework provides an alternative approach for improving trajectory inference and uncovering co-variation amongst features along a biological trajectory. DELVE is implemented as an open-source python package and is publicly available at: https://github.com/jranek/delve. |
format | Online Article Text |
id | pubmed-10197710 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-101977102023-05-20 Feature selection for preserving biological trajectories in single-cell data Ranek, Jolene S. Stallaert, Wayne Milner, Justin Stanley, Natalie Purvis, Jeremy E. bioRxiv Article Single-cell technologies can readily measure the expression of thousands of molecular features from individual cells undergoing dynamic biological processes, such as cellular differentiation, immune response, and disease progression. While examining cells along a computationally ordered pseudotime offers the potential to study how subtle changes in gene or protein expression impact cell fate decision-making, identifying characteristic features that drive continuous biological processes remains difficult to detect from unenriched and noisy single-cell data. Given that all profiled sources of feature variation contribute to the cell-to-cell distances that define an inferred cellular trajectory, including confounding sources of biological variation (e.g. cell cycle or metabolic state) or noisy and irrelevant features (e.g. measurements with low signal-to-noise ratio) can mask the underlying trajectory of study and hinder inference. Here, we present DELVE (dynamic selection of locally covarying features), an unsupervised feature selection method for identifying a representative subset of dynamically-expressed molecular features that recapitulates cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effect of unwanted sources of variation confounding inference, and instead models cell states from dynamic feature modules that constitute core regulatory complexes. Using simulations, single-cell RNA sequencing data, and iterative immunofluorescence imaging data in the context of the cell cycle and cellular differentiation, we demonstrate that DELVE selects features that more accurately characterize cell populations and improve the recovery of cell type transitions. This feature selection framework provides an alternative approach for improving trajectory inference and uncovering co-variation amongst features along a biological trajectory. DELVE is implemented as an open-source python package and is publicly available at: https://github.com/jranek/delve. Cold Spring Harbor Laboratory 2023-05-12 /pmc/articles/PMC10197710/ /pubmed/37214963 http://dx.doi.org/10.1101/2023.05.09.540043 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Ranek, Jolene S. Stallaert, Wayne Milner, Justin Stanley, Natalie Purvis, Jeremy E. Feature selection for preserving biological trajectories in single-cell data |
title | Feature selection for preserving biological trajectories in single-cell data |
title_full | Feature selection for preserving biological trajectories in single-cell data |
title_fullStr | Feature selection for preserving biological trajectories in single-cell data |
title_full_unstemmed | Feature selection for preserving biological trajectories in single-cell data |
title_short | Feature selection for preserving biological trajectories in single-cell data |
title_sort | feature selection for preserving biological trajectories in single-cell data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10197710/ https://www.ncbi.nlm.nih.gov/pubmed/37214963 http://dx.doi.org/10.1101/2023.05.09.540043 |
work_keys_str_mv | AT ranekjolenes featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata AT stallaertwayne featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata AT milnerjustin featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata AT stanleynatalie featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata AT purvisjeremye featureselectionforpreservingbiologicaltrajectoriesinsinglecelldata |