Cargando…

An introduction to representation learning for single-cell data analysis

Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data...

Descripción completa

Detalles Bibliográficos
Autores principales: Gunawan, Ihuan, Vafaee, Fatemeh, Meijering, Erik, Lock, John George
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475795/
https://www.ncbi.nlm.nih.gov/pubmed/37671013
http://dx.doi.org/10.1016/j.crmeth.2023.100547
_version_ 1785100793651658752
author Gunawan, Ihuan
Vafaee, Fatemeh
Meijering, Erik
Lock, John George
author_facet Gunawan, Ihuan
Vafaee, Fatemeh
Meijering, Erik
Lock, John George
author_sort Gunawan, Ihuan
collection PubMed
description Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data by projecting them into lower-dimensional embeddings. This facilitates the interpretation and interrogation of the structures, dynamics, and regulation of cell heterogeneity. Reflecting their central role in analyzing diverse single-cell data types, a myriad of representation learning methods exist, with new approaches continually emerging. Here, we contrast general features of representation learning methods spanning statistical, manifold learning, and neural network approaches. We consider key steps involved in representation learning with single-cell data, including data pre-processing, hyperparameter optimization, downstream analysis, and biological validation. Interdependencies and contingencies linking these steps are also highlighted. This overview is intended to guide researchers in the selection, application, and optimization of representation learning strategies for current and future single-cell research applications.
format Online
Article
Text
id pubmed-10475795
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-104757952023-09-05 An introduction to representation learning for single-cell data analysis Gunawan, Ihuan Vafaee, Fatemeh Meijering, Erik Lock, John George Cell Rep Methods Perspective Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data by projecting them into lower-dimensional embeddings. This facilitates the interpretation and interrogation of the structures, dynamics, and regulation of cell heterogeneity. Reflecting their central role in analyzing diverse single-cell data types, a myriad of representation learning methods exist, with new approaches continually emerging. Here, we contrast general features of representation learning methods spanning statistical, manifold learning, and neural network approaches. We consider key steps involved in representation learning with single-cell data, including data pre-processing, hyperparameter optimization, downstream analysis, and biological validation. Interdependencies and contingencies linking these steps are also highlighted. This overview is intended to guide researchers in the selection, application, and optimization of representation learning strategies for current and future single-cell research applications. Elsevier 2023-08-02 /pmc/articles/PMC10475795/ /pubmed/37671013 http://dx.doi.org/10.1016/j.crmeth.2023.100547 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Perspective
Gunawan, Ihuan
Vafaee, Fatemeh
Meijering, Erik
Lock, John George
An introduction to representation learning for single-cell data analysis
title An introduction to representation learning for single-cell data analysis
title_full An introduction to representation learning for single-cell data analysis
title_fullStr An introduction to representation learning for single-cell data analysis
title_full_unstemmed An introduction to representation learning for single-cell data analysis
title_short An introduction to representation learning for single-cell data analysis
title_sort introduction to representation learning for single-cell data analysis
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475795/
https://www.ncbi.nlm.nih.gov/pubmed/37671013
http://dx.doi.org/10.1016/j.crmeth.2023.100547
work_keys_str_mv AT gunawanihuan anintroductiontorepresentationlearningforsinglecelldataanalysis
AT vafaeefatemeh anintroductiontorepresentationlearningforsinglecelldataanalysis
AT meijeringerik anintroductiontorepresentationlearningforsinglecelldataanalysis
AT lockjohngeorge anintroductiontorepresentationlearningforsinglecelldataanalysis
AT gunawanihuan introductiontorepresentationlearningforsinglecelldataanalysis
AT vafaeefatemeh introductiontorepresentationlearningforsinglecelldataanalysis
AT meijeringerik introductiontorepresentationlearningforsinglecelldataanalysis
AT lockjohngeorge introductiontorepresentationlearningforsinglecelldataanalysis