Cargando…
An introduction to representation learning for single-cell data analysis
Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475795/ https://www.ncbi.nlm.nih.gov/pubmed/37671013 http://dx.doi.org/10.1016/j.crmeth.2023.100547 |
_version_ | 1785100793651658752 |
---|---|
author | Gunawan, Ihuan Vafaee, Fatemeh Meijering, Erik Lock, John George |
author_facet | Gunawan, Ihuan Vafaee, Fatemeh Meijering, Erik Lock, John George |
author_sort | Gunawan, Ihuan |
collection | PubMed |
description | Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data by projecting them into lower-dimensional embeddings. This facilitates the interpretation and interrogation of the structures, dynamics, and regulation of cell heterogeneity. Reflecting their central role in analyzing diverse single-cell data types, a myriad of representation learning methods exist, with new approaches continually emerging. Here, we contrast general features of representation learning methods spanning statistical, manifold learning, and neural network approaches. We consider key steps involved in representation learning with single-cell data, including data pre-processing, hyperparameter optimization, downstream analysis, and biological validation. Interdependencies and contingencies linking these steps are also highlighted. This overview is intended to guide researchers in the selection, application, and optimization of representation learning strategies for current and future single-cell research applications. |
format | Online Article Text |
id | pubmed-10475795 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-104757952023-09-05 An introduction to representation learning for single-cell data analysis Gunawan, Ihuan Vafaee, Fatemeh Meijering, Erik Lock, John George Cell Rep Methods Perspective Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data by projecting them into lower-dimensional embeddings. This facilitates the interpretation and interrogation of the structures, dynamics, and regulation of cell heterogeneity. Reflecting their central role in analyzing diverse single-cell data types, a myriad of representation learning methods exist, with new approaches continually emerging. Here, we contrast general features of representation learning methods spanning statistical, manifold learning, and neural network approaches. We consider key steps involved in representation learning with single-cell data, including data pre-processing, hyperparameter optimization, downstream analysis, and biological validation. Interdependencies and contingencies linking these steps are also highlighted. This overview is intended to guide researchers in the selection, application, and optimization of representation learning strategies for current and future single-cell research applications. Elsevier 2023-08-02 /pmc/articles/PMC10475795/ /pubmed/37671013 http://dx.doi.org/10.1016/j.crmeth.2023.100547 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Perspective Gunawan, Ihuan Vafaee, Fatemeh Meijering, Erik Lock, John George An introduction to representation learning for single-cell data analysis |
title | An introduction to representation learning for single-cell data analysis |
title_full | An introduction to representation learning for single-cell data analysis |
title_fullStr | An introduction to representation learning for single-cell data analysis |
title_full_unstemmed | An introduction to representation learning for single-cell data analysis |
title_short | An introduction to representation learning for single-cell data analysis |
title_sort | introduction to representation learning for single-cell data analysis |
topic | Perspective |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10475795/ https://www.ncbi.nlm.nih.gov/pubmed/37671013 http://dx.doi.org/10.1016/j.crmeth.2023.100547 |
work_keys_str_mv | AT gunawanihuan anintroductiontorepresentationlearningforsinglecelldataanalysis AT vafaeefatemeh anintroductiontorepresentationlearningforsinglecelldataanalysis AT meijeringerik anintroductiontorepresentationlearningforsinglecelldataanalysis AT lockjohngeorge anintroductiontorepresentationlearningforsinglecelldataanalysis AT gunawanihuan introductiontorepresentationlearningforsinglecelldataanalysis AT vafaeefatemeh introductiontorepresentationlearningforsinglecelldataanalysis AT meijeringerik introductiontorepresentationlearningforsinglecelldataanalysis AT lockjohngeorge introductiontorepresentationlearningforsinglecelldataanalysis |