Cargando…

Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis

Reference-based cell-type annotation can significantly reduce time and effort in single-cell analysis by transferring labels from a previously-annotated dataset to a new dataset. However, label transfer by end-to-end computational methods is challenging due to the entanglement of technical (e.g., fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Furui, Keller, Mark S, Qu, Huamin, Gehlenborg, Nils, Wang, Qianwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039961/
https://www.ncbi.nlm.nih.gov/pubmed/36155452
http://dx.doi.org/10.1109/TVCG.2022.3209408
_version_ 1784912380727132160
author Cheng, Furui
Keller, Mark S
Qu, Huamin
Gehlenborg, Nils
Wang, Qianwen
author_facet Cheng, Furui
Keller, Mark S
Qu, Huamin
Gehlenborg, Nils
Wang, Qianwen
author_sort Cheng, Furui
collection PubMed
description Reference-based cell-type annotation can significantly reduce time and effort in single-cell analysis by transferring labels from a previously-annotated dataset to a new dataset. However, label transfer by end-to-end computational methods is challenging due to the entanglement of technical (e.g., from different sequencing batches or techniques) and biological (e.g., from different cellular microenvironments) variations, only the first of which must be removed. To address this issue, we propose Polyphony, an interactive transfer learning (ITL) framework, to complement biologists’ knowledge with advanced computational methods. Polyphony is motivated and guided by domain experts’ needs for a controllable, interactive, and algorithm-assisted annotation process, identified through interviews with seven biologists. We introduce anchors, i.e., analogous cell populations across datasets, as a paradigm to explain the computational process and collect user feedback for model improvement. We further design a set of visualizations and interactions to empower users to add, delete, or modify anchors, resulting in refined cell type annotations. The effectiveness of this approach is demonstrated through quantitative experiments, two hypothetical use cases, and interviews with two biologists. The results show that our anchor-based ITL method takes advantage of both human and machine intelligence in annotating massive single-cell datasets.
format Online
Article
Text
id pubmed-10039961
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-100399612023-04-05 Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis Cheng, Furui Keller, Mark S Qu, Huamin Gehlenborg, Nils Wang, Qianwen IEEE Trans Vis Comput Graph Article Reference-based cell-type annotation can significantly reduce time and effort in single-cell analysis by transferring labels from a previously-annotated dataset to a new dataset. However, label transfer by end-to-end computational methods is challenging due to the entanglement of technical (e.g., from different sequencing batches or techniques) and biological (e.g., from different cellular microenvironments) variations, only the first of which must be removed. To address this issue, we propose Polyphony, an interactive transfer learning (ITL) framework, to complement biologists’ knowledge with advanced computational methods. Polyphony is motivated and guided by domain experts’ needs for a controllable, interactive, and algorithm-assisted annotation process, identified through interviews with seven biologists. We introduce anchors, i.e., analogous cell populations across datasets, as a paradigm to explain the computational process and collect user feedback for model improvement. We further design a set of visualizations and interactions to empower users to add, delete, or modify anchors, resulting in refined cell type annotations. The effectiveness of this approach is demonstrated through quantitative experiments, two hypothetical use cases, and interviews with two biologists. The results show that our anchor-based ITL method takes advantage of both human and machine intelligence in annotating massive single-cell datasets. 2023-01 2022-12-20 /pmc/articles/PMC10039961/ /pubmed/36155452 http://dx.doi.org/10.1109/TVCG.2022.3209408 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Cheng, Furui
Keller, Mark S
Qu, Huamin
Gehlenborg, Nils
Wang, Qianwen
Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis
title Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis
title_full Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis
title_fullStr Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis
title_full_unstemmed Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis
title_short Polyphony: an Interactive Transfer Learning Framework for Single-Cell Data Analysis
title_sort polyphony: an interactive transfer learning framework for single-cell data analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039961/
https://www.ncbi.nlm.nih.gov/pubmed/36155452
http://dx.doi.org/10.1109/TVCG.2022.3209408
work_keys_str_mv AT chengfurui polyphonyaninteractivetransferlearningframeworkforsinglecelldataanalysis
AT kellermarks polyphonyaninteractivetransferlearningframeworkforsinglecelldataanalysis
AT quhuamin polyphonyaninteractivetransferlearningframeworkforsinglecelldataanalysis
AT gehlenborgnils polyphonyaninteractivetransferlearningframeworkforsinglecelldataanalysis
AT wangqianwen polyphonyaninteractivetransferlearningframeworkforsinglecelldataanalysis