Cargando…

MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting

Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types....

Descripción completa

Detalles Bibliográficos
Autores principales: Zou, Meng, Duren, Zhana, Yuan, Qiuyue, Li, Henry, Hutchins, Andrew Paul, Wong, Wing Hung, Wang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575015/
https://www.ncbi.nlm.nih.gov/pubmed/34180954
http://dx.doi.org/10.1093/bib/bbab235
_version_ 1784595602361810944
author Zou, Meng
Duren, Zhana
Yuan, Qiuyue
Li, Henry
Hutchins, Andrew Paul
Wong, Wing Hung
Wang, Yong
author_facet Zou, Meng
Duren, Zhana
Yuan, Qiuyue
Li, Henry
Hutchins, Andrew Paul
Wong, Wing Hung
Wang, Yong
author_sort Zou, Meng
collection PubMed
description Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types. Here, we propose a method called Mixed Integer programming Model to Identify Cell type-specific marker panel (MIMIC). MIMIC maintains the hierarchical topology among different cell types and simultaneously maximizes the specificity of a fixed number of selected markers. MIMIC was benchmarked on the mouse ENCODE RNA-seq dataset, with 29 diverse tissues, for 43 surface markers (SMs) and 1345 transcription factors (TFs). MIMIC could select biologically meaningful markers and is robust for different accuracy criteria. It shows advantages over the standard single gene-based approaches and widely used dimensional reduction methods, such as multidimensional scaling and t-SNE, both in accuracy and in biological interpretation. Furthermore, the combination of SMs and TFs achieves better specificity than SMs or TFs alone. Applying MIMIC to a large collection of 641 RNA-seq samples covering 231 cell types identifies a panel of TFs and SMs that reveal the modularity of cell type association networks. Finally, the scalability of MIMIC is demonstrated by selecting enhancer markers from mouse ENCODE data. MIMIC is freely available at https://github.com/MengZou1/MIMIC.
format Online
Article
Text
id pubmed-8575015
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85750152021-11-09 MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting Zou, Meng Duren, Zhana Yuan, Qiuyue Li, Henry Hutchins, Andrew Paul Wong, Wing Hung Wang, Yong Brief Bioinform Problem Solving Protocol Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types. Here, we propose a method called Mixed Integer programming Model to Identify Cell type-specific marker panel (MIMIC). MIMIC maintains the hierarchical topology among different cell types and simultaneously maximizes the specificity of a fixed number of selected markers. MIMIC was benchmarked on the mouse ENCODE RNA-seq dataset, with 29 diverse tissues, for 43 surface markers (SMs) and 1345 transcription factors (TFs). MIMIC could select biologically meaningful markers and is robust for different accuracy criteria. It shows advantages over the standard single gene-based approaches and widely used dimensional reduction methods, such as multidimensional scaling and t-SNE, both in accuracy and in biological interpretation. Furthermore, the combination of SMs and TFs achieves better specificity than SMs or TFs alone. Applying MIMIC to a large collection of 641 RNA-seq samples covering 231 cell types identifies a panel of TFs and SMs that reveal the modularity of cell type association networks. Finally, the scalability of MIMIC is demonstrated by selecting enhancer markers from mouse ENCODE data. MIMIC is freely available at https://github.com/MengZou1/MIMIC. Oxford University Press 2021-06-26 /pmc/articles/PMC8575015/ /pubmed/34180954 http://dx.doi.org/10.1093/bib/bbab235 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Zou, Meng
Duren, Zhana
Yuan, Qiuyue
Li, Henry
Hutchins, Andrew Paul
Wong, Wing Hung
Wang, Yong
MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting
title MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting
title_full MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting
title_fullStr MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting
title_full_unstemmed MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting
title_short MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting
title_sort mimic: an optimization method to identify cell type-specific marker panel for cell sorting
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575015/
https://www.ncbi.nlm.nih.gov/pubmed/34180954
http://dx.doi.org/10.1093/bib/bbab235
work_keys_str_mv AT zoumeng mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting
AT durenzhana mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting
AT yuanqiuyue mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting
AT lihenry mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting
AT hutchinsandrewpaul mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting
AT wongwinghung mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting
AT wangyong mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting