Cargando…
MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting
Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types....
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575015/ https://www.ncbi.nlm.nih.gov/pubmed/34180954 http://dx.doi.org/10.1093/bib/bbab235 |
_version_ | 1784595602361810944 |
---|---|
author | Zou, Meng Duren, Zhana Yuan, Qiuyue Li, Henry Hutchins, Andrew Paul Wong, Wing Hung Wang, Yong |
author_facet | Zou, Meng Duren, Zhana Yuan, Qiuyue Li, Henry Hutchins, Andrew Paul Wong, Wing Hung Wang, Yong |
author_sort | Zou, Meng |
collection | PubMed |
description | Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types. Here, we propose a method called Mixed Integer programming Model to Identify Cell type-specific marker panel (MIMIC). MIMIC maintains the hierarchical topology among different cell types and simultaneously maximizes the specificity of a fixed number of selected markers. MIMIC was benchmarked on the mouse ENCODE RNA-seq dataset, with 29 diverse tissues, for 43 surface markers (SMs) and 1345 transcription factors (TFs). MIMIC could select biologically meaningful markers and is robust for different accuracy criteria. It shows advantages over the standard single gene-based approaches and widely used dimensional reduction methods, such as multidimensional scaling and t-SNE, both in accuracy and in biological interpretation. Furthermore, the combination of SMs and TFs achieves better specificity than SMs or TFs alone. Applying MIMIC to a large collection of 641 RNA-seq samples covering 231 cell types identifies a panel of TFs and SMs that reveal the modularity of cell type association networks. Finally, the scalability of MIMIC is demonstrated by selecting enhancer markers from mouse ENCODE data. MIMIC is freely available at https://github.com/MengZou1/MIMIC. |
format | Online Article Text |
id | pubmed-8575015 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-85750152021-11-09 MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting Zou, Meng Duren, Zhana Yuan, Qiuyue Li, Henry Hutchins, Andrew Paul Wong, Wing Hung Wang, Yong Brief Bioinform Problem Solving Protocol Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types. Here, we propose a method called Mixed Integer programming Model to Identify Cell type-specific marker panel (MIMIC). MIMIC maintains the hierarchical topology among different cell types and simultaneously maximizes the specificity of a fixed number of selected markers. MIMIC was benchmarked on the mouse ENCODE RNA-seq dataset, with 29 diverse tissues, for 43 surface markers (SMs) and 1345 transcription factors (TFs). MIMIC could select biologically meaningful markers and is robust for different accuracy criteria. It shows advantages over the standard single gene-based approaches and widely used dimensional reduction methods, such as multidimensional scaling and t-SNE, both in accuracy and in biological interpretation. Furthermore, the combination of SMs and TFs achieves better specificity than SMs or TFs alone. Applying MIMIC to a large collection of 641 RNA-seq samples covering 231 cell types identifies a panel of TFs and SMs that reveal the modularity of cell type association networks. Finally, the scalability of MIMIC is demonstrated by selecting enhancer markers from mouse ENCODE data. MIMIC is freely available at https://github.com/MengZou1/MIMIC. Oxford University Press 2021-06-26 /pmc/articles/PMC8575015/ /pubmed/34180954 http://dx.doi.org/10.1093/bib/bbab235 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Problem Solving Protocol Zou, Meng Duren, Zhana Yuan, Qiuyue Li, Henry Hutchins, Andrew Paul Wong, Wing Hung Wang, Yong MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting |
title | MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting |
title_full | MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting |
title_fullStr | MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting |
title_full_unstemmed | MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting |
title_short | MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting |
title_sort | mimic: an optimization method to identify cell type-specific marker panel for cell sorting |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575015/ https://www.ncbi.nlm.nih.gov/pubmed/34180954 http://dx.doi.org/10.1093/bib/bbab235 |
work_keys_str_mv | AT zoumeng mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting AT durenzhana mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting AT yuanqiuyue mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting AT lihenry mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting AT hutchinsandrewpaul mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting AT wongwinghung mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting AT wangyong mimicanoptimizationmethodtoidentifycelltypespecificmarkerpanelforcellsorting |