Cargando…

Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data

In this work, we describe the development of Polar Gini Curve, a method for characterizing cluster markers by analyzing single-cell RNA sequencing (scRNA-seq) data. Polar Gini Curve combines the gene expression and the 2D coordinates (“spatial”) information to detect patterns of uniformity in any cl...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Thanh Minh, Jeevan, Jacob John, Xu, Nuo, Chen, Jake Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8864247/
https://www.ncbi.nlm.nih.gov/pubmed/34958962
http://dx.doi.org/10.1016/j.gpb.2020.09.006
_version_ 1784655418245513216
author Nguyen, Thanh Minh
Jeevan, Jacob John
Xu, Nuo
Chen, Jake Y.
author_facet Nguyen, Thanh Minh
Jeevan, Jacob John
Xu, Nuo
Chen, Jake Y.
author_sort Nguyen, Thanh Minh
collection PubMed
description In this work, we describe the development of Polar Gini Curve, a method for characterizing cluster markers by analyzing single-cell RNA sequencing (scRNA-seq) data. Polar Gini Curve combines the gene expression and the 2D coordinates (“spatial”) information to detect patterns of uniformity in any clustered cells from scRNA-seq data. We demonstrate that Polar Gini Curve can help users characterize the shape and density distribution of cells in a particular cluster, which can be generated during routine scRNA-seq data analysis. To quantify the extent to which a gene is uniformly distributed in a cell cluster space, we combine two polar Gini curves (PGCs)—one drawn upon the cell-points expressing the gene (the “foreground curve”) and the other drawn upon all cell-points in the cluster (the “background curve”). We show that genes with highly dissimilar foreground and background curves tend not to uniformly distributed in the cell cluster—thus having spatially divergent gene expression patterns within the cluster. Genes with similar foreground and background curves tend to uniformly distributed in the cell cluster—thus having uniform gene expression patterns within the cluster. Such quantitative attributes of PGCs can be applied to sensitively discover biomarkers across clusters from scRNA-seq data. We demonstrate the performance of the Polar Gini Curve framework in several simulation case studies. Using this framework to analyze a real-world neonatal mouse heart cell dataset, the detected biomarkers may characterize novel subtypes of cardiac muscle cells. The source code and data for Polar Gini Curve could be found at http://discovery.informatics.uab.edu/PGC/ or https://figshare.com/projects/Polar_Gini_Curve/76749.
format Online
Article
Text
id pubmed-8864247
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-88642472022-03-02 Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data Nguyen, Thanh Minh Jeevan, Jacob John Xu, Nuo Chen, Jake Y. Genomics Proteomics Bioinformatics Method In this work, we describe the development of Polar Gini Curve, a method for characterizing cluster markers by analyzing single-cell RNA sequencing (scRNA-seq) data. Polar Gini Curve combines the gene expression and the 2D coordinates (“spatial”) information to detect patterns of uniformity in any clustered cells from scRNA-seq data. We demonstrate that Polar Gini Curve can help users characterize the shape and density distribution of cells in a particular cluster, which can be generated during routine scRNA-seq data analysis. To quantify the extent to which a gene is uniformly distributed in a cell cluster space, we combine two polar Gini curves (PGCs)—one drawn upon the cell-points expressing the gene (the “foreground curve”) and the other drawn upon all cell-points in the cluster (the “background curve”). We show that genes with highly dissimilar foreground and background curves tend not to uniformly distributed in the cell cluster—thus having spatially divergent gene expression patterns within the cluster. Genes with similar foreground and background curves tend to uniformly distributed in the cell cluster—thus having uniform gene expression patterns within the cluster. Such quantitative attributes of PGCs can be applied to sensitively discover biomarkers across clusters from scRNA-seq data. We demonstrate the performance of the Polar Gini Curve framework in several simulation case studies. Using this framework to analyze a real-world neonatal mouse heart cell dataset, the detected biomarkers may characterize novel subtypes of cardiac muscle cells. The source code and data for Polar Gini Curve could be found at http://discovery.informatics.uab.edu/PGC/ or https://figshare.com/projects/Polar_Gini_Curve/76749. Elsevier 2021-06 2021-12-25 /pmc/articles/PMC8864247/ /pubmed/34958962 http://dx.doi.org/10.1016/j.gpb.2020.09.006 Text en © 2021 Beijing Institute of Genomics https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Method
Nguyen, Thanh Minh
Jeevan, Jacob John
Xu, Nuo
Chen, Jake Y.
Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
title Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
title_full Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
title_fullStr Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
title_full_unstemmed Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
title_short Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
title_sort polar gini curve: a technique to discover gene expression spatial patterns from single-cell rna-seq data
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8864247/
https://www.ncbi.nlm.nih.gov/pubmed/34958962
http://dx.doi.org/10.1016/j.gpb.2020.09.006
work_keys_str_mv AT nguyenthanhminh polarginicurveatechniquetodiscovergeneexpressionspatialpatternsfromsinglecellrnaseqdata
AT jeevanjacobjohn polarginicurveatechniquetodiscovergeneexpressionspatialpatternsfromsinglecellrnaseqdata
AT xunuo polarginicurveatechniquetodiscovergeneexpressionspatialpatternsfromsinglecellrnaseqdata
AT chenjakey polarginicurveatechniquetodiscovergeneexpressionspatialpatternsfromsinglecellrnaseqdata