Cargando…

Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data

Interactions between biological molecules in a cell are tightly coordinated and often highly dynamic. As a result of these varying signaling activities, changes in gene coexpression patterns could often be observed. The advancements in next‐generation sequencing technologies bring new statistical ch...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Zhen, Ho, Yen‐Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8477913/
https://www.ncbi.nlm.nih.gov/pubmed/33720414
http://dx.doi.org/10.1111/biom.13457
_version_ 1784575946720804864
author Yang, Zhen
Ho, Yen‐Yi
author_facet Yang, Zhen
Ho, Yen‐Yi
author_sort Yang, Zhen
collection PubMed
description Interactions between biological molecules in a cell are tightly coordinated and often highly dynamic. As a result of these varying signaling activities, changes in gene coexpression patterns could often be observed. The advancements in next‐generation sequencing technologies bring new statistical challenges for studying these dynamic changes of gene coexpression. In recent years, methods have been developed to examine genomic information from individual cells. Single‐cell RNA sequencing (scRNA‐seq) data are count‐based, and often exhibit characteristics such as overdispersion and zero inflation. To explore the dynamic dependence structure in scRNA‐seq data and other zero‐inflated count data, new approaches are needed. In this paper, we consider overdispersion and zero inflation in count outcomes and propose a ZEro‐inflated negative binomial dynamic COrrelation model (ZENCO). The observed count data are modeled as a mixture of two components: success amplifications and dropout events in ZENCO. A latent variable is incorporated into ZENCO to model the covariate‐dependent correlation structure. We conduct simulation studies to evaluate the performance of our proposed method and to compare it with existing approaches. We also illustrate the implementation of our proposed approach using scRNA‐seq data from a study of minimal residual disease in melanoma.
format Online
Article
Text
id pubmed-8477913
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-84779132022-10-14 Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data Yang, Zhen Ho, Yen‐Yi Biometrics Biometric Practice Interactions between biological molecules in a cell are tightly coordinated and often highly dynamic. As a result of these varying signaling activities, changes in gene coexpression patterns could often be observed. The advancements in next‐generation sequencing technologies bring new statistical challenges for studying these dynamic changes of gene coexpression. In recent years, methods have been developed to examine genomic information from individual cells. Single‐cell RNA sequencing (scRNA‐seq) data are count‐based, and often exhibit characteristics such as overdispersion and zero inflation. To explore the dynamic dependence structure in scRNA‐seq data and other zero‐inflated count data, new approaches are needed. In this paper, we consider overdispersion and zero inflation in count outcomes and propose a ZEro‐inflated negative binomial dynamic COrrelation model (ZENCO). The observed count data are modeled as a mixture of two components: success amplifications and dropout events in ZENCO. A latent variable is incorporated into ZENCO to model the covariate‐dependent correlation structure. We conduct simulation studies to evaluate the performance of our proposed method and to compare it with existing approaches. We also illustrate the implementation of our proposed approach using scRNA‐seq data from a study of minimal residual disease in melanoma. John Wiley and Sons Inc. 2021-03-30 2022-06 /pmc/articles/PMC8477913/ /pubmed/33720414 http://dx.doi.org/10.1111/biom.13457 Text en © 2021 The Authors. Biometrics published by Wiley Periodicals LLC on behalf of International Biometric Society. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Biometric Practice
Yang, Zhen
Ho, Yen‐Yi
Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data
title Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data
title_full Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data
title_fullStr Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data
title_full_unstemmed Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data
title_short Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data
title_sort modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell rna sequencing data
topic Biometric Practice
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8477913/
https://www.ncbi.nlm.nih.gov/pubmed/33720414
http://dx.doi.org/10.1111/biom.13457
work_keys_str_mv AT yangzhen modelingdynamiccorrelationinzeroinflatedbivariatecountdatawithapplicationstosinglecellrnasequencingdata
AT hoyenyi modelingdynamiccorrelationinzeroinflatedbivariatecountdatawithapplicationstosinglecellrnasequencingdata