Cargando…

A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data

The high-throughput gene expression data generated from recent single-cell RNA sequencing (scRNA-seq) and parallel single-cell reverse transcription quantitative real-time PCR (scRT-qPCR) technologies enable biologists to study the function of transcriptome at the level of individual cells. Compared...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Yang, Lee, Ji-Hyun, Kang, Huining, Jiang, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8872627/
https://www.ncbi.nlm.nih.gov/pubmed/35205420
http://dx.doi.org/10.3390/genes13020377
_version_ 1784657283920166912
author Shi, Yang
Lee, Ji-Hyun
Kang, Huining
Jiang, Hui
author_facet Shi, Yang
Lee, Ji-Hyun
Kang, Huining
Jiang, Hui
author_sort Shi, Yang
collection PubMed
description The high-throughput gene expression data generated from recent single-cell RNA sequencing (scRNA-seq) and parallel single-cell reverse transcription quantitative real-time PCR (scRT-qPCR) technologies enable biologists to study the function of transcriptome at the level of individual cells. Compared with bulk RNA-seq and RT-qPCR gene expression data, single-cell data show notable distinct features, including excessive zero expression values, high variability, and clustered design. We propose to model single-cell high-throughput gene expression data using a two-part mixed model, which not only adequately accounts for the aforementioned features of single-cell expression data but also provides the flexibility of adjusting for covariates. An efficient computational algorithm, automatic differentiation, is used for estimating the model parameters. Compared with existing methods, our approach shows improved power for detecting differential expressed genes in single-cell high-throughput gene expression data.
format Online
Article
Text
id pubmed-8872627
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88726272022-02-25 A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data Shi, Yang Lee, Ji-Hyun Kang, Huining Jiang, Hui Genes (Basel) Article The high-throughput gene expression data generated from recent single-cell RNA sequencing (scRNA-seq) and parallel single-cell reverse transcription quantitative real-time PCR (scRT-qPCR) technologies enable biologists to study the function of transcriptome at the level of individual cells. Compared with bulk RNA-seq and RT-qPCR gene expression data, single-cell data show notable distinct features, including excessive zero expression values, high variability, and clustered design. We propose to model single-cell high-throughput gene expression data using a two-part mixed model, which not only adequately accounts for the aforementioned features of single-cell expression data but also provides the flexibility of adjusting for covariates. An efficient computational algorithm, automatic differentiation, is used for estimating the model parameters. Compared with existing methods, our approach shows improved power for detecting differential expressed genes in single-cell high-throughput gene expression data. MDPI 2022-02-18 /pmc/articles/PMC8872627/ /pubmed/35205420 http://dx.doi.org/10.3390/genes13020377 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shi, Yang
Lee, Ji-Hyun
Kang, Huining
Jiang, Hui
A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data
title A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data
title_full A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data
title_fullStr A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data
title_full_unstemmed A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data
title_short A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data
title_sort two-part mixed model for differential expression analysis in single-cell high-throughput gene expression data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8872627/
https://www.ncbi.nlm.nih.gov/pubmed/35205420
http://dx.doi.org/10.3390/genes13020377
work_keys_str_mv AT shiyang atwopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata
AT leejihyun atwopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata
AT kanghuining atwopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata
AT jianghui atwopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata
AT shiyang twopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata
AT leejihyun twopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata
AT kanghuining twopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata
AT jianghui twopartmixedmodelfordifferentialexpressionanalysisinsinglecellhighthroughputgeneexpressiondata