Cargando…

clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution

BACKGROUND: Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Hongxiang, Khang, Tsung Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10544356/
https://www.ncbi.nlm.nih.gov/pubmed/37790621
http://dx.doi.org/10.7717/peerj.16126
_version_ 1785114488544952320
author Li, Hongxiang
Khang, Tsung Fei
author_facet Li, Hongxiang
Khang, Tsung Fei
author_sort Li, Hongxiang
collection PubMed
description BACKGROUND: Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gene counts using the negative binomial model, tests of differential variability are challenging to develop, owing to dependence of the variance on the mean. METHODS: Here, we describe clrDV, a statistical method for detecting genes that show differential variability between two populations. We present the skew-normal distribution for modeling gene-wise null distribution of centered log-ratio transformation of compositional RNA-seq data. RESULTS: Simulation results show that clrDV has false discovery rate and probability of Type II error that are on par with or superior to existing methodologies. In addition, its run time is faster than its closest competitors, and remains relatively constant for increasing sample size per group. Analysis of a large neurodegenerative disease RNA-Seq dataset using clrDV successfully recovers multiple gene candidates that have been reported to be associated with Alzheimer’s disease.
format Online
Article
Text
id pubmed-10544356
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-105443562023-10-03 clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution Li, Hongxiang Khang, Tsung Fei PeerJ Bioinformatics BACKGROUND: Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gene counts using the negative binomial model, tests of differential variability are challenging to develop, owing to dependence of the variance on the mean. METHODS: Here, we describe clrDV, a statistical method for detecting genes that show differential variability between two populations. We present the skew-normal distribution for modeling gene-wise null distribution of centered log-ratio transformation of compositional RNA-seq data. RESULTS: Simulation results show that clrDV has false discovery rate and probability of Type II error that are on par with or superior to existing methodologies. In addition, its run time is faster than its closest competitors, and remains relatively constant for increasing sample size per group. Analysis of a large neurodegenerative disease RNA-Seq dataset using clrDV successfully recovers multiple gene candidates that have been reported to be associated with Alzheimer’s disease. PeerJ Inc. 2023-09-29 /pmc/articles/PMC10544356/ /pubmed/37790621 http://dx.doi.org/10.7717/peerj.16126 Text en ©2023 Li and Khang https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Li, Hongxiang
Khang, Tsung Fei
clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution
title clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution
title_full clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution
title_fullStr clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution
title_full_unstemmed clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution
title_short clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution
title_sort clrdv: a differential variability test for rna-seq data based on the skew-normal distribution
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10544356/
https://www.ncbi.nlm.nih.gov/pubmed/37790621
http://dx.doi.org/10.7717/peerj.16126
work_keys_str_mv AT lihongxiang clrdvadifferentialvariabilitytestforrnaseqdatabasedontheskewnormaldistribution
AT khangtsungfei clrdvadifferentialvariabilitytestforrnaseqdatabasedontheskewnormaldistribution