Cargando…
clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution
BACKGROUND: Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gen...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10544356/ https://www.ncbi.nlm.nih.gov/pubmed/37790621 http://dx.doi.org/10.7717/peerj.16126 |
_version_ | 1785114488544952320 |
---|---|
author | Li, Hongxiang Khang, Tsung Fei |
author_facet | Li, Hongxiang Khang, Tsung Fei |
author_sort | Li, Hongxiang |
collection | PubMed |
description | BACKGROUND: Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gene counts using the negative binomial model, tests of differential variability are challenging to develop, owing to dependence of the variance on the mean. METHODS: Here, we describe clrDV, a statistical method for detecting genes that show differential variability between two populations. We present the skew-normal distribution for modeling gene-wise null distribution of centered log-ratio transformation of compositional RNA-seq data. RESULTS: Simulation results show that clrDV has false discovery rate and probability of Type II error that are on par with or superior to existing methodologies. In addition, its run time is faster than its closest competitors, and remains relatively constant for increasing sample size per group. Analysis of a large neurodegenerative disease RNA-Seq dataset using clrDV successfully recovers multiple gene candidates that have been reported to be associated with Alzheimer’s disease. |
format | Online Article Text |
id | pubmed-10544356 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105443562023-10-03 clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution Li, Hongxiang Khang, Tsung Fei PeerJ Bioinformatics BACKGROUND: Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gene counts using the negative binomial model, tests of differential variability are challenging to develop, owing to dependence of the variance on the mean. METHODS: Here, we describe clrDV, a statistical method for detecting genes that show differential variability between two populations. We present the skew-normal distribution for modeling gene-wise null distribution of centered log-ratio transformation of compositional RNA-seq data. RESULTS: Simulation results show that clrDV has false discovery rate and probability of Type II error that are on par with or superior to existing methodologies. In addition, its run time is faster than its closest competitors, and remains relatively constant for increasing sample size per group. Analysis of a large neurodegenerative disease RNA-Seq dataset using clrDV successfully recovers multiple gene candidates that have been reported to be associated with Alzheimer’s disease. PeerJ Inc. 2023-09-29 /pmc/articles/PMC10544356/ /pubmed/37790621 http://dx.doi.org/10.7717/peerj.16126 Text en ©2023 Li and Khang https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Li, Hongxiang Khang, Tsung Fei clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution |
title | clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution |
title_full | clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution |
title_fullStr | clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution |
title_full_unstemmed | clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution |
title_short | clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution |
title_sort | clrdv: a differential variability test for rna-seq data based on the skew-normal distribution |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10544356/ https://www.ncbi.nlm.nih.gov/pubmed/37790621 http://dx.doi.org/10.7717/peerj.16126 |
work_keys_str_mv | AT lihongxiang clrdvadifferentialvariabilitytestforrnaseqdatabasedontheskewnormaldistribution AT khangtsungfei clrdvadifferentialvariabilitytestforrnaseqdatabasedontheskewnormaldistribution |