Cargando…
lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA
BACKGROUND: Long non-coding RNA (lncRNA) expression data have been increasingly used in finding diagnostic and prognostic biomarkers in cancer studies. Existing differential analysis tools for RNA sequencing do not effectively accommodate low abundant genes, as commonly observed in lncRNAs. RESULTS:...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6604377/ https://www.ncbi.nlm.nih.gov/pubmed/31266446 http://dx.doi.org/10.1186/s12864-019-5926-4 |
_version_ | 1783431700757348352 |
---|---|
author | Li, Qian Yu, Xiaoqing Chaudhary, Ritu Slebos, Robbert J. C. Chung, Christine H. Wang, Xuefeng |
author_facet | Li, Qian Yu, Xiaoqing Chaudhary, Ritu Slebos, Robbert J. C. Chung, Christine H. Wang, Xuefeng |
author_sort | Li, Qian |
collection | PubMed |
description | BACKGROUND: Long non-coding RNA (lncRNA) expression data have been increasingly used in finding diagnostic and prognostic biomarkers in cancer studies. Existing differential analysis tools for RNA sequencing do not effectively accommodate low abundant genes, as commonly observed in lncRNAs. RESULTS: We investigated the statistical distribution of normalized counts for low expression genes in lncRNAs and mRNAs, and proposed a new tool lncDIFF based on the underlying distribution pattern to detect differentially expressed (DE) lncRNAs. lncDIFF adopts the generalized linear model with zero-inflated Exponential quasi-likelihood to estimate group effect on normalized counts, and employs the likelihood ratio test to detect differential expressed genes. The proposed method and tool are applicable to data processed with standard RNA-Seq preprocessing and normalization pipelines. Simulation results showed that lncDIFF was able to detect DE genes with more power and lower false discovery rate regardless of the data pattern, compared to DESeq2, edgeR, limma, zinbwave, DEsingle, and ShrinkBayes. In the analysis of a head and neck squamous cell carcinomas data, lncDIFF also appeared to have higher sensitivity in identifying novel lncRNA genes with relatively large fold change and prognostic value. CONCLUSIONS: lncDIFF is a powerful differential analysis tool for low abundance non-coding RNA expression data. This method is compatible with various existing RNA-Seq quantification and normalization tools. lncDIFF is implemented in an R package available at https://github.com/qianli10000/lncDIFF. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5926-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6604377 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-66043772019-07-12 lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA Li, Qian Yu, Xiaoqing Chaudhary, Ritu Slebos, Robbert J. C. Chung, Christine H. Wang, Xuefeng BMC Genomics Methodology Article BACKGROUND: Long non-coding RNA (lncRNA) expression data have been increasingly used in finding diagnostic and prognostic biomarkers in cancer studies. Existing differential analysis tools for RNA sequencing do not effectively accommodate low abundant genes, as commonly observed in lncRNAs. RESULTS: We investigated the statistical distribution of normalized counts for low expression genes in lncRNAs and mRNAs, and proposed a new tool lncDIFF based on the underlying distribution pattern to detect differentially expressed (DE) lncRNAs. lncDIFF adopts the generalized linear model with zero-inflated Exponential quasi-likelihood to estimate group effect on normalized counts, and employs the likelihood ratio test to detect differential expressed genes. The proposed method and tool are applicable to data processed with standard RNA-Seq preprocessing and normalization pipelines. Simulation results showed that lncDIFF was able to detect DE genes with more power and lower false discovery rate regardless of the data pattern, compared to DESeq2, edgeR, limma, zinbwave, DEsingle, and ShrinkBayes. In the analysis of a head and neck squamous cell carcinomas data, lncDIFF also appeared to have higher sensitivity in identifying novel lncRNA genes with relatively large fold change and prognostic value. CONCLUSIONS: lncDIFF is a powerful differential analysis tool for low abundance non-coding RNA expression data. This method is compatible with various existing RNA-Seq quantification and normalization tools. lncDIFF is implemented in an R package available at https://github.com/qianli10000/lncDIFF. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5926-4) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-02 /pmc/articles/PMC6604377/ /pubmed/31266446 http://dx.doi.org/10.1186/s12864-019-5926-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Li, Qian Yu, Xiaoqing Chaudhary, Ritu Slebos, Robbert J. C. Chung, Christine H. Wang, Xuefeng lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA |
title | lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA |
title_full | lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA |
title_fullStr | lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA |
title_full_unstemmed | lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA |
title_short | lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA |
title_sort | lncdiff: a novel quasi-likelihood method for differential expression analysis of non-coding rna |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6604377/ https://www.ncbi.nlm.nih.gov/pubmed/31266446 http://dx.doi.org/10.1186/s12864-019-5926-4 |
work_keys_str_mv | AT liqian lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna AT yuxiaoqing lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna AT chaudharyritu lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna AT slebosrobbertjc lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna AT chungchristineh lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna AT wangxuefeng lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna |