Cargando…

lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA

BACKGROUND: Long non-coding RNA (lncRNA) expression data have been increasingly used in finding diagnostic and prognostic biomarkers in cancer studies. Existing differential analysis tools for RNA sequencing do not effectively accommodate low abundant genes, as commonly observed in lncRNAs. RESULTS:...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Qian, Yu, Xiaoqing, Chaudhary, Ritu, Slebos, Robbert J. C., Chung, Christine H., Wang, Xuefeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6604377/
https://www.ncbi.nlm.nih.gov/pubmed/31266446
http://dx.doi.org/10.1186/s12864-019-5926-4
_version_ 1783431700757348352
author Li, Qian
Yu, Xiaoqing
Chaudhary, Ritu
Slebos, Robbert J. C.
Chung, Christine H.
Wang, Xuefeng
author_facet Li, Qian
Yu, Xiaoqing
Chaudhary, Ritu
Slebos, Robbert J. C.
Chung, Christine H.
Wang, Xuefeng
author_sort Li, Qian
collection PubMed
description BACKGROUND: Long non-coding RNA (lncRNA) expression data have been increasingly used in finding diagnostic and prognostic biomarkers in cancer studies. Existing differential analysis tools for RNA sequencing do not effectively accommodate low abundant genes, as commonly observed in lncRNAs. RESULTS: We investigated the statistical distribution of normalized counts for low expression genes in lncRNAs and mRNAs, and proposed a new tool lncDIFF based on the underlying distribution pattern to detect differentially expressed (DE) lncRNAs. lncDIFF adopts the generalized linear model with zero-inflated Exponential quasi-likelihood to estimate group effect on normalized counts, and employs the likelihood ratio test to detect differential expressed genes. The proposed method and tool are applicable to data processed with standard RNA-Seq preprocessing and normalization pipelines. Simulation results showed that lncDIFF was able to detect DE genes with more power and lower false discovery rate regardless of the data pattern, compared to DESeq2, edgeR, limma, zinbwave, DEsingle, and ShrinkBayes. In the analysis of a head and neck squamous cell carcinomas data, lncDIFF also appeared to have higher sensitivity in identifying novel lncRNA genes with relatively large fold change and prognostic value. CONCLUSIONS: lncDIFF is a powerful differential analysis tool for low abundance non-coding RNA expression data. This method is compatible with various existing RNA-Seq quantification and normalization tools. lncDIFF is implemented in an R package available at https://github.com/qianli10000/lncDIFF. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5926-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6604377
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66043772019-07-12 lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA Li, Qian Yu, Xiaoqing Chaudhary, Ritu Slebos, Robbert J. C. Chung, Christine H. Wang, Xuefeng BMC Genomics Methodology Article BACKGROUND: Long non-coding RNA (lncRNA) expression data have been increasingly used in finding diagnostic and prognostic biomarkers in cancer studies. Existing differential analysis tools for RNA sequencing do not effectively accommodate low abundant genes, as commonly observed in lncRNAs. RESULTS: We investigated the statistical distribution of normalized counts for low expression genes in lncRNAs and mRNAs, and proposed a new tool lncDIFF based on the underlying distribution pattern to detect differentially expressed (DE) lncRNAs. lncDIFF adopts the generalized linear model with zero-inflated Exponential quasi-likelihood to estimate group effect on normalized counts, and employs the likelihood ratio test to detect differential expressed genes. The proposed method and tool are applicable to data processed with standard RNA-Seq preprocessing and normalization pipelines. Simulation results showed that lncDIFF was able to detect DE genes with more power and lower false discovery rate regardless of the data pattern, compared to DESeq2, edgeR, limma, zinbwave, DEsingle, and ShrinkBayes. In the analysis of a head and neck squamous cell carcinomas data, lncDIFF also appeared to have higher sensitivity in identifying novel lncRNA genes with relatively large fold change and prognostic value. CONCLUSIONS: lncDIFF is a powerful differential analysis tool for low abundance non-coding RNA expression data. This method is compatible with various existing RNA-Seq quantification and normalization tools. lncDIFF is implemented in an R package available at https://github.com/qianli10000/lncDIFF. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5926-4) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-02 /pmc/articles/PMC6604377/ /pubmed/31266446 http://dx.doi.org/10.1186/s12864-019-5926-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Li, Qian
Yu, Xiaoqing
Chaudhary, Ritu
Slebos, Robbert J. C.
Chung, Christine H.
Wang, Xuefeng
lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA
title lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA
title_full lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA
title_fullStr lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA
title_full_unstemmed lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA
title_short lncDIFF: a novel quasi-likelihood method for differential expression analysis of non-coding RNA
title_sort lncdiff: a novel quasi-likelihood method for differential expression analysis of non-coding rna
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6604377/
https://www.ncbi.nlm.nih.gov/pubmed/31266446
http://dx.doi.org/10.1186/s12864-019-5926-4
work_keys_str_mv AT liqian lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna
AT yuxiaoqing lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna
AT chaudharyritu lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna
AT slebosrobbertjc lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna
AT chungchristineh lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna
AT wangxuefeng lncdiffanovelquasilikelihoodmethodfordifferentialexpressionanalysisofnoncodingrna