Cargando…
Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates
A common challenge in analysis of transcriptomic data is to identify differentially expressed genes, i.e., genes whose mean transcript abundance levels differ across the levels of a factor of scientific interest. Transcript abundance levels can be measured simultaneously for thousands of genes in mu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4666287/ https://www.ncbi.nlm.nih.gov/pubmed/26660449 http://dx.doi.org/10.1007/s13253-015-0226-1 |
_version_ | 1782403689125773312 |
---|---|
author | Nguyen, Yet Nettleton, Dan Liu, Haibo Tuggle, Christopher K. |
author_facet | Nguyen, Yet Nettleton, Dan Liu, Haibo Tuggle, Christopher K. |
author_sort | Nguyen, Yet |
collection | PubMed |
description | A common challenge in analysis of transcriptomic data is to identify differentially expressed genes, i.e., genes whose mean transcript abundance levels differ across the levels of a factor of scientific interest. Transcript abundance levels can be measured simultaneously for thousands of genes in multiple biological samples using RNA sequencing (RNA-seq) technology. Part of the variation in RNA-seq measures of transcript abundance may be associated with variation in continuous and/or categorical covariates measured for each experimental unit or RNA sample. Ignoring relevant covariates or modeling the effects of irrelevant covariates can be detrimental to identifying differentially expressed genes. We propose a backward selection strategy for selecting a set of covariates whose effects are accounted for when searching for differentially expressed genes. We illustrate our approach through the analysis of an RNA-seq study intended to identify genes differentially expressed between two lines of pigs divergently selected for residual feed intake. We use simulation to show the advantages of our backward selection procedure over alternative strategies that either ignore or adjust for all measured covariates. |
format | Online Article Text |
id | pubmed-4666287 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-46662872015-12-09 Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates Nguyen, Yet Nettleton, Dan Liu, Haibo Tuggle, Christopher K. J Agric Biol Environ Stat Article A common challenge in analysis of transcriptomic data is to identify differentially expressed genes, i.e., genes whose mean transcript abundance levels differ across the levels of a factor of scientific interest. Transcript abundance levels can be measured simultaneously for thousands of genes in multiple biological samples using RNA sequencing (RNA-seq) technology. Part of the variation in RNA-seq measures of transcript abundance may be associated with variation in continuous and/or categorical covariates measured for each experimental unit or RNA sample. Ignoring relevant covariates or modeling the effects of irrelevant covariates can be detrimental to identifying differentially expressed genes. We propose a backward selection strategy for selecting a set of covariates whose effects are accounted for when searching for differentially expressed genes. We illustrate our approach through the analysis of an RNA-seq study intended to identify genes differentially expressed between two lines of pigs divergently selected for residual feed intake. We use simulation to show the advantages of our backward selection procedure over alternative strategies that either ignore or adjust for all measured covariates. Springer US 2015-10-01 2015 /pmc/articles/PMC4666287/ /pubmed/26660449 http://dx.doi.org/10.1007/s13253-015-0226-1 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Article Nguyen, Yet Nettleton, Dan Liu, Haibo Tuggle, Christopher K. Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates |
title | Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates |
title_full | Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates |
title_fullStr | Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates |
title_full_unstemmed | Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates |
title_short | Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates |
title_sort | detecting differentially expressed genes with rna-seq data using backward selection to account for the effects of relevant covariates |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4666287/ https://www.ncbi.nlm.nih.gov/pubmed/26660449 http://dx.doi.org/10.1007/s13253-015-0226-1 |
work_keys_str_mv | AT nguyenyet detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates AT nettletondan detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates AT liuhaibo detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates AT tugglechristopherk detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates |