Cargando…

Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates

A common challenge in analysis of transcriptomic data is to identify differentially expressed genes, i.e., genes whose mean transcript abundance levels differ across the levels of a factor of scientific interest. Transcript abundance levels can be measured simultaneously for thousands of genes in mu...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Yet, Nettleton, Dan, Liu, Haibo, Tuggle, Christopher K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4666287/
https://www.ncbi.nlm.nih.gov/pubmed/26660449
http://dx.doi.org/10.1007/s13253-015-0226-1
_version_ 1782403689125773312
author Nguyen, Yet
Nettleton, Dan
Liu, Haibo
Tuggle, Christopher K.
author_facet Nguyen, Yet
Nettleton, Dan
Liu, Haibo
Tuggle, Christopher K.
author_sort Nguyen, Yet
collection PubMed
description A common challenge in analysis of transcriptomic data is to identify differentially expressed genes, i.e., genes whose mean transcript abundance levels differ across the levels of a factor of scientific interest. Transcript abundance levels can be measured simultaneously for thousands of genes in multiple biological samples using RNA sequencing (RNA-seq) technology. Part of the variation in RNA-seq measures of transcript abundance may be associated with variation in continuous and/or categorical covariates measured for each experimental unit or RNA sample. Ignoring relevant covariates or modeling the effects of irrelevant covariates can be detrimental to identifying differentially expressed genes. We propose a backward selection strategy for selecting a set of covariates whose effects are accounted for when searching for differentially expressed genes. We illustrate our approach through the analysis of an RNA-seq study intended to identify genes differentially expressed between two lines of pigs divergently selected for residual feed intake. We use simulation to show the advantages of our backward selection procedure over alternative strategies that either ignore or adjust for all measured covariates.
format Online
Article
Text
id pubmed-4666287
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-46662872015-12-09 Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates Nguyen, Yet Nettleton, Dan Liu, Haibo Tuggle, Christopher K. J Agric Biol Environ Stat Article A common challenge in analysis of transcriptomic data is to identify differentially expressed genes, i.e., genes whose mean transcript abundance levels differ across the levels of a factor of scientific interest. Transcript abundance levels can be measured simultaneously for thousands of genes in multiple biological samples using RNA sequencing (RNA-seq) technology. Part of the variation in RNA-seq measures of transcript abundance may be associated with variation in continuous and/or categorical covariates measured for each experimental unit or RNA sample. Ignoring relevant covariates or modeling the effects of irrelevant covariates can be detrimental to identifying differentially expressed genes. We propose a backward selection strategy for selecting a set of covariates whose effects are accounted for when searching for differentially expressed genes. We illustrate our approach through the analysis of an RNA-seq study intended to identify genes differentially expressed between two lines of pigs divergently selected for residual feed intake. We use simulation to show the advantages of our backward selection procedure over alternative strategies that either ignore or adjust for all measured covariates. Springer US 2015-10-01 2015 /pmc/articles/PMC4666287/ /pubmed/26660449 http://dx.doi.org/10.1007/s13253-015-0226-1 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Nguyen, Yet
Nettleton, Dan
Liu, Haibo
Tuggle, Christopher K.
Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates
title Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates
title_full Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates
title_fullStr Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates
title_full_unstemmed Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates
title_short Detecting Differentially Expressed Genes with RNA-seq Data Using Backward Selection to Account for the Effects of Relevant Covariates
title_sort detecting differentially expressed genes with rna-seq data using backward selection to account for the effects of relevant covariates
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4666287/
https://www.ncbi.nlm.nih.gov/pubmed/26660449
http://dx.doi.org/10.1007/s13253-015-0226-1
work_keys_str_mv AT nguyenyet detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates
AT nettletondan detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates
AT liuhaibo detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates
AT tugglechristopherk detectingdifferentiallyexpressedgeneswithrnaseqdatausingbackwardselectiontoaccountfortheeffectsofrelevantcovariates