Cargando…

Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration

For accurate gene expression quantification, normalization of gene expression data against reliable reference genes is required. It is known that the expression levels of commonly used reference genes vary considerably under different experimental conditions, and therefore, their use for data normal...

Descripción completa

Detalles Bibliográficos
Autores principales: Tao, Jingxin, Hao, Youjin, Li, Xudong, Yin, Huachun, Nie, Xiner, Zhang, Jie, Xu, Boying, Chen, Qiao, Li, Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140892/
https://www.ncbi.nlm.nih.gov/pubmed/32213971
http://dx.doi.org/10.3390/cells9030786
_version_ 1783519093161197568
author Tao, Jingxin
Hao, Youjin
Li, Xudong
Yin, Huachun
Nie, Xiner
Zhang, Jie
Xu, Boying
Chen, Qiao
Li, Bo
author_facet Tao, Jingxin
Hao, Youjin
Li, Xudong
Yin, Huachun
Nie, Xiner
Zhang, Jie
Xu, Boying
Chen, Qiao
Li, Bo
author_sort Tao, Jingxin
collection PubMed
description For accurate gene expression quantification, normalization of gene expression data against reliable reference genes is required. It is known that the expression levels of commonly used reference genes vary considerably under different experimental conditions, and therefore, their use for data normalization is limited. In this study, an unbiased identification of reference genes in Caenorhabditis elegans was performed based on 145 microarray datasets (2296 gene array samples) covering different developmental stages, different tissues, drug treatments, lifestyle, and various stresses. As a result, thirteen housekeeping genes (rps-23, rps-26, rps-27, rps-16, rps-2, rps-4, rps-17, rpl-24.1, rpl-27, rpl-33, rpl-36, rpl-35, and rpl-15) with enhanced stability were comprehensively identified by using six popular normalization algorithms and RankAggreg method. Functional enrichment analysis revealed that these genes were significantly overrepresented in GO terms or KEGG pathways related to ribosomes. Validation analysis using recently published datasets revealed that the expressions of newly identified candidate reference genes were more stable than the commonly used reference genes. Based on the results, we recommended using rpl-33 and rps-26 as the optimal reference genes for microarray and rps-2 and rps-4 for RNA-sequencing data validation. More importantly, the most stable rps-23 should be a promising reference gene for both data types. This study, for the first time, successfully displays a large-scale microarray data driven genome-wide identification of stable reference genes for normalizing gene expression data and provides a potential guideline on the selection of universal internal reference genes in C. elegans, for quantitative gene expression analysis.
format Online
Article
Text
id pubmed-7140892
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71408922020-04-10 Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration Tao, Jingxin Hao, Youjin Li, Xudong Yin, Huachun Nie, Xiner Zhang, Jie Xu, Boying Chen, Qiao Li, Bo Cells Article For accurate gene expression quantification, normalization of gene expression data against reliable reference genes is required. It is known that the expression levels of commonly used reference genes vary considerably under different experimental conditions, and therefore, their use for data normalization is limited. In this study, an unbiased identification of reference genes in Caenorhabditis elegans was performed based on 145 microarray datasets (2296 gene array samples) covering different developmental stages, different tissues, drug treatments, lifestyle, and various stresses. As a result, thirteen housekeeping genes (rps-23, rps-26, rps-27, rps-16, rps-2, rps-4, rps-17, rpl-24.1, rpl-27, rpl-33, rpl-36, rpl-35, and rpl-15) with enhanced stability were comprehensively identified by using six popular normalization algorithms and RankAggreg method. Functional enrichment analysis revealed that these genes were significantly overrepresented in GO terms or KEGG pathways related to ribosomes. Validation analysis using recently published datasets revealed that the expressions of newly identified candidate reference genes were more stable than the commonly used reference genes. Based on the results, we recommended using rpl-33 and rps-26 as the optimal reference genes for microarray and rps-2 and rps-4 for RNA-sequencing data validation. More importantly, the most stable rps-23 should be a promising reference gene for both data types. This study, for the first time, successfully displays a large-scale microarray data driven genome-wide identification of stable reference genes for normalizing gene expression data and provides a potential guideline on the selection of universal internal reference genes in C. elegans, for quantitative gene expression analysis. MDPI 2020-03-24 /pmc/articles/PMC7140892/ /pubmed/32213971 http://dx.doi.org/10.3390/cells9030786 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tao, Jingxin
Hao, Youjin
Li, Xudong
Yin, Huachun
Nie, Xiner
Zhang, Jie
Xu, Boying
Chen, Qiao
Li, Bo
Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration
title Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration
title_full Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration
title_fullStr Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration
title_full_unstemmed Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration
title_short Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration
title_sort systematic identification of housekeeping genes possibly used as references in caenorhabditis elegans by large-scale data integration
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140892/
https://www.ncbi.nlm.nih.gov/pubmed/32213971
http://dx.doi.org/10.3390/cells9030786
work_keys_str_mv AT taojingxin systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT haoyoujin systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT lixudong systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT yinhuachun systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT niexiner systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT zhangjie systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT xuboying systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT chenqiao systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration
AT libo systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration