Cargando…
Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration
For accurate gene expression quantification, normalization of gene expression data against reliable reference genes is required. It is known that the expression levels of commonly used reference genes vary considerably under different experimental conditions, and therefore, their use for data normal...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140892/ https://www.ncbi.nlm.nih.gov/pubmed/32213971 http://dx.doi.org/10.3390/cells9030786 |
_version_ | 1783519093161197568 |
---|---|
author | Tao, Jingxin Hao, Youjin Li, Xudong Yin, Huachun Nie, Xiner Zhang, Jie Xu, Boying Chen, Qiao Li, Bo |
author_facet | Tao, Jingxin Hao, Youjin Li, Xudong Yin, Huachun Nie, Xiner Zhang, Jie Xu, Boying Chen, Qiao Li, Bo |
author_sort | Tao, Jingxin |
collection | PubMed |
description | For accurate gene expression quantification, normalization of gene expression data against reliable reference genes is required. It is known that the expression levels of commonly used reference genes vary considerably under different experimental conditions, and therefore, their use for data normalization is limited. In this study, an unbiased identification of reference genes in Caenorhabditis elegans was performed based on 145 microarray datasets (2296 gene array samples) covering different developmental stages, different tissues, drug treatments, lifestyle, and various stresses. As a result, thirteen housekeeping genes (rps-23, rps-26, rps-27, rps-16, rps-2, rps-4, rps-17, rpl-24.1, rpl-27, rpl-33, rpl-36, rpl-35, and rpl-15) with enhanced stability were comprehensively identified by using six popular normalization algorithms and RankAggreg method. Functional enrichment analysis revealed that these genes were significantly overrepresented in GO terms or KEGG pathways related to ribosomes. Validation analysis using recently published datasets revealed that the expressions of newly identified candidate reference genes were more stable than the commonly used reference genes. Based on the results, we recommended using rpl-33 and rps-26 as the optimal reference genes for microarray and rps-2 and rps-4 for RNA-sequencing data validation. More importantly, the most stable rps-23 should be a promising reference gene for both data types. This study, for the first time, successfully displays a large-scale microarray data driven genome-wide identification of stable reference genes for normalizing gene expression data and provides a potential guideline on the selection of universal internal reference genes in C. elegans, for quantitative gene expression analysis. |
format | Online Article Text |
id | pubmed-7140892 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-71408922020-04-10 Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration Tao, Jingxin Hao, Youjin Li, Xudong Yin, Huachun Nie, Xiner Zhang, Jie Xu, Boying Chen, Qiao Li, Bo Cells Article For accurate gene expression quantification, normalization of gene expression data against reliable reference genes is required. It is known that the expression levels of commonly used reference genes vary considerably under different experimental conditions, and therefore, their use for data normalization is limited. In this study, an unbiased identification of reference genes in Caenorhabditis elegans was performed based on 145 microarray datasets (2296 gene array samples) covering different developmental stages, different tissues, drug treatments, lifestyle, and various stresses. As a result, thirteen housekeeping genes (rps-23, rps-26, rps-27, rps-16, rps-2, rps-4, rps-17, rpl-24.1, rpl-27, rpl-33, rpl-36, rpl-35, and rpl-15) with enhanced stability were comprehensively identified by using six popular normalization algorithms and RankAggreg method. Functional enrichment analysis revealed that these genes were significantly overrepresented in GO terms or KEGG pathways related to ribosomes. Validation analysis using recently published datasets revealed that the expressions of newly identified candidate reference genes were more stable than the commonly used reference genes. Based on the results, we recommended using rpl-33 and rps-26 as the optimal reference genes for microarray and rps-2 and rps-4 for RNA-sequencing data validation. More importantly, the most stable rps-23 should be a promising reference gene for both data types. This study, for the first time, successfully displays a large-scale microarray data driven genome-wide identification of stable reference genes for normalizing gene expression data and provides a potential guideline on the selection of universal internal reference genes in C. elegans, for quantitative gene expression analysis. MDPI 2020-03-24 /pmc/articles/PMC7140892/ /pubmed/32213971 http://dx.doi.org/10.3390/cells9030786 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tao, Jingxin Hao, Youjin Li, Xudong Yin, Huachun Nie, Xiner Zhang, Jie Xu, Boying Chen, Qiao Li, Bo Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration |
title | Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration |
title_full | Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration |
title_fullStr | Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration |
title_full_unstemmed | Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration |
title_short | Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration |
title_sort | systematic identification of housekeeping genes possibly used as references in caenorhabditis elegans by large-scale data integration |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140892/ https://www.ncbi.nlm.nih.gov/pubmed/32213971 http://dx.doi.org/10.3390/cells9030786 |
work_keys_str_mv | AT taojingxin systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT haoyoujin systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT lixudong systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT yinhuachun systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT niexiner systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT zhangjie systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT xuboying systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT chenqiao systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration AT libo systematicidentificationofhousekeepinggenespossiblyusedasreferencesincaenorhabditiselegansbylargescaledataintegration |