Cargando…

Tissue enrichment analysis for C. elegans genomics

BACKGROUND: Over the last ten years, there has been explosive development in methods for measuring gene expression. These methods can identify thousands of genes altered between conditions, but understanding these datasets and forming hypotheses based on them remains challenging. One way to analyze...

Descripción completa

Detalles Bibliográficos
Autores principales: Angeles-Albores, David, N. Lee, Raymond Y., Chan, Juancarlos, Sternberg, Paul W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020436/
https://www.ncbi.nlm.nih.gov/pubmed/27618863
http://dx.doi.org/10.1186/s12859-016-1229-9
_version_ 1782453200551411712
author Angeles-Albores, David
N. Lee, Raymond Y.
Chan, Juancarlos
Sternberg, Paul W.
author_facet Angeles-Albores, David
N. Lee, Raymond Y.
Chan, Juancarlos
Sternberg, Paul W.
author_sort Angeles-Albores, David
collection PubMed
description BACKGROUND: Over the last ten years, there has been explosive development in methods for measuring gene expression. These methods can identify thousands of genes altered between conditions, but understanding these datasets and forming hypotheses based on them remains challenging. One way to analyze these datasets is to associate ontologies (hierarchical, descriptive vocabularies with controlled relations between terms) with genes and to look for enrichment of specific terms. Although Gene Ontology (GO) is available for Caenorhabditis elegans, it does not include anatomical information. RESULTS: We have developed a tool for identifying enrichment of C. elegans tissues among gene sets and generated a website GUI where users can access this tool. Since a common drawback to ontology enrichment analyses is its verbosity, we developed a very simple filtering algorithm to reduce the ontology size by an order of magnitude. We adjusted these filters and validated our tool using a set of 30 gold standards from Expression Cluster data in WormBase. We show our tool can even discriminate between embryonic and larval tissues and can even identify tissues down to the single-cell level. We used our tool to identify multiple neuronal tissues that are down-regulated due to pathogen infection in C. elegans. CONCLUSIONS: Our Tissue Enrichment Analysis (TEA) can be found within WormBase, and can be downloaded using Python’s standard pip installer. It tests a slimmed-down C. elegans tissue ontology for enrichment of specific terms and provides users with a text and graphic representation of the results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1229-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5020436
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50204362016-09-20 Tissue enrichment analysis for C. elegans genomics Angeles-Albores, David N. Lee, Raymond Y. Chan, Juancarlos Sternberg, Paul W. BMC Bioinformatics Methodology Article BACKGROUND: Over the last ten years, there has been explosive development in methods for measuring gene expression. These methods can identify thousands of genes altered between conditions, but understanding these datasets and forming hypotheses based on them remains challenging. One way to analyze these datasets is to associate ontologies (hierarchical, descriptive vocabularies with controlled relations between terms) with genes and to look for enrichment of specific terms. Although Gene Ontology (GO) is available for Caenorhabditis elegans, it does not include anatomical information. RESULTS: We have developed a tool for identifying enrichment of C. elegans tissues among gene sets and generated a website GUI where users can access this tool. Since a common drawback to ontology enrichment analyses is its verbosity, we developed a very simple filtering algorithm to reduce the ontology size by an order of magnitude. We adjusted these filters and validated our tool using a set of 30 gold standards from Expression Cluster data in WormBase. We show our tool can even discriminate between embryonic and larval tissues and can even identify tissues down to the single-cell level. We used our tool to identify multiple neuronal tissues that are down-regulated due to pathogen infection in C. elegans. CONCLUSIONS: Our Tissue Enrichment Analysis (TEA) can be found within WormBase, and can be downloaded using Python’s standard pip installer. It tests a slimmed-down C. elegans tissue ontology for enrichment of specific terms and provides users with a text and graphic representation of the results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1229-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-13 /pmc/articles/PMC5020436/ /pubmed/27618863 http://dx.doi.org/10.1186/s12859-016-1229-9 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Angeles-Albores, David
N. Lee, Raymond Y.
Chan, Juancarlos
Sternberg, Paul W.
Tissue enrichment analysis for C. elegans genomics
title Tissue enrichment analysis for C. elegans genomics
title_full Tissue enrichment analysis for C. elegans genomics
title_fullStr Tissue enrichment analysis for C. elegans genomics
title_full_unstemmed Tissue enrichment analysis for C. elegans genomics
title_short Tissue enrichment analysis for C. elegans genomics
title_sort tissue enrichment analysis for c. elegans genomics
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020436/
https://www.ncbi.nlm.nih.gov/pubmed/27618863
http://dx.doi.org/10.1186/s12859-016-1229-9
work_keys_str_mv AT angelesalboresdavid tissueenrichmentanalysisforcelegansgenomics
AT nleeraymondy tissueenrichmentanalysisforcelegansgenomics
AT chanjuancarlos tissueenrichmentanalysisforcelegansgenomics
AT sternbergpaulw tissueenrichmentanalysisforcelegansgenomics