Cargando…

CEA: Combination-based gene set functional enrichment analysis

Functional enrichment analysis is a fundamental and challenging task in bioinformatics. Most of the current enrichment analysis approaches individually evaluate functional terms and often output a list of enriched terms with high similarity and redundancy, which makes it difficult for downstream stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Duanchen, Liu, Yinliang, Zhang, Xiang-Sun, Wu, Ling-Yun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6117355/
https://www.ncbi.nlm.nih.gov/pubmed/30166636
http://dx.doi.org/10.1038/s41598-018-31396-4
_version_ 1783351741347004416
author Sun, Duanchen
Liu, Yinliang
Zhang, Xiang-Sun
Wu, Ling-Yun
author_facet Sun, Duanchen
Liu, Yinliang
Zhang, Xiang-Sun
Wu, Ling-Yun
author_sort Sun, Duanchen
collection PubMed
description Functional enrichment analysis is a fundamental and challenging task in bioinformatics. Most of the current enrichment analysis approaches individually evaluate functional terms and often output a list of enriched terms with high similarity and redundancy, which makes it difficult for downstream studies to extract the underlying biological interpretation. In this paper, we proposed a novel framework to assess the performance of combination-based enrichment analysis. Using this framework, we formulated the enrichment analysis as a multi-objective combinatorial optimization problem and developed the CEA (Combination-based Enrichment Analysis) method. CEA provides the whole landscape of term combinations; therefore, it is a good benchmark for evaluating the current state-of-the-art combination-based functional enrichment methods in a comprehensive manner. We tested the effectiveness of CEA on four published microarray datasets. Enriched functional terms identified by CEA not only involve crucial biological processes of related diseases, but also have much less redundancy and can serve as a preferable representation for the enriched terms found by traditional single-term-based methods. CEA has been implemented in the R package CopTea and is available at http://github.com/wulingyun/CopTea/.
format Online
Article
Text
id pubmed-6117355
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-61173552018-09-05 CEA: Combination-based gene set functional enrichment analysis Sun, Duanchen Liu, Yinliang Zhang, Xiang-Sun Wu, Ling-Yun Sci Rep Article Functional enrichment analysis is a fundamental and challenging task in bioinformatics. Most of the current enrichment analysis approaches individually evaluate functional terms and often output a list of enriched terms with high similarity and redundancy, which makes it difficult for downstream studies to extract the underlying biological interpretation. In this paper, we proposed a novel framework to assess the performance of combination-based enrichment analysis. Using this framework, we formulated the enrichment analysis as a multi-objective combinatorial optimization problem and developed the CEA (Combination-based Enrichment Analysis) method. CEA provides the whole landscape of term combinations; therefore, it is a good benchmark for evaluating the current state-of-the-art combination-based functional enrichment methods in a comprehensive manner. We tested the effectiveness of CEA on four published microarray datasets. Enriched functional terms identified by CEA not only involve crucial biological processes of related diseases, but also have much less redundancy and can serve as a preferable representation for the enriched terms found by traditional single-term-based methods. CEA has been implemented in the R package CopTea and is available at http://github.com/wulingyun/CopTea/. Nature Publishing Group UK 2018-08-30 /pmc/articles/PMC6117355/ /pubmed/30166636 http://dx.doi.org/10.1038/s41598-018-31396-4 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sun, Duanchen
Liu, Yinliang
Zhang, Xiang-Sun
Wu, Ling-Yun
CEA: Combination-based gene set functional enrichment analysis
title CEA: Combination-based gene set functional enrichment analysis
title_full CEA: Combination-based gene set functional enrichment analysis
title_fullStr CEA: Combination-based gene set functional enrichment analysis
title_full_unstemmed CEA: Combination-based gene set functional enrichment analysis
title_short CEA: Combination-based gene set functional enrichment analysis
title_sort cea: combination-based gene set functional enrichment analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6117355/
https://www.ncbi.nlm.nih.gov/pubmed/30166636
http://dx.doi.org/10.1038/s41598-018-31396-4
work_keys_str_mv AT sunduanchen ceacombinationbasedgenesetfunctionalenrichmentanalysis
AT liuyinliang ceacombinationbasedgenesetfunctionalenrichmentanalysis
AT zhangxiangsun ceacombinationbasedgenesetfunctionalenrichmentanalysis
AT wulingyun ceacombinationbasedgenesetfunctionalenrichmentanalysis