Cargando…
GSEApy: a comprehensive package for performing gene set enrichment analysis in Python
MOTIVATION: Gene set enrichment analysis (GSEA) is a commonly used algorithm for characterizing gene expression changes. However, the currently available tools used to perform GSEA have a limited ability to analyze large datasets, which is particularly problematic for the analysis of single-cell dat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805564/ https://www.ncbi.nlm.nih.gov/pubmed/36426870 http://dx.doi.org/10.1093/bioinformatics/btac757 |
_version_ | 1784862354272419840 |
---|---|
author | Fang, Zhuoqing Liu, Xinyuan Peltz, Gary |
author_facet | Fang, Zhuoqing Liu, Xinyuan Peltz, Gary |
author_sort | Fang, Zhuoqing |
collection | PubMed |
description | MOTIVATION: Gene set enrichment analysis (GSEA) is a commonly used algorithm for characterizing gene expression changes. However, the currently available tools used to perform GSEA have a limited ability to analyze large datasets, which is particularly problematic for the analysis of single-cell data. To overcome this limitation, we developed a GSEA package in Python (GSEApy), which could efficiently analyze large single-cell datasets. RESULTS: We present a package (GSEApy) that performs GSEA in either the command line or Python environment. GSEApy uses a Rust implementation to enable it to calculate the same enrichment statistic as GSEA for a collection of pathways. The Rust implementation of GSEApy is 3-fold faster than the Numpy version of GSEApy (v0.10.8) and uses >4-fold less memory. GSEApy also provides an interface between Python and Enrichr web services, as well as for BioMart. The Enrichr application programming interface enables GSEApy to perform over-representation analysis for an input gene list. Furthermore, GSEApy consists of several tools, each designed to facilitate a particular type of enrichment analysis. AVAILABILITY AND IMPLEMENTATION: The new GSEApy with Rust extension is deposited in PyPI: https://pypi.org/project/gseapy/. The GSEApy source code is freely available at https://github.com/zqfang/GSEApy. Also, the documentation website is available at https://gseapy.rtfd.io/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9805564 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98055642023-01-03 GSEApy: a comprehensive package for performing gene set enrichment analysis in Python Fang, Zhuoqing Liu, Xinyuan Peltz, Gary Bioinformatics Applications Note MOTIVATION: Gene set enrichment analysis (GSEA) is a commonly used algorithm for characterizing gene expression changes. However, the currently available tools used to perform GSEA have a limited ability to analyze large datasets, which is particularly problematic for the analysis of single-cell data. To overcome this limitation, we developed a GSEA package in Python (GSEApy), which could efficiently analyze large single-cell datasets. RESULTS: We present a package (GSEApy) that performs GSEA in either the command line or Python environment. GSEApy uses a Rust implementation to enable it to calculate the same enrichment statistic as GSEA for a collection of pathways. The Rust implementation of GSEApy is 3-fold faster than the Numpy version of GSEApy (v0.10.8) and uses >4-fold less memory. GSEApy also provides an interface between Python and Enrichr web services, as well as for BioMart. The Enrichr application programming interface enables GSEApy to perform over-representation analysis for an input gene list. Furthermore, GSEApy consists of several tools, each designed to facilitate a particular type of enrichment analysis. AVAILABILITY AND IMPLEMENTATION: The new GSEApy with Rust extension is deposited in PyPI: https://pypi.org/project/gseapy/. The GSEApy source code is freely available at https://github.com/zqfang/GSEApy. Also, the documentation website is available at https://gseapy.rtfd.io/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-11-25 /pmc/articles/PMC9805564/ /pubmed/36426870 http://dx.doi.org/10.1093/bioinformatics/btac757 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Fang, Zhuoqing Liu, Xinyuan Peltz, Gary GSEApy: a comprehensive package for performing gene set enrichment analysis in Python |
title | GSEApy: a comprehensive package for performing gene set enrichment analysis in Python |
title_full | GSEApy: a comprehensive package for performing gene set enrichment analysis in Python |
title_fullStr | GSEApy: a comprehensive package for performing gene set enrichment analysis in Python |
title_full_unstemmed | GSEApy: a comprehensive package for performing gene set enrichment analysis in Python |
title_short | GSEApy: a comprehensive package for performing gene set enrichment analysis in Python |
title_sort | gseapy: a comprehensive package for performing gene set enrichment analysis in python |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805564/ https://www.ncbi.nlm.nih.gov/pubmed/36426870 http://dx.doi.org/10.1093/bioinformatics/btac757 |
work_keys_str_mv | AT fangzhuoqing gseapyacomprehensivepackageforperforminggenesetenrichmentanalysisinpython AT liuxinyuan gseapyacomprehensivepackageforperforminggenesetenrichmentanalysisinpython AT peltzgary gseapyacomprehensivepackageforperforminggenesetenrichmentanalysisinpython |