Cargando…

Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments

A knowledge-based grouping of genes into pathways or functional units is essential for describing and understanding cellular complexity. However, it is not always clear a priori how and at what level of specificity functionally interconnected genes should be partitioned into pathways, for a given ap...

Descripción completa

Detalles Bibliográficos
Autores principales: Gable, Annika L, Szklarczyk, Damian, Lyon, David, Matias Rodrigues, João F, von Mering, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487593/
https://www.ncbi.nlm.nih.gov/pubmed/36088548
http://dx.doi.org/10.1093/bib/bbac355
_version_ 1784792488090796032
author Gable, Annika L
Szklarczyk, Damian
Lyon, David
Matias Rodrigues, João F
von Mering, Christian
author_facet Gable, Annika L
Szklarczyk, Damian
Lyon, David
Matias Rodrigues, João F
von Mering, Christian
author_sort Gable, Annika L
collection PubMed
description A knowledge-based grouping of genes into pathways or functional units is essential for describing and understanding cellular complexity. However, it is not always clear a priori how and at what level of specificity functionally interconnected genes should be partitioned into pathways, for a given application. Here, we assess and compare nine existing and two conceptually novel functional classification systems, with respect to their discovery power and generality in gene set enrichment testing. We base our assessment on a collection of nearly 2000 functional genomics datasets provided by users of the STRING database. With these real-life and diverse queries, we assess which systems typically provide the most specific and complete enrichment results. We find many structural and performance differences between classification systems. Overall, the well-established, hierarchically organized pathway annotation systems yield the best enrichment performance, despite covering substantial parts of the human genome in general terms only. On the other hand, the more recent unsupervised annotation systems perform strongest in understudied areas and organisms, and in detecting more specific pathways, albeit with less informative labels.
format Online
Article
Text
id pubmed-9487593
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-94875932022-09-21 Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments Gable, Annika L Szklarczyk, Damian Lyon, David Matias Rodrigues, João F von Mering, Christian Brief Bioinform Problem Solving Protocol A knowledge-based grouping of genes into pathways or functional units is essential for describing and understanding cellular complexity. However, it is not always clear a priori how and at what level of specificity functionally interconnected genes should be partitioned into pathways, for a given application. Here, we assess and compare nine existing and two conceptually novel functional classification systems, with respect to their discovery power and generality in gene set enrichment testing. We base our assessment on a collection of nearly 2000 functional genomics datasets provided by users of the STRING database. With these real-life and diverse queries, we assess which systems typically provide the most specific and complete enrichment results. We find many structural and performance differences between classification systems. Overall, the well-established, hierarchically organized pathway annotation systems yield the best enrichment performance, despite covering substantial parts of the human genome in general terms only. On the other hand, the more recent unsupervised annotation systems perform strongest in understudied areas and organisms, and in detecting more specific pathways, albeit with less informative labels. Oxford University Press 2022-09-10 /pmc/articles/PMC9487593/ /pubmed/36088548 http://dx.doi.org/10.1093/bib/bbac355 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Gable, Annika L
Szklarczyk, Damian
Lyon, David
Matias Rodrigues, João F
von Mering, Christian
Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments
title Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments
title_full Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments
title_fullStr Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments
title_full_unstemmed Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments
title_short Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments
title_sort systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487593/
https://www.ncbi.nlm.nih.gov/pubmed/36088548
http://dx.doi.org/10.1093/bib/bbac355
work_keys_str_mv AT gableannikal systematicassessmentofpathwaydatabasesbasedonadiversecollectionofusersubmittedexperiments
AT szklarczykdamian systematicassessmentofpathwaydatabasesbasedonadiversecollectionofusersubmittedexperiments
AT lyondavid systematicassessmentofpathwaydatabasesbasedonadiversecollectionofusersubmittedexperiments
AT matiasrodriguesjoaof systematicassessmentofpathwaydatabasesbasedonadiversecollectionofusersubmittedexperiments
AT vonmeringchristian systematicassessmentofpathwaydatabasesbasedonadiversecollectionofusersubmittedexperiments