Cargando…

Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis

Over-representation analysis (ORA) is one of the commonest pathway analysis approaches used for the functional interpretation of metabolomics datasets. Despite the widespread use of ORA in metabolomics, the community lacks guidelines detailing its best-practice use. Many factors have a pronounced im...

Descripción completa

Detalles Bibliográficos
Autores principales: Wieder, Cecilia, Frainay, Clément, Poupin, Nathalie, Rodríguez-Mier, Pablo, Vinson, Florence, Cooke, Juliette, Lai, Rachel PJ, Bundy, Jacob G., Jourdan, Fabien, Ebbels, Timothy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8448349/
https://www.ncbi.nlm.nih.gov/pubmed/34492007
http://dx.doi.org/10.1371/journal.pcbi.1009105
_version_ 1784569221120786432
author Wieder, Cecilia
Frainay, Clément
Poupin, Nathalie
Rodríguez-Mier, Pablo
Vinson, Florence
Cooke, Juliette
Lai, Rachel PJ
Bundy, Jacob G.
Jourdan, Fabien
Ebbels, Timothy
author_facet Wieder, Cecilia
Frainay, Clément
Poupin, Nathalie
Rodríguez-Mier, Pablo
Vinson, Florence
Cooke, Juliette
Lai, Rachel PJ
Bundy, Jacob G.
Jourdan, Fabien
Ebbels, Timothy
author_sort Wieder, Cecilia
collection PubMed
description Over-representation analysis (ORA) is one of the commonest pathway analysis approaches used for the functional interpretation of metabolomics datasets. Despite the widespread use of ORA in metabolomics, the community lacks guidelines detailing its best-practice use. Many factors have a pronounced impact on the results, but to date their effects have received little systematic attention. Using five publicly available datasets, we demonstrated that changes in parameters such as the background set, differential metabolite selection methods, and pathway database used can result in profoundly different ORA results. The use of a non-assay-specific background set, for example, resulted in large numbers of false-positive pathways. Pathway database choice, evaluated using three of the most popular metabolic pathway databases (KEGG, Reactome, and BioCyc), led to vastly different results in both the number and function of significantly enriched pathways. Factors that are specific to metabolomics data, such as the reliability of compound identification and the chemical bias of different analytical platforms also impacted ORA results. Simulated metabolite misidentification rates as low as 4% resulted in both gain of false-positive pathways and loss of truly significant pathways across all datasets. Our results have several practical implications for ORA users, as well as those using alternative pathway analysis methods. We offer a set of recommendations for the use of ORA in metabolomics, alongside a set of minimal reporting guidelines, as a first step towards the standardisation of pathway analysis in metabolomics.
format Online
Article
Text
id pubmed-8448349
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-84483492021-09-18 Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis Wieder, Cecilia Frainay, Clément Poupin, Nathalie Rodríguez-Mier, Pablo Vinson, Florence Cooke, Juliette Lai, Rachel PJ Bundy, Jacob G. Jourdan, Fabien Ebbels, Timothy PLoS Comput Biol Research Article Over-representation analysis (ORA) is one of the commonest pathway analysis approaches used for the functional interpretation of metabolomics datasets. Despite the widespread use of ORA in metabolomics, the community lacks guidelines detailing its best-practice use. Many factors have a pronounced impact on the results, but to date their effects have received little systematic attention. Using five publicly available datasets, we demonstrated that changes in parameters such as the background set, differential metabolite selection methods, and pathway database used can result in profoundly different ORA results. The use of a non-assay-specific background set, for example, resulted in large numbers of false-positive pathways. Pathway database choice, evaluated using three of the most popular metabolic pathway databases (KEGG, Reactome, and BioCyc), led to vastly different results in both the number and function of significantly enriched pathways. Factors that are specific to metabolomics data, such as the reliability of compound identification and the chemical bias of different analytical platforms also impacted ORA results. Simulated metabolite misidentification rates as low as 4% resulted in both gain of false-positive pathways and loss of truly significant pathways across all datasets. Our results have several practical implications for ORA users, as well as those using alternative pathway analysis methods. We offer a set of recommendations for the use of ORA in metabolomics, alongside a set of minimal reporting guidelines, as a first step towards the standardisation of pathway analysis in metabolomics. Public Library of Science 2021-09-07 /pmc/articles/PMC8448349/ /pubmed/34492007 http://dx.doi.org/10.1371/journal.pcbi.1009105 Text en © 2021 Wieder et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wieder, Cecilia
Frainay, Clément
Poupin, Nathalie
Rodríguez-Mier, Pablo
Vinson, Florence
Cooke, Juliette
Lai, Rachel PJ
Bundy, Jacob G.
Jourdan, Fabien
Ebbels, Timothy
Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis
title Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis
title_full Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis
title_fullStr Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis
title_full_unstemmed Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis
title_short Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis
title_sort pathway analysis in metabolomics: recommendations for the use of over-representation analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8448349/
https://www.ncbi.nlm.nih.gov/pubmed/34492007
http://dx.doi.org/10.1371/journal.pcbi.1009105
work_keys_str_mv AT wiedercecilia pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT frainayclement pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT poupinnathalie pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT rodriguezmierpablo pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT vinsonflorence pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT cookejuliette pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT lairachelpj pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT bundyjacobg pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT jourdanfabien pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis
AT ebbelstimothy pathwayanalysisinmetabolomicsrecommendationsfortheuseofoverrepresentationanalysis