Cargando…

Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies

MOTIVATION: Molecular complexes play a major role in the regulation of biological pathways. The Biological Pathway Exchange format (BioPAX) facilitates the integration of data sources describing interactions some of which involving complexes. The BioPAX specification explicitly prevents complexes to...

Descripción completa

Detalles Bibliográficos
Autores principales: Juigné, Camille, Dameron, Olivier, Moreews, François, Gondret, Florence, Becker, Emmanuelle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168583/
https://www.ncbi.nlm.nih.gov/pubmed/37097895
http://dx.doi.org/10.1093/bioinformatics/btad257
_version_ 1785038884189503488
author Juigné, Camille
Dameron, Olivier
Moreews, François
Gondret, Florence
Becker, Emmanuelle
author_facet Juigné, Camille
Dameron, Olivier
Moreews, François
Gondret, Florence
Becker, Emmanuelle
author_sort Juigné, Camille
collection PubMed
description MOTIVATION: Molecular complexes play a major role in the regulation of biological pathways. The Biological Pathway Exchange format (BioPAX) facilitates the integration of data sources describing interactions some of which involving complexes. The BioPAX specification explicitly prevents complexes to have any component that is another complex (unless this component is a black-box complex whose composition is unknown). However, we observed that the well-curated Reactome pathway database contains such recursive complexes of complexes. We propose reproductible and semantically rich SPARQL queries for identifying and fixing invalid complexes in BioPAX databases, and evaluate the consequences of fixing these nonconformities in the Reactome database. RESULTS: For the Homo sapiens version of Reactome, we identify 5833 recursively defined complexes out of the 14 987 complexes (39%). This situation is not specific to the Human dataset, as all tested species of Reactome exhibit between 30% (Plasmodium falciparum) and 40% (Sus scrofa, Bos taurus, Canis familiaris, and Gallus gallus) of recursive complexes. As an additional consequence, the procedure also allows the detection of complex redundancies. Overall, this method improves the conformity and the automated analysis of the graph by repairing the topology of the complexes in the graph. This will allow to apply further reasoning methods on better consistent data. AVAILABILITY AND IMPLEMENTATION: We provide a Jupyter notebook detailing the analysis https://github.com/cjuigne/non_conformities_detection_biopax.
format Online
Article
Text
id pubmed-10168583
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101685832023-05-10 Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies Juigné, Camille Dameron, Olivier Moreews, François Gondret, Florence Becker, Emmanuelle Bioinformatics Original Paper MOTIVATION: Molecular complexes play a major role in the regulation of biological pathways. The Biological Pathway Exchange format (BioPAX) facilitates the integration of data sources describing interactions some of which involving complexes. The BioPAX specification explicitly prevents complexes to have any component that is another complex (unless this component is a black-box complex whose composition is unknown). However, we observed that the well-curated Reactome pathway database contains such recursive complexes of complexes. We propose reproductible and semantically rich SPARQL queries for identifying and fixing invalid complexes in BioPAX databases, and evaluate the consequences of fixing these nonconformities in the Reactome database. RESULTS: For the Homo sapiens version of Reactome, we identify 5833 recursively defined complexes out of the 14 987 complexes (39%). This situation is not specific to the Human dataset, as all tested species of Reactome exhibit between 30% (Plasmodium falciparum) and 40% (Sus scrofa, Bos taurus, Canis familiaris, and Gallus gallus) of recursive complexes. As an additional consequence, the procedure also allows the detection of complex redundancies. Overall, this method improves the conformity and the automated analysis of the graph by repairing the topology of the complexes in the graph. This will allow to apply further reasoning methods on better consistent data. AVAILABILITY AND IMPLEMENTATION: We provide a Jupyter notebook detailing the analysis https://github.com/cjuigne/non_conformities_detection_biopax. Oxford University Press 2023-04-25 /pmc/articles/PMC10168583/ /pubmed/37097895 http://dx.doi.org/10.1093/bioinformatics/btad257 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Juigné, Camille
Dameron, Olivier
Moreews, François
Gondret, Florence
Becker, Emmanuelle
Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies
title Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies
title_full Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies
title_fullStr Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies
title_full_unstemmed Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies
title_short Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies
title_sort fixing molecular complexes in biopax standards to enrich interactions and detect redundancies using semantic web technologies
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168583/
https://www.ncbi.nlm.nih.gov/pubmed/37097895
http://dx.doi.org/10.1093/bioinformatics/btad257
work_keys_str_mv AT juignecamille fixingmolecularcomplexesinbiopaxstandardstoenrichinteractionsanddetectredundanciesusingsemanticwebtechnologies
AT dameronolivier fixingmolecularcomplexesinbiopaxstandardstoenrichinteractionsanddetectredundanciesusingsemanticwebtechnologies
AT moreewsfrancois fixingmolecularcomplexesinbiopaxstandardstoenrichinteractionsanddetectredundanciesusingsemanticwebtechnologies
AT gondretflorence fixingmolecularcomplexesinbiopaxstandardstoenrichinteractionsanddetectredundanciesusingsemanticwebtechnologies
AT beckeremmanuelle fixingmolecularcomplexesinbiopaxstandardstoenrichinteractionsanddetectredundanciesusingsemanticwebtechnologies