Cargando…

Improving annotation propagation on molecular networks through random walks: introducing ChemWalker

MOTIVATION: Annotation of the mass signals is still the biggest bottleneck for the untargeted mass spectrometry analysis of complex mixtures. Molecular networks are being increasingly adopted by the mass spectrometry community as a tool to annotate large-scale experiments. We have previously shown t...

Descripción completa

Detalles Bibliográficos
Autores principales: Borelli, Tiago Cabral, Arini, Gabriel Santos, Feitosa, Luís G P, Dorrestein, Pieter C, Lopes, Norberto Peporine, da Silva, Ricardo R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991053/
https://www.ncbi.nlm.nih.gov/pubmed/36864626
http://dx.doi.org/10.1093/bioinformatics/btad078
_version_ 1784902063021359104
author Borelli, Tiago Cabral
Arini, Gabriel Santos
Feitosa, Luís G P
Dorrestein, Pieter C
Lopes, Norberto Peporine
da Silva, Ricardo R
author_facet Borelli, Tiago Cabral
Arini, Gabriel Santos
Feitosa, Luís G P
Dorrestein, Pieter C
Lopes, Norberto Peporine
da Silva, Ricardo R
author_sort Borelli, Tiago Cabral
collection PubMed
description MOTIVATION: Annotation of the mass signals is still the biggest bottleneck for the untargeted mass spectrometry analysis of complex mixtures. Molecular networks are being increasingly adopted by the mass spectrometry community as a tool to annotate large-scale experiments. We have previously shown that the process of propagating annotations from spectral library matches on molecular networks can be automated using Network Annotation Propagation (NAP). One of the limitations of NAP is that the information for the spectral matches is only propagated locally, to the first neighbor of a spectral match. Here, we show that annotation propagation can be expanded to nodes not directly connected to spectral matches using random walks on graphs, introducing the ChemWalker python library. RESULTS: Similarly to NAP, ChemWalker relies on combinatorial in silico fragmentation results, performed by MetFrag, searching biologically relevant databases. Departing from the combination of a spectral network and the structural similarity among candidate structures, we have used MetFusion Scoring function to create a weight function, producing a weighted graph. This graph was subsequently used by the random walk to calculate the probability of ‘walking’ through a set of candidates, departing from seed nodes (represented by spectral library matches). This approach allowed the information propagation to nodes not directly connected to the spectral library match. Compared with NAP, ChemWalker has a series of improvements, on running time, scalability and maintainability and is available as a standalone python package. AVAILABILITY AND IMPLEMENTATION: ChemWalker is freely available at https://github.com/computational-chemical-biology/ChemWalker CONTACT: ridasilva@usp.br SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9991053
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99910532023-03-08 Improving annotation propagation on molecular networks through random walks: introducing ChemWalker Borelli, Tiago Cabral Arini, Gabriel Santos Feitosa, Luís G P Dorrestein, Pieter C Lopes, Norberto Peporine da Silva, Ricardo R Bioinformatics Applications Note MOTIVATION: Annotation of the mass signals is still the biggest bottleneck for the untargeted mass spectrometry analysis of complex mixtures. Molecular networks are being increasingly adopted by the mass spectrometry community as a tool to annotate large-scale experiments. We have previously shown that the process of propagating annotations from spectral library matches on molecular networks can be automated using Network Annotation Propagation (NAP). One of the limitations of NAP is that the information for the spectral matches is only propagated locally, to the first neighbor of a spectral match. Here, we show that annotation propagation can be expanded to nodes not directly connected to spectral matches using random walks on graphs, introducing the ChemWalker python library. RESULTS: Similarly to NAP, ChemWalker relies on combinatorial in silico fragmentation results, performed by MetFrag, searching biologically relevant databases. Departing from the combination of a spectral network and the structural similarity among candidate structures, we have used MetFusion Scoring function to create a weight function, producing a weighted graph. This graph was subsequently used by the random walk to calculate the probability of ‘walking’ through a set of candidates, departing from seed nodes (represented by spectral library matches). This approach allowed the information propagation to nodes not directly connected to the spectral library match. Compared with NAP, ChemWalker has a series of improvements, on running time, scalability and maintainability and is available as a standalone python package. AVAILABILITY AND IMPLEMENTATION: ChemWalker is freely available at https://github.com/computational-chemical-biology/ChemWalker CONTACT: ridasilva@usp.br SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-03-02 /pmc/articles/PMC9991053/ /pubmed/36864626 http://dx.doi.org/10.1093/bioinformatics/btad078 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Borelli, Tiago Cabral
Arini, Gabriel Santos
Feitosa, Luís G P
Dorrestein, Pieter C
Lopes, Norberto Peporine
da Silva, Ricardo R
Improving annotation propagation on molecular networks through random walks: introducing ChemWalker
title Improving annotation propagation on molecular networks through random walks: introducing ChemWalker
title_full Improving annotation propagation on molecular networks through random walks: introducing ChemWalker
title_fullStr Improving annotation propagation on molecular networks through random walks: introducing ChemWalker
title_full_unstemmed Improving annotation propagation on molecular networks through random walks: introducing ChemWalker
title_short Improving annotation propagation on molecular networks through random walks: introducing ChemWalker
title_sort improving annotation propagation on molecular networks through random walks: introducing chemwalker
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991053/
https://www.ncbi.nlm.nih.gov/pubmed/36864626
http://dx.doi.org/10.1093/bioinformatics/btad078
work_keys_str_mv AT borellitiagocabral improvingannotationpropagationonmolecularnetworksthroughrandomwalksintroducingchemwalker
AT arinigabrielsantos improvingannotationpropagationonmolecularnetworksthroughrandomwalksintroducingchemwalker
AT feitosaluisgp improvingannotationpropagationonmolecularnetworksthroughrandomwalksintroducingchemwalker
AT dorresteinpieterc improvingannotationpropagationonmolecularnetworksthroughrandomwalksintroducingchemwalker
AT lopesnorbertopeporine improvingannotationpropagationonmolecularnetworksthroughrandomwalksintroducingchemwalker
AT dasilvaricardor improvingannotationpropagationonmolecularnetworksthroughrandomwalksintroducingchemwalker