Cargando…

Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation

Mass spectrometry (MS) offers unrivalled sensitivity for the metabolite profiling of complex biological matrices encountered in natural products (NP) research. The massive and complex sets of spectral data generated by such platforms require computational approaches for their interpretation. Within...

Descripción completa

Detalles Bibliográficos
Autores principales: Rutz, Adriano, Dounoue-Kubo, Miwa, Ollivier, Simon, Bisson, Jonathan, Bagheri, Mohsen, Saesong, Tongchai, Ebrahimi, Samad Nejad, Ingkaninan, Kornkanok, Wolfender, Jean-Luc, Allard, Pierre-Marie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824209/
https://www.ncbi.nlm.nih.gov/pubmed/31708947
http://dx.doi.org/10.3389/fpls.2019.01329
_version_ 1783464698441629696
author Rutz, Adriano
Dounoue-Kubo, Miwa
Ollivier, Simon
Bisson, Jonathan
Bagheri, Mohsen
Saesong, Tongchai
Ebrahimi, Samad Nejad
Ingkaninan, Kornkanok
Wolfender, Jean-Luc
Allard, Pierre-Marie
author_facet Rutz, Adriano
Dounoue-Kubo, Miwa
Ollivier, Simon
Bisson, Jonathan
Bagheri, Mohsen
Saesong, Tongchai
Ebrahimi, Samad Nejad
Ingkaninan, Kornkanok
Wolfender, Jean-Luc
Allard, Pierre-Marie
author_sort Rutz, Adriano
collection PubMed
description Mass spectrometry (MS) offers unrivalled sensitivity for the metabolite profiling of complex biological matrices encountered in natural products (NP) research. The massive and complex sets of spectral data generated by such platforms require computational approaches for their interpretation. Within such approaches, computational metabolite annotation automatically links spectral data to candidate structures via a score, which is usually established between the acquired data and experimental or theoretical spectral databases (DB). This process leads to various candidate structures for each MS features. However, at this stage, obtaining high annotation confidence level remains a challenge notably due to the extensive chemodiversity of specialized metabolomes. The design of a metascore is a way to capture complementary experimental attributes and improve the annotation process. Here, we show that integrating the taxonomic position of the biological source of the analyzed samples and candidate structures enhances confidence in metabolite annotation. A script is proposed to automatically input such information at various granularity levels (species, genus, and family) and complement the score obtained between experimental spectral data and output of available computational metabolite annotation tools (ISDB-DNP, MS-Finder, Sirius). In all cases, the consideration of the taxonomic distance allowed an efficient re-ranking of the candidate structures leading to a systematic enhancement of the recall and precision rates of the tools (1.5- to 7-fold increase in the F1 score). Our results clearly demonstrate the importance of considering taxonomic information in the process of specialized metabolites annotation. This requires to access structural data systematically documented with biological origin, both for new and previously reported NPs. In this respect, the establishment of an open structural DB of specialized metabolites and their associated metadata, particularly biological sources, is timely and critical for the NP research community.
format Online
Article
Text
id pubmed-6824209
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68242092019-11-08 Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation Rutz, Adriano Dounoue-Kubo, Miwa Ollivier, Simon Bisson, Jonathan Bagheri, Mohsen Saesong, Tongchai Ebrahimi, Samad Nejad Ingkaninan, Kornkanok Wolfender, Jean-Luc Allard, Pierre-Marie Front Plant Sci Plant Science Mass spectrometry (MS) offers unrivalled sensitivity for the metabolite profiling of complex biological matrices encountered in natural products (NP) research. The massive and complex sets of spectral data generated by such platforms require computational approaches for their interpretation. Within such approaches, computational metabolite annotation automatically links spectral data to candidate structures via a score, which is usually established between the acquired data and experimental or theoretical spectral databases (DB). This process leads to various candidate structures for each MS features. However, at this stage, obtaining high annotation confidence level remains a challenge notably due to the extensive chemodiversity of specialized metabolomes. The design of a metascore is a way to capture complementary experimental attributes and improve the annotation process. Here, we show that integrating the taxonomic position of the biological source of the analyzed samples and candidate structures enhances confidence in metabolite annotation. A script is proposed to automatically input such information at various granularity levels (species, genus, and family) and complement the score obtained between experimental spectral data and output of available computational metabolite annotation tools (ISDB-DNP, MS-Finder, Sirius). In all cases, the consideration of the taxonomic distance allowed an efficient re-ranking of the candidate structures leading to a systematic enhancement of the recall and precision rates of the tools (1.5- to 7-fold increase in the F1 score). Our results clearly demonstrate the importance of considering taxonomic information in the process of specialized metabolites annotation. This requires to access structural data systematically documented with biological origin, both for new and previously reported NPs. In this respect, the establishment of an open structural DB of specialized metabolites and their associated metadata, particularly biological sources, is timely and critical for the NP research community. Frontiers Media S.A. 2019-10-25 /pmc/articles/PMC6824209/ /pubmed/31708947 http://dx.doi.org/10.3389/fpls.2019.01329 Text en Copyright © 2019 Rutz, Dounoue-Kubo, Ollivier, Bisson, Bagheri, Saesong, Ebrahimi, Ingkaninan, Wolfender and Allard http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Rutz, Adriano
Dounoue-Kubo, Miwa
Ollivier, Simon
Bisson, Jonathan
Bagheri, Mohsen
Saesong, Tongchai
Ebrahimi, Samad Nejad
Ingkaninan, Kornkanok
Wolfender, Jean-Luc
Allard, Pierre-Marie
Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation
title Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation
title_full Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation
title_fullStr Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation
title_full_unstemmed Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation
title_short Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation
title_sort taxonomically informed scoring enhances confidence in natural products annotation
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824209/
https://www.ncbi.nlm.nih.gov/pubmed/31708947
http://dx.doi.org/10.3389/fpls.2019.01329
work_keys_str_mv AT rutzadriano taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT dounouekubomiwa taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT olliviersimon taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT bissonjonathan taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT bagherimohsen taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT saesongtongchai taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT ebrahimisamadnejad taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT ingkaninankornkanok taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT wolfenderjeanluc taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation
AT allardpierremarie taxonomicallyinformedscoringenhancesconfidenceinnaturalproductsannotation