Cargando…

An automated model annotation system (AMAS) for SBML models

MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand...

Descripción completa

Detalles Bibliográficos
Autores principales: Shin, Woosub, Gennari, John H, Hellerstein, Joseph L, Sauro, Herbert M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10628433/
https://www.ncbi.nlm.nih.gov/pubmed/37882737
http://dx.doi.org/10.1093/bioinformatics/btad658
_version_ 1785131757403635712
author Shin, Woosub
Gennari, John H
Hellerstein, Joseph L
Sauro, Herbert M
author_facet Shin, Woosub
Gennari, John H
Hellerstein, Joseph L
Sauro, Herbert M
author_sort Shin, Woosub
collection PubMed
description MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand the limitations of the model. The quality and quantity of annotations can be improved by developing tools that recommend annotations. For example, recommender tools have been developed for annotations of genes. Although annotating genes is conceptually similar to annotating biochemical models, there are important technical differences that make it difficult to directly apply this prior work. RESULTS: We present AMAS, a system that predicts annotations for elements of models represented in the Systems Biology Markup Language (SBML) community standard. We provide a general framework for predicting model annotations for a query element based on a database of annotated reference elements and a match score function that calculates the similarity between the query element and reference elements. The framework is instantiated to specific element types (e.g. species, reactions) by specifying the reference database (e.g. ChEBI for species) and the match score function (e.g. string similarity). We analyze the computational efficiency and prediction quality of AMAS for species and reactions in BiGG and BioModels and find that it has subsecond response times and accuracy between 80% and 95% depending on specifics of what is predicted. We have incorporated AMAS into an open-source, pip-installable Python package that can run as a command-line tool that predicts and adds annotations to species and reactions to an SBML model. AVAILABILITY AND IMPLEMENTATION: Our project is hosted at https://github.com/sys-bio/AMAS, where we provide examples, documentation, and source code files. Our source code is licensed under the MIT open-source license.
format Online
Article
Text
id pubmed-10628433
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106284332023-11-08 An automated model annotation system (AMAS) for SBML models Shin, Woosub Gennari, John H Hellerstein, Joseph L Sauro, Herbert M Bioinformatics Original Paper MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand the limitations of the model. The quality and quantity of annotations can be improved by developing tools that recommend annotations. For example, recommender tools have been developed for annotations of genes. Although annotating genes is conceptually similar to annotating biochemical models, there are important technical differences that make it difficult to directly apply this prior work. RESULTS: We present AMAS, a system that predicts annotations for elements of models represented in the Systems Biology Markup Language (SBML) community standard. We provide a general framework for predicting model annotations for a query element based on a database of annotated reference elements and a match score function that calculates the similarity between the query element and reference elements. The framework is instantiated to specific element types (e.g. species, reactions) by specifying the reference database (e.g. ChEBI for species) and the match score function (e.g. string similarity). We analyze the computational efficiency and prediction quality of AMAS for species and reactions in BiGG and BioModels and find that it has subsecond response times and accuracy between 80% and 95% depending on specifics of what is predicted. We have incorporated AMAS into an open-source, pip-installable Python package that can run as a command-line tool that predicts and adds annotations to species and reactions to an SBML model. AVAILABILITY AND IMPLEMENTATION: Our project is hosted at https://github.com/sys-bio/AMAS, where we provide examples, documentation, and source code files. Our source code is licensed under the MIT open-source license. Oxford University Press 2023-10-26 /pmc/articles/PMC10628433/ /pubmed/37882737 http://dx.doi.org/10.1093/bioinformatics/btad658 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Shin, Woosub
Gennari, John H
Hellerstein, Joseph L
Sauro, Herbert M
An automated model annotation system (AMAS) for SBML models
title An automated model annotation system (AMAS) for SBML models
title_full An automated model annotation system (AMAS) for SBML models
title_fullStr An automated model annotation system (AMAS) for SBML models
title_full_unstemmed An automated model annotation system (AMAS) for SBML models
title_short An automated model annotation system (AMAS) for SBML models
title_sort automated model annotation system (amas) for sbml models
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10628433/
https://www.ncbi.nlm.nih.gov/pubmed/37882737
http://dx.doi.org/10.1093/bioinformatics/btad658
work_keys_str_mv AT shinwoosub anautomatedmodelannotationsystemamasforsbmlmodels
AT gennarijohnh anautomatedmodelannotationsystemamasforsbmlmodels
AT hellersteinjosephl anautomatedmodelannotationsystemamasforsbmlmodels
AT sauroherbertm anautomatedmodelannotationsystemamasforsbmlmodels
AT shinwoosub automatedmodelannotationsystemamasforsbmlmodels
AT gennarijohnh automatedmodelannotationsystemamasforsbmlmodels
AT hellersteinjosephl automatedmodelannotationsystemamasforsbmlmodels
AT sauroherbertm automatedmodelannotationsystemamasforsbmlmodels