Cargando…
An automated model annotation system (AMAS) for SBML models
MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10628433/ https://www.ncbi.nlm.nih.gov/pubmed/37882737 http://dx.doi.org/10.1093/bioinformatics/btad658 |
_version_ | 1785131757403635712 |
---|---|
author | Shin, Woosub Gennari, John H Hellerstein, Joseph L Sauro, Herbert M |
author_facet | Shin, Woosub Gennari, John H Hellerstein, Joseph L Sauro, Herbert M |
author_sort | Shin, Woosub |
collection | PubMed |
description | MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand the limitations of the model. The quality and quantity of annotations can be improved by developing tools that recommend annotations. For example, recommender tools have been developed for annotations of genes. Although annotating genes is conceptually similar to annotating biochemical models, there are important technical differences that make it difficult to directly apply this prior work. RESULTS: We present AMAS, a system that predicts annotations for elements of models represented in the Systems Biology Markup Language (SBML) community standard. We provide a general framework for predicting model annotations for a query element based on a database of annotated reference elements and a match score function that calculates the similarity between the query element and reference elements. The framework is instantiated to specific element types (e.g. species, reactions) by specifying the reference database (e.g. ChEBI for species) and the match score function (e.g. string similarity). We analyze the computational efficiency and prediction quality of AMAS for species and reactions in BiGG and BioModels and find that it has subsecond response times and accuracy between 80% and 95% depending on specifics of what is predicted. We have incorporated AMAS into an open-source, pip-installable Python package that can run as a command-line tool that predicts and adds annotations to species and reactions to an SBML model. AVAILABILITY AND IMPLEMENTATION: Our project is hosted at https://github.com/sys-bio/AMAS, where we provide examples, documentation, and source code files. Our source code is licensed under the MIT open-source license. |
format | Online Article Text |
id | pubmed-10628433 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106284332023-11-08 An automated model annotation system (AMAS) for SBML models Shin, Woosub Gennari, John H Hellerstein, Joseph L Sauro, Herbert M Bioinformatics Original Paper MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand the limitations of the model. The quality and quantity of annotations can be improved by developing tools that recommend annotations. For example, recommender tools have been developed for annotations of genes. Although annotating genes is conceptually similar to annotating biochemical models, there are important technical differences that make it difficult to directly apply this prior work. RESULTS: We present AMAS, a system that predicts annotations for elements of models represented in the Systems Biology Markup Language (SBML) community standard. We provide a general framework for predicting model annotations for a query element based on a database of annotated reference elements and a match score function that calculates the similarity between the query element and reference elements. The framework is instantiated to specific element types (e.g. species, reactions) by specifying the reference database (e.g. ChEBI for species) and the match score function (e.g. string similarity). We analyze the computational efficiency and prediction quality of AMAS for species and reactions in BiGG and BioModels and find that it has subsecond response times and accuracy between 80% and 95% depending on specifics of what is predicted. We have incorporated AMAS into an open-source, pip-installable Python package that can run as a command-line tool that predicts and adds annotations to species and reactions to an SBML model. AVAILABILITY AND IMPLEMENTATION: Our project is hosted at https://github.com/sys-bio/AMAS, where we provide examples, documentation, and source code files. Our source code is licensed under the MIT open-source license. Oxford University Press 2023-10-26 /pmc/articles/PMC10628433/ /pubmed/37882737 http://dx.doi.org/10.1093/bioinformatics/btad658 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Shin, Woosub Gennari, John H Hellerstein, Joseph L Sauro, Herbert M An automated model annotation system (AMAS) for SBML models |
title | An automated model annotation system (AMAS) for SBML models |
title_full | An automated model annotation system (AMAS) for SBML models |
title_fullStr | An automated model annotation system (AMAS) for SBML models |
title_full_unstemmed | An automated model annotation system (AMAS) for SBML models |
title_short | An automated model annotation system (AMAS) for SBML models |
title_sort | automated model annotation system (amas) for sbml models |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10628433/ https://www.ncbi.nlm.nih.gov/pubmed/37882737 http://dx.doi.org/10.1093/bioinformatics/btad658 |
work_keys_str_mv | AT shinwoosub anautomatedmodelannotationsystemamasforsbmlmodels AT gennarijohnh anautomatedmodelannotationsystemamasforsbmlmodels AT hellersteinjosephl anautomatedmodelannotationsystemamasforsbmlmodels AT sauroherbertm anautomatedmodelannotationsystemamasforsbmlmodels AT shinwoosub automatedmodelannotationsystemamasforsbmlmodels AT gennarijohnh automatedmodelannotationsystemamasforsbmlmodels AT hellersteinjosephl automatedmodelannotationsystemamasforsbmlmodels AT sauroherbertm automatedmodelannotationsystemamasforsbmlmodels |