Cargando…
An Automated Model Annotation System (AMAS) for SBML Models
MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370092/ https://www.ncbi.nlm.nih.gov/pubmed/37503075 http://dx.doi.org/10.1101/2023.07.19.549722 |
_version_ | 1785077887452315648 |
---|---|
author | Shin, Woosub Gennari, John H. Hellerstein, Joseph L. Sauro, Herbert M. |
author_facet | Shin, Woosub Gennari, John H. Hellerstein, Joseph L. Sauro, Herbert M. |
author_sort | Shin, Woosub |
collection | PubMed |
description | MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand the limitations of the model. The quality and quantity of annotations can be improved by developing tools that recommend annotations. For example, recommender tools have been developed for annotations of genes. Although annotating genes is conceptually similar to annotating biochemical models, there are important technical differences that make it difficult to directly apply this prior work. RESULTS: We present AMAS, a system that predicts annotations for elements of models represented in the Systems Biology Markup Language (SBML) community standard. We provide a general framework for predicting model annotations for a query element based on a database of annotated reference elements and a match score function that calculates the similarity between the query element and reference elements. The framework is instantiated to specific element types (e.g., species, reactions) by specifying the reference database (e.g., ChEBI for species) and the match score function (e.g., string similarity). We analyze the computational efficiency and prediction quality of AMAS for species and reactions in BiGG and BioModels and find that it has sub-second response times and accuracy between 80% and 95% depending on specifics of what is predicted. We have incorporated AMAS into an open-source, pip-installable Python package that can run as a command-line tool that predicts and adds annotations to species and reactions to an SBML model. AVAILABILITY: Our project is hosted at https://github.com/sys-bio/AMAS, where we provide examples, documentation, and source code files. Our source code is licensed under the MIT open-source license. |
format | Online Article Text |
id | pubmed-10370092 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-103700922023-07-27 An Automated Model Annotation System (AMAS) for SBML Models Shin, Woosub Gennari, John H. Hellerstein, Joseph L. Sauro, Herbert M. bioRxiv Article MOTIVATION: Annotations of biochemical models provide details of chemical species, documentation of chemical reactions, and other essential information. Unfortunately, the vast majority of biochemical models have few, if any, annotations, or the annotations provide insufficient detail to understand the limitations of the model. The quality and quantity of annotations can be improved by developing tools that recommend annotations. For example, recommender tools have been developed for annotations of genes. Although annotating genes is conceptually similar to annotating biochemical models, there are important technical differences that make it difficult to directly apply this prior work. RESULTS: We present AMAS, a system that predicts annotations for elements of models represented in the Systems Biology Markup Language (SBML) community standard. We provide a general framework for predicting model annotations for a query element based on a database of annotated reference elements and a match score function that calculates the similarity between the query element and reference elements. The framework is instantiated to specific element types (e.g., species, reactions) by specifying the reference database (e.g., ChEBI for species) and the match score function (e.g., string similarity). We analyze the computational efficiency and prediction quality of AMAS for species and reactions in BiGG and BioModels and find that it has sub-second response times and accuracy between 80% and 95% depending on specifics of what is predicted. We have incorporated AMAS into an open-source, pip-installable Python package that can run as a command-line tool that predicts and adds annotations to species and reactions to an SBML model. AVAILABILITY: Our project is hosted at https://github.com/sys-bio/AMAS, where we provide examples, documentation, and source code files. Our source code is licensed under the MIT open-source license. Cold Spring Harbor Laboratory 2023-07-21 /pmc/articles/PMC10370092/ /pubmed/37503075 http://dx.doi.org/10.1101/2023.07.19.549722 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Shin, Woosub Gennari, John H. Hellerstein, Joseph L. Sauro, Herbert M. An Automated Model Annotation System (AMAS) for SBML Models |
title | An Automated Model Annotation System (AMAS) for SBML Models |
title_full | An Automated Model Annotation System (AMAS) for SBML Models |
title_fullStr | An Automated Model Annotation System (AMAS) for SBML Models |
title_full_unstemmed | An Automated Model Annotation System (AMAS) for SBML Models |
title_short | An Automated Model Annotation System (AMAS) for SBML Models |
title_sort | automated model annotation system (amas) for sbml models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370092/ https://www.ncbi.nlm.nih.gov/pubmed/37503075 http://dx.doi.org/10.1101/2023.07.19.549722 |
work_keys_str_mv | AT shinwoosub anautomatedmodelannotationsystemamasforsbmlmodels AT gennarijohnh anautomatedmodelannotationsystemamasforsbmlmodels AT hellersteinjosephl anautomatedmodelannotationsystemamasforsbmlmodels AT sauroherbertm anautomatedmodelannotationsystemamasforsbmlmodels AT shinwoosub automatedmodelannotationsystemamasforsbmlmodels AT gennarijohnh automatedmodelannotationsystemamasforsbmlmodels AT hellersteinjosephl automatedmodelannotationsystemamasforsbmlmodels AT sauroherbertm automatedmodelannotationsystemamasforsbmlmodels |