Cargando…

Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics

BACKGROUND: The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have...

Descripción completa

Detalles Bibliográficos
Autores principales: Chepelev, Leonid L, Riazanov, Alexandre, Kouznetsov, Alexandre, Low, Hong Sang, Dumontier, Michel, Baker, Christopher JO
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3163564/
https://www.ncbi.nlm.nih.gov/pubmed/21791100
http://dx.doi.org/10.1186/1471-2105-12-303
_version_ 1782210962455003136
author Chepelev, Leonid L
Riazanov, Alexandre
Kouznetsov, Alexandre
Low, Hong Sang
Dumontier, Michel
Baker, Christopher JO
author_facet Chepelev, Leonid L
Riazanov, Alexandre
Kouznetsov, Alexandre
Low, Hong Sang
Dumontier, Michel
Baker, Christopher JO
author_sort Chepelev, Leonid L
collection PubMed
description BACKGROUND: The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. RESULTS: As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI) framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of our integrative methodology in the context of high-throughput lipidomics. CONCLUSIONS: Our prototype framework is capable of accurate automated classification of lipids and facile integration of lipid class information with additional data obtained with SADI web services. The potential of programming-free integration of external web services through the SADI framework offers an opportunity for development of powerful novel applications in lipidomics. We conclude that semantic web technologies can provide an accurate and versatile means of classification and annotation of lipids.
format Online
Article
Text
id pubmed-3163564
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31635642011-08-30 Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics Chepelev, Leonid L Riazanov, Alexandre Kouznetsov, Alexandre Low, Hong Sang Dumontier, Michel Baker, Christopher JO BMC Bioinformatics Methodology Article BACKGROUND: The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. RESULTS: As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI) framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of our integrative methodology in the context of high-throughput lipidomics. CONCLUSIONS: Our prototype framework is capable of accurate automated classification of lipids and facile integration of lipid class information with additional data obtained with SADI web services. The potential of programming-free integration of external web services through the SADI framework offers an opportunity for development of powerful novel applications in lipidomics. We conclude that semantic web technologies can provide an accurate and versatile means of classification and annotation of lipids. BioMed Central 2011-07-26 /pmc/articles/PMC3163564/ /pubmed/21791100 http://dx.doi.org/10.1186/1471-2105-12-303 Text en Copyright ©2011 Chepelev et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Chepelev, Leonid L
Riazanov, Alexandre
Kouznetsov, Alexandre
Low, Hong Sang
Dumontier, Michel
Baker, Christopher JO
Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
title Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
title_full Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
title_fullStr Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
title_full_unstemmed Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
title_short Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
title_sort prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3163564/
https://www.ncbi.nlm.nih.gov/pubmed/21791100
http://dx.doi.org/10.1186/1471-2105-12-303
work_keys_str_mv AT chepelevleonidl prototypesemanticinfrastructureforautomatedsmallmoleculeclassificationandannotationinlipidomics
AT riazanovalexandre prototypesemanticinfrastructureforautomatedsmallmoleculeclassificationandannotationinlipidomics
AT kouznetsovalexandre prototypesemanticinfrastructureforautomatedsmallmoleculeclassificationandannotationinlipidomics
AT lowhongsang prototypesemanticinfrastructureforautomatedsmallmoleculeclassificationandannotationinlipidomics
AT dumontiermichel prototypesemanticinfrastructureforautomatedsmallmoleculeclassificationandannotationinlipidomics
AT bakerchristopherjo prototypesemanticinfrastructureforautomatedsmallmoleculeclassificationandannotationinlipidomics