Cargando…

Semantic Similarity for Automatic Classification of Chemical Compounds

With the increasing amount of data made available in the chemical field, there is a strong need for systems capable of comparing and classifying chemical compounds in an efficient and effective way. The best approaches existing today are based on the structure-activity relationship premise, which st...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferreira, João D., Couto, Francisco M.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2944781/
https://www.ncbi.nlm.nih.gov/pubmed/20885779
http://dx.doi.org/10.1371/journal.pcbi.1000937
_version_ 1782187125679063040
author Ferreira, João D.
Couto, Francisco M.
author_facet Ferreira, João D.
Couto, Francisco M.
author_sort Ferreira, João D.
collection PubMed
description With the increasing amount of data made available in the chemical field, there is a strong need for systems capable of comparing and classifying chemical compounds in an efficient and effective way. The best approaches existing today are based on the structure-activity relationship premise, which states that biological activity of a molecule is strongly related to its structural or physicochemical properties. This work presents a novel approach to the automatic classification of chemical compounds by integrating semantic similarity with existing structural comparison methods. Our approach was assessed based on the Matthews Correlation Coefficient for the prediction, and achieved values of 0.810 when used as a prediction of blood-brain barrier permeability, 0.694 for P-glycoprotein substrate, and 0.673 for estrogen receptor binding activity. These results expose a significant improvement over the currently existing methods, whose best performances were 0.628, 0.591, and 0.647 respectively. It was demonstrated that the integration of semantic similarity is a feasible and effective way to improve existing chemical compound classification systems. Among other possible uses, this tool helps the study of the evolution of metabolic pathways, the study of the correlation of metabolic networks with properties of those networks, or the improvement of ontologies that represent chemical information.
format Text
id pubmed-2944781
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29447812010-09-30 Semantic Similarity for Automatic Classification of Chemical Compounds Ferreira, João D. Couto, Francisco M. PLoS Comput Biol Research Article With the increasing amount of data made available in the chemical field, there is a strong need for systems capable of comparing and classifying chemical compounds in an efficient and effective way. The best approaches existing today are based on the structure-activity relationship premise, which states that biological activity of a molecule is strongly related to its structural or physicochemical properties. This work presents a novel approach to the automatic classification of chemical compounds by integrating semantic similarity with existing structural comparison methods. Our approach was assessed based on the Matthews Correlation Coefficient for the prediction, and achieved values of 0.810 when used as a prediction of blood-brain barrier permeability, 0.694 for P-glycoprotein substrate, and 0.673 for estrogen receptor binding activity. These results expose a significant improvement over the currently existing methods, whose best performances were 0.628, 0.591, and 0.647 respectively. It was demonstrated that the integration of semantic similarity is a feasible and effective way to improve existing chemical compound classification systems. Among other possible uses, this tool helps the study of the evolution of metabolic pathways, the study of the correlation of metabolic networks with properties of those networks, or the improvement of ontologies that represent chemical information. Public Library of Science 2010-09-23 /pmc/articles/PMC2944781/ /pubmed/20885779 http://dx.doi.org/10.1371/journal.pcbi.1000937 Text en Ferreira, Couto. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ferreira, João D.
Couto, Francisco M.
Semantic Similarity for Automatic Classification of Chemical Compounds
title Semantic Similarity for Automatic Classification of Chemical Compounds
title_full Semantic Similarity for Automatic Classification of Chemical Compounds
title_fullStr Semantic Similarity for Automatic Classification of Chemical Compounds
title_full_unstemmed Semantic Similarity for Automatic Classification of Chemical Compounds
title_short Semantic Similarity for Automatic Classification of Chemical Compounds
title_sort semantic similarity for automatic classification of chemical compounds
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2944781/
https://www.ncbi.nlm.nih.gov/pubmed/20885779
http://dx.doi.org/10.1371/journal.pcbi.1000937
work_keys_str_mv AT ferreirajoaod semanticsimilarityforautomaticclassificationofchemicalcompounds
AT coutofranciscom semanticsimilarityforautomaticclassificationofchemicalcompounds