Cargando…

Enhanced taxonomy annotation of antiviral activity data from ChEMBL

The discovery of antiviral drugs is a rapidly developing area of medicinal chemistry research. The emergence of resistant variants and outbreaks of poorly studied viral diseases make this area constantly developing. The amount of antiviral activity data available in ChEMBL consistently grows, but vi...

Descripción completa

Detalles Bibliográficos
Autores principales: Nikitina, Anastasia A, Orlov, Alexey A, Kozlovskaya, Liubov I, Palyulin, Vladimir A, Osolodkin, Dmitry I
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367519/
https://www.ncbi.nlm.nih.gov/pubmed/30753475
http://dx.doi.org/10.1093/database/bay139
_version_ 1783393818312179712
author Nikitina, Anastasia A
Orlov, Alexey A
Kozlovskaya, Liubov I
Palyulin, Vladimir A
Osolodkin, Dmitry I
author_facet Nikitina, Anastasia A
Orlov, Alexey A
Kozlovskaya, Liubov I
Palyulin, Vladimir A
Osolodkin, Dmitry I
author_sort Nikitina, Anastasia A
collection PubMed
description The discovery of antiviral drugs is a rapidly developing area of medicinal chemistry research. The emergence of resistant variants and outbreaks of poorly studied viral diseases make this area constantly developing. The amount of antiviral activity data available in ChEMBL consistently grows, but virus taxonomy annotation of these data is not sufficient for thorough studies of antiviral chemical space. We developed a procedure for semi-automatic extraction of antiviral activity data from ChEMBL and mapped them to the virus taxonomy developed by the International Committee for Taxonomy of Viruses (ICTV). The procedure is based on the lists of virus-related values of ChEMBL annotation fields and a dictionary of virus names and acronyms mapped to ICTV taxa. Application of this data extraction procedure allows retrieving from ChEMBL 1.6 times more assays linked to 2.5 times more compounds and data points than ChEMBL web interface allows. Mapping of these data to ICTV taxa allows analyzing all the compounds tested against each viral species. Activity values and structures of the compounds were standardized, and the antiviral activity profile was created for each standard structure. Data set compiled using this algorithm was called ViralChEMBL. As case studies, we compared descriptor and scaffold distributions for the full ChEMBL and its `viral’ and `non-viral’ subsets, identified the most studied compounds and created a self-organizing map for ViralChEMBL. Our approach to data annotation appeared to be a very efficient tool for the study of antiviral chemical space.
format Online
Article
Text
id pubmed-6367519
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63675192019-02-20 Enhanced taxonomy annotation of antiviral activity data from ChEMBL Nikitina, Anastasia A Orlov, Alexey A Kozlovskaya, Liubov I Palyulin, Vladimir A Osolodkin, Dmitry I Database (Oxford) Original Article The discovery of antiviral drugs is a rapidly developing area of medicinal chemistry research. The emergence of resistant variants and outbreaks of poorly studied viral diseases make this area constantly developing. The amount of antiviral activity data available in ChEMBL consistently grows, but virus taxonomy annotation of these data is not sufficient for thorough studies of antiviral chemical space. We developed a procedure for semi-automatic extraction of antiviral activity data from ChEMBL and mapped them to the virus taxonomy developed by the International Committee for Taxonomy of Viruses (ICTV). The procedure is based on the lists of virus-related values of ChEMBL annotation fields and a dictionary of virus names and acronyms mapped to ICTV taxa. Application of this data extraction procedure allows retrieving from ChEMBL 1.6 times more assays linked to 2.5 times more compounds and data points than ChEMBL web interface allows. Mapping of these data to ICTV taxa allows analyzing all the compounds tested against each viral species. Activity values and structures of the compounds were standardized, and the antiviral activity profile was created for each standard structure. Data set compiled using this algorithm was called ViralChEMBL. As case studies, we compared descriptor and scaffold distributions for the full ChEMBL and its `viral’ and `non-viral’ subsets, identified the most studied compounds and created a self-organizing map for ViralChEMBL. Our approach to data annotation appeared to be a very efficient tool for the study of antiviral chemical space. Oxford University Press 2019-02-08 /pmc/articles/PMC6367519/ /pubmed/30753475 http://dx.doi.org/10.1093/database/bay139 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Nikitina, Anastasia A
Orlov, Alexey A
Kozlovskaya, Liubov I
Palyulin, Vladimir A
Osolodkin, Dmitry I
Enhanced taxonomy annotation of antiviral activity data from ChEMBL
title Enhanced taxonomy annotation of antiviral activity data from ChEMBL
title_full Enhanced taxonomy annotation of antiviral activity data from ChEMBL
title_fullStr Enhanced taxonomy annotation of antiviral activity data from ChEMBL
title_full_unstemmed Enhanced taxonomy annotation of antiviral activity data from ChEMBL
title_short Enhanced taxonomy annotation of antiviral activity data from ChEMBL
title_sort enhanced taxonomy annotation of antiviral activity data from chembl
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367519/
https://www.ncbi.nlm.nih.gov/pubmed/30753475
http://dx.doi.org/10.1093/database/bay139
work_keys_str_mv AT nikitinaanastasiaa enhancedtaxonomyannotationofantiviralactivitydatafromchembl
AT orlovalexeya enhancedtaxonomyannotationofantiviralactivitydatafromchembl
AT kozlovskayaliubovi enhancedtaxonomyannotationofantiviralactivitydatafromchembl
AT palyulinvladimira enhancedtaxonomyannotationofantiviralactivitydatafromchembl
AT osolodkindmitryi enhancedtaxonomyannotationofantiviralactivitydatafromchembl