Cargando…
Enhanced taxonomy annotation of antiviral activity data from ChEMBL
The discovery of antiviral drugs is a rapidly developing area of medicinal chemistry research. The emergence of resistant variants and outbreaks of poorly studied viral diseases make this area constantly developing. The amount of antiviral activity data available in ChEMBL consistently grows, but vi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367519/ https://www.ncbi.nlm.nih.gov/pubmed/30753475 http://dx.doi.org/10.1093/database/bay139 |
_version_ | 1783393818312179712 |
---|---|
author | Nikitina, Anastasia A Orlov, Alexey A Kozlovskaya, Liubov I Palyulin, Vladimir A Osolodkin, Dmitry I |
author_facet | Nikitina, Anastasia A Orlov, Alexey A Kozlovskaya, Liubov I Palyulin, Vladimir A Osolodkin, Dmitry I |
author_sort | Nikitina, Anastasia A |
collection | PubMed |
description | The discovery of antiviral drugs is a rapidly developing area of medicinal chemistry research. The emergence of resistant variants and outbreaks of poorly studied viral diseases make this area constantly developing. The amount of antiviral activity data available in ChEMBL consistently grows, but virus taxonomy annotation of these data is not sufficient for thorough studies of antiviral chemical space. We developed a procedure for semi-automatic extraction of antiviral activity data from ChEMBL and mapped them to the virus taxonomy developed by the International Committee for Taxonomy of Viruses (ICTV). The procedure is based on the lists of virus-related values of ChEMBL annotation fields and a dictionary of virus names and acronyms mapped to ICTV taxa. Application of this data extraction procedure allows retrieving from ChEMBL 1.6 times more assays linked to 2.5 times more compounds and data points than ChEMBL web interface allows. Mapping of these data to ICTV taxa allows analyzing all the compounds tested against each viral species. Activity values and structures of the compounds were standardized, and the antiviral activity profile was created for each standard structure. Data set compiled using this algorithm was called ViralChEMBL. As case studies, we compared descriptor and scaffold distributions for the full ChEMBL and its `viral’ and `non-viral’ subsets, identified the most studied compounds and created a self-organizing map for ViralChEMBL. Our approach to data annotation appeared to be a very efficient tool for the study of antiviral chemical space. |
format | Online Article Text |
id | pubmed-6367519 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63675192019-02-20 Enhanced taxonomy annotation of antiviral activity data from ChEMBL Nikitina, Anastasia A Orlov, Alexey A Kozlovskaya, Liubov I Palyulin, Vladimir A Osolodkin, Dmitry I Database (Oxford) Original Article The discovery of antiviral drugs is a rapidly developing area of medicinal chemistry research. The emergence of resistant variants and outbreaks of poorly studied viral diseases make this area constantly developing. The amount of antiviral activity data available in ChEMBL consistently grows, but virus taxonomy annotation of these data is not sufficient for thorough studies of antiviral chemical space. We developed a procedure for semi-automatic extraction of antiviral activity data from ChEMBL and mapped them to the virus taxonomy developed by the International Committee for Taxonomy of Viruses (ICTV). The procedure is based on the lists of virus-related values of ChEMBL annotation fields and a dictionary of virus names and acronyms mapped to ICTV taxa. Application of this data extraction procedure allows retrieving from ChEMBL 1.6 times more assays linked to 2.5 times more compounds and data points than ChEMBL web interface allows. Mapping of these data to ICTV taxa allows analyzing all the compounds tested against each viral species. Activity values and structures of the compounds were standardized, and the antiviral activity profile was created for each standard structure. Data set compiled using this algorithm was called ViralChEMBL. As case studies, we compared descriptor and scaffold distributions for the full ChEMBL and its `viral’ and `non-viral’ subsets, identified the most studied compounds and created a self-organizing map for ViralChEMBL. Our approach to data annotation appeared to be a very efficient tool for the study of antiviral chemical space. Oxford University Press 2019-02-08 /pmc/articles/PMC6367519/ /pubmed/30753475 http://dx.doi.org/10.1093/database/bay139 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Nikitina, Anastasia A Orlov, Alexey A Kozlovskaya, Liubov I Palyulin, Vladimir A Osolodkin, Dmitry I Enhanced taxonomy annotation of antiviral activity data from ChEMBL |
title | Enhanced taxonomy annotation of antiviral activity data from ChEMBL |
title_full | Enhanced taxonomy annotation of antiviral activity data from ChEMBL |
title_fullStr | Enhanced taxonomy annotation of antiviral activity data from ChEMBL |
title_full_unstemmed | Enhanced taxonomy annotation of antiviral activity data from ChEMBL |
title_short | Enhanced taxonomy annotation of antiviral activity data from ChEMBL |
title_sort | enhanced taxonomy annotation of antiviral activity data from chembl |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367519/ https://www.ncbi.nlm.nih.gov/pubmed/30753475 http://dx.doi.org/10.1093/database/bay139 |
work_keys_str_mv | AT nikitinaanastasiaa enhancedtaxonomyannotationofantiviralactivitydatafromchembl AT orlovalexeya enhancedtaxonomyannotationofantiviralactivitydatafromchembl AT kozlovskayaliubovi enhancedtaxonomyannotationofantiviralactivitydatafromchembl AT palyulinvladimira enhancedtaxonomyannotationofantiviralactivitydatafromchembl AT osolodkindmitryi enhancedtaxonomyannotationofantiviralactivitydatafromchembl |