Functional group and diversity analysis of BIOFACQUIM: A Mexican natural product database

Background: Natural product databases are important in drug discovery and other research areas. An analysis of their structural content and functional group occurrence provides a useful overview and a means of comparison with related databases. BIOFACQUIM is an emerging database of na...

Bibliographic Details
Main authors: Sánchez-Cruz, Norberto; Pilón-Jiménez, B. Angélica; Medina-Franco, José L.
Format: Online Article Text
Language: English
Published: F1000 Research Limited 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6993822/
https://www.ncbi.nlm.nih.gov/pubmed/32047598
http://dx.doi.org/10.12688/f1000research.21540.2
author Sánchez-Cruz, Norberto
Pilón-Jiménez, B. Angélica
Medina-Franco, José L.
author_facet Sánchez-Cruz, Norberto
Pilón-Jiménez, B. Angélica
Medina-Franco, José L.
author_sort Sánchez-Cruz, Norberto
collection PubMed
description Background: Natural product databases are important in drug discovery and other research areas. An analysis of their structural content and functional group occurrence provides a useful overview and a means of comparison with related databases. BIOFACQUIM is an emerging database of natural products characterized and isolated in Mexico. Herein, we discuss the results of a first systematic analysis of the functional groups and global diversity of an updated version of BIOFACQUIM. Methods: BIOFACQUIM was augmented through a literature search and data curation. A structural content analysis of the dataset was performed. This involved a functional group analysis with a novel algorithm that automatically identifies all functional groups in a molecule, and an assessment of the global diversity using consensus diversity plots. To this end, BIOFACQUIM was compared to two large reference databases: ChEMBL 25 and a collection of natural products assembled herein, comprising 169,839 unique compounds. Results: The structural content analysis showed that 15.7% of compounds and 11.6% of scaffolds present in the current version of BIOFACQUIM have not been reported in the large reference datasets. It also showed an increase in diversity, in terms of scaffolds and molecular fingerprints, relative to the previous version of the dataset, as well as a higher similarity to the assembled collection of natural products than to ChEMBL 25, in terms of diversity and frequent functional groups. Conclusions: A total of 148 natural products were added to BIOFACQUIM, which increased its diversity in terms of scaffolds and fingerprints. Despite its relatively small size, BIOFACQUIM contains a significant number of compounds and scaffolds that are not present in the reference datasets, showing that curated databases of natural products, such as BIOFACQUIM, can serve as a starting point to expand the biologically relevant chemical space.
format Online
Article
Text
id pubmed-6993822
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-6993822 2020-02-10 Functional group and diversity analysis of BIOFACQUIM: A Mexican natural product database Sánchez-Cruz, Norberto Pilón-Jiménez, B. Angélica Medina-Franco, José L. F1000Res Research Article Background: Natural product databases are important in drug discovery and other research areas. An analysis of their structural content and functional group occurrence provides a useful overview and a means of comparison with related databases. BIOFACQUIM is an emerging database of natural products characterized and isolated in Mexico. Herein, we discuss the results of a first systematic analysis of the functional groups and global diversity of an updated version of BIOFACQUIM. Methods: BIOFACQUIM was augmented through a literature search and data curation. A structural content analysis of the dataset was performed. This involved a functional group analysis with a novel algorithm that automatically identifies all functional groups in a molecule, and an assessment of the global diversity using consensus diversity plots. To this end, BIOFACQUIM was compared to two large reference databases: ChEMBL 25 and a collection of natural products assembled herein, comprising 169,839 unique compounds. Results: The structural content analysis showed that 15.7% of compounds and 11.6% of scaffolds present in the current version of BIOFACQUIM have not been reported in the large reference datasets. It also showed an increase in diversity, in terms of scaffolds and molecular fingerprints, relative to the previous version of the dataset, as well as a higher similarity to the assembled collection of natural products than to ChEMBL 25, in terms of diversity and frequent functional groups. Conclusions: A total of 148 natural products were added to BIOFACQUIM, which increased its diversity in terms of scaffolds and fingerprints. Despite its relatively small size, BIOFACQUIM contains a significant number of compounds and scaffolds that are not present in the reference datasets, showing that curated databases of natural products, such as BIOFACQUIM, can serve as a starting point to expand the biologically relevant chemical space. F1000 Research Limited 2020-06-08 /pmc/articles/PMC6993822/ /pubmed/32047598 http://dx.doi.org/10.12688/f1000research.21540.2 Text en Copyright: © 2020 Sánchez-Cruz N et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sánchez-Cruz, Norberto
Pilón-Jiménez, B. Angélica
Medina-Franco, José L.
Functional group and diversity analysis of BIOFACQUIM: A Mexican natural product database
title Functional group and diversity analysis of BIOFACQUIM: A Mexican natural product database
title_sort functional group and diversity analysis of biofacquim: a mexican natural product database
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6993822/
https://www.ncbi.nlm.nih.gov/pubmed/32047598
http://dx.doi.org/10.12688/f1000research.21540.2