Cargando…

Blind trials of computer-assisted structure elucidation software

BACKGROUND: One of the largest challenges in chemistry today remains that of efficiently mining through vast amounts of data in order to elucidate the chemical structure for an unknown compound. The elucidated candidate compound must be fully consistent with the data and any other competing candidat...

Descripción completa

Detalles Bibliográficos
Autores principales: Moser, Arvin, Elyashberg, Mikhail E, Williams, Antony J, Blinov, Kirill A, DiMartino, Joseph C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3349476/
https://www.ncbi.nlm.nih.gov/pubmed/22321892
http://dx.doi.org/10.1186/1758-2946-4-5
_version_ 1782232507723284480
author Moser, Arvin
Elyashberg, Mikhail E
Williams, Antony J
Blinov, Kirill A
DiMartino, Joseph C
author_facet Moser, Arvin
Elyashberg, Mikhail E
Williams, Antony J
Blinov, Kirill A
DiMartino, Joseph C
author_sort Moser, Arvin
collection PubMed
description BACKGROUND: One of the largest challenges in chemistry today remains that of efficiently mining through vast amounts of data in order to elucidate the chemical structure for an unknown compound. The elucidated candidate compound must be fully consistent with the data and any other competing candidates efficiently eliminated without doubt by using additional data if necessary. It has become increasingly necessary to incorporate an in silico structure generation and verification tool to facilitate this elucidation process. An effective structure elucidation software technology aims to mimic the skills of a human in interpreting the complex nature of spectral data while producing a solution within a reasonable amount of time. This type of software is known as computer-assisted structure elucidation or CASE software. A systematic trial of the ACD/Structure Elucidator CASE software was conducted over an extended period of time by analysing a set of single and double-blind trials submitted by a global audience of scientists. The purpose of the blind trials was to reduce subjective bias. Double-blind trials comprised of data where the candidate compound was unknown to both the submitting scientist and the analyst. The level of expertise of the submitting scientist ranged from novice to expert structure elucidation specialists with experience in pharmaceutical, industrial, government and academic environments. RESULTS: Beginning in 2003, and for the following nine years, the algorithms and software technology contained within ACD/Structure Elucidator have been tested against 112 data sets; many of these were unique challenges. Of these challenges 9% were double-blind trials. The results of eighteen of the single-blind trials were investigated in detail and included problems of a diverse nature with many of the specific challenges associated with algorithmic structure elucidation such as deficiency in protons, structure symmetry, a large number of heteroatoms and poor quality spectral data. CONCLUSION: When applied to a complex set of blind trials, ACD/Structure Elucidator was shown to be a very useful tool in advancing the computer's contribution to elucidating a candidate structure from a set of spectral data (NMR and MS) for an unknown. The synergistic interaction between humans and computers can be highly beneficial in terms of less biased approaches to elucidation as well as dramatic improvements in speed and throughput. In those cases where multiple candidate structures exist, ACD/Structure Elucidator is equipped to validate the correct structure and eliminate inconsistent candidates. Full elucidation can generally be performed in less than two hours; this includes the average spectral data processing time and data input.
format Online
Article
Text
id pubmed-3349476
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33494762012-05-11 Blind trials of computer-assisted structure elucidation software Moser, Arvin Elyashberg, Mikhail E Williams, Antony J Blinov, Kirill A DiMartino, Joseph C J Cheminform Research Article BACKGROUND: One of the largest challenges in chemistry today remains that of efficiently mining through vast amounts of data in order to elucidate the chemical structure for an unknown compound. The elucidated candidate compound must be fully consistent with the data and any other competing candidates efficiently eliminated without doubt by using additional data if necessary. It has become increasingly necessary to incorporate an in silico structure generation and verification tool to facilitate this elucidation process. An effective structure elucidation software technology aims to mimic the skills of a human in interpreting the complex nature of spectral data while producing a solution within a reasonable amount of time. This type of software is known as computer-assisted structure elucidation or CASE software. A systematic trial of the ACD/Structure Elucidator CASE software was conducted over an extended period of time by analysing a set of single and double-blind trials submitted by a global audience of scientists. The purpose of the blind trials was to reduce subjective bias. Double-blind trials comprised of data where the candidate compound was unknown to both the submitting scientist and the analyst. The level of expertise of the submitting scientist ranged from novice to expert structure elucidation specialists with experience in pharmaceutical, industrial, government and academic environments. RESULTS: Beginning in 2003, and for the following nine years, the algorithms and software technology contained within ACD/Structure Elucidator have been tested against 112 data sets; many of these were unique challenges. Of these challenges 9% were double-blind trials. The results of eighteen of the single-blind trials were investigated in detail and included problems of a diverse nature with many of the specific challenges associated with algorithmic structure elucidation such as deficiency in protons, structure symmetry, a large number of heteroatoms and poor quality spectral data. CONCLUSION: When applied to a complex set of blind trials, ACD/Structure Elucidator was shown to be a very useful tool in advancing the computer's contribution to elucidating a candidate structure from a set of spectral data (NMR and MS) for an unknown. The synergistic interaction between humans and computers can be highly beneficial in terms of less biased approaches to elucidation as well as dramatic improvements in speed and throughput. In those cases where multiple candidate structures exist, ACD/Structure Elucidator is equipped to validate the correct structure and eliminate inconsistent candidates. Full elucidation can generally be performed in less than two hours; this includes the average spectral data processing time and data input. BioMed Central 2012-02-09 /pmc/articles/PMC3349476/ /pubmed/22321892 http://dx.doi.org/10.1186/1758-2946-4-5 Text en Copyright ©2012 Moser et al; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Moser, Arvin
Elyashberg, Mikhail E
Williams, Antony J
Blinov, Kirill A
DiMartino, Joseph C
Blind trials of computer-assisted structure elucidation software
title Blind trials of computer-assisted structure elucidation software
title_full Blind trials of computer-assisted structure elucidation software
title_fullStr Blind trials of computer-assisted structure elucidation software
title_full_unstemmed Blind trials of computer-assisted structure elucidation software
title_short Blind trials of computer-assisted structure elucidation software
title_sort blind trials of computer-assisted structure elucidation software
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3349476/
https://www.ncbi.nlm.nih.gov/pubmed/22321892
http://dx.doi.org/10.1186/1758-2946-4-5
work_keys_str_mv AT moserarvin blindtrialsofcomputerassistedstructureelucidationsoftware
AT elyashbergmikhaile blindtrialsofcomputerassistedstructureelucidationsoftware
AT williamsantonyj blindtrialsofcomputerassistedstructureelucidationsoftware
AT blinovkirilla blindtrialsofcomputerassistedstructureelucidationsoftware
AT dimartinojosephc blindtrialsofcomputerassistedstructureelucidationsoftware