Cargando…

Predicting anatomic therapeutic chemical classification codes using tiered learning

BACKGROUND: The low success rate and high cost of drug discovery requires the development of new paradigms to identify molecules of therapeutic value. The Anatomical Therapeutic Chemical (ATC) Code System is a World Health Organization (WHO) proposed classification that assigns multi-level codes to...

Descripción completa

Detalles Bibliográficos
Autores principales:	Olson, Thomas, Singh, Rahul
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2017
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471942/ https://www.ncbi.nlm.nih.gov/pubmed/28617230 http://dx.doi.org/10.1186/s12859-017-1660-6

_version_	1783244050152816640
author	Olson, Thomas Singh, Rahul
author_facet	Olson, Thomas Singh, Rahul
author_sort	Olson, Thomas
collection	PubMed
description	BACKGROUND: The low success rate and high cost of drug discovery requires the development of new paradigms to identify molecules of therapeutic value. The Anatomical Therapeutic Chemical (ATC) Code System is a World Health Organization (WHO) proposed classification that assigns multi-level codes to compounds based on their therapeutic, pharmacological and chemical characteristics as well as the in-vivo sites(s) of activity. The ability to predict ATC codes of compounds can assist in creation of high-quality chemical libraries for drug screening and in applications such as drug repositioning. We propose a machine learning architecture called tiered learning for prediction of ATC codes that relies on the prediction results of the higher levels of the ATC code to simplify the predictions of the lower levels. RESULTS: The proposed approach was validated using a number of compounds in both cross-validation and test setting. The validation experiments compared chemical descriptors, initialization methods and classification algorithms. The prediction accuracy obtained with tiered learning was found to be either comparable or better than that of established methods. Additionally, the experiments demonstrated the generalizability of the tiered learning architecture, in that its use was found to improve prediction rates for a majority of machine learning algorithms when compared to their stand-alone application. CONCLUSION: The basis of our approach lies in the observation that anatomical-therapeutic biological activity of certain types typically precludes activities of many other types. Thus, there exists a characteristic distribution of the ATC codes, which can be leveraged to limit the search-space of possible codes that can be ascribed at a particular level once the codes at the preceding levels are known. Tiered learning utilizes this observation to constrain the learning space for ATC codes at a particular level based on the ATC code at higher levels. This simplifies the prediction and allows for improved accuracy.
format	Online Article Text
id	pubmed-5471942
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-54719422017-06-19 Predicting anatomic therapeutic chemical classification codes using tiered learning Olson, Thomas Singh, Rahul BMC Bioinformatics Research BACKGROUND: The low success rate and high cost of drug discovery requires the development of new paradigms to identify molecules of therapeutic value. The Anatomical Therapeutic Chemical (ATC) Code System is a World Health Organization (WHO) proposed classification that assigns multi-level codes to compounds based on their therapeutic, pharmacological and chemical characteristics as well as the in-vivo sites(s) of activity. The ability to predict ATC codes of compounds can assist in creation of high-quality chemical libraries for drug screening and in applications such as drug repositioning. We propose a machine learning architecture called tiered learning for prediction of ATC codes that relies on the prediction results of the higher levels of the ATC code to simplify the predictions of the lower levels. RESULTS: The proposed approach was validated using a number of compounds in both cross-validation and test setting. The validation experiments compared chemical descriptors, initialization methods and classification algorithms. The prediction accuracy obtained with tiered learning was found to be either comparable or better than that of established methods. Additionally, the experiments demonstrated the generalizability of the tiered learning architecture, in that its use was found to improve prediction rates for a majority of machine learning algorithms when compared to their stand-alone application. CONCLUSION: The basis of our approach lies in the observation that anatomical-therapeutic biological activity of certain types typically precludes activities of many other types. Thus, there exists a characteristic distribution of the ATC codes, which can be leveraged to limit the search-space of possible codes that can be ascribed at a particular level once the codes at the preceding levels are known. Tiered learning utilizes this observation to constrain the learning space for ATC codes at a particular level based on the ATC code at higher levels. This simplifies the prediction and allows for improved accuracy. BioMed Central 2017-06-07 /pmc/articles/PMC5471942/ /pubmed/28617230 http://dx.doi.org/10.1186/s12859-017-1660-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Olson, Thomas Singh, Rahul Predicting anatomic therapeutic chemical classification codes using tiered learning
title	Predicting anatomic therapeutic chemical classification codes using tiered learning
title_full	Predicting anatomic therapeutic chemical classification codes using tiered learning
title_fullStr	Predicting anatomic therapeutic chemical classification codes using tiered learning
title_full_unstemmed	Predicting anatomic therapeutic chemical classification codes using tiered learning
title_short	Predicting anatomic therapeutic chemical classification codes using tiered learning
title_sort	predicting anatomic therapeutic chemical classification codes using tiered learning
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471942/ https://www.ncbi.nlm.nih.gov/pubmed/28617230 http://dx.doi.org/10.1186/s12859-017-1660-6
work_keys_str_mv	AT olsonthomas predictinganatomictherapeuticchemicalclassificationcodesusingtieredlearning AT singhrahul predictinganatomictherapeuticchemicalclassificationcodesusingtieredlearning

Predicting anatomic therapeutic chemical classification codes using tiered learning

Ejemplares similares