Cargando…

The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight

In biomedical informatics, assigning drug codes to categories is a common step in the analysis pipeline. Unfortunately, incomplete mappings are the norm rather than the exception with coverage values less than 85% not uncommon. Here, we perform this linking task on a nationwide insurance claims data...

Descripción completa

Detalles Bibliográficos
Autores principales: Homer, Mark L., Palmer, Nathan P., Bodenreider, Olivier, Cami, Aurel, Chadwick, Laura, Mandl, Kenneth D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Informatics Association 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001754/
https://www.ncbi.nlm.nih.gov/pubmed/27570659
_version_ 1782450477372276736
author Homer, Mark L.
Palmer, Nathan P.
Bodenreider, Olivier
Cami, Aurel
Chadwick, Laura
Mandl, Kenneth D.
author_facet Homer, Mark L.
Palmer, Nathan P.
Bodenreider, Olivier
Cami, Aurel
Chadwick, Laura
Mandl, Kenneth D.
author_sort Homer, Mark L.
collection PubMed
description In biomedical informatics, assigning drug codes to categories is a common step in the analysis pipeline. Unfortunately, incomplete mappings are the norm rather than the exception with coverage values less than 85% not uncommon. Here, we perform this linking task on a nationwide insurance claims database with over 13 million members who were dispensed, according to National Drug Codes (NDCs), over 50,000 unique product forms of medication. The chosen approach employs Cerner Multum’s VantageRx and the U.S. National Library of Medicine’s RxMix. As a result, 94.0% of the NDCs were successfully mapped to categories used by common drug terminologies, e.g., Anatomical Therapeutic Chemical (ATC). Implemented as an SQL database and scripts, the approach is generic and can be setup for a new data set in a few hours. Thus, the method is a viable option for large-scale drug classification.
format Online
Article
Text
id pubmed-5001754
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher American Medical Informatics Association
record_format MEDLINE/PubMed
spelling pubmed-50017542016-08-26 The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight Homer, Mark L. Palmer, Nathan P. Bodenreider, Olivier Cami, Aurel Chadwick, Laura Mandl, Kenneth D. AMIA Jt Summits Transl Sci Proc Articles In biomedical informatics, assigning drug codes to categories is a common step in the analysis pipeline. Unfortunately, incomplete mappings are the norm rather than the exception with coverage values less than 85% not uncommon. Here, we perform this linking task on a nationwide insurance claims database with over 13 million members who were dispensed, according to National Drug Codes (NDCs), over 50,000 unique product forms of medication. The chosen approach employs Cerner Multum’s VantageRx and the U.S. National Library of Medicine’s RxMix. As a result, 94.0% of the NDCs were successfully mapped to categories used by common drug terminologies, e.g., Anatomical Therapeutic Chemical (ATC). Implemented as an SQL database and scripts, the approach is generic and can be setup for a new data set in a few hours. Thus, the method is a viable option for large-scale drug classification. American Medical Informatics Association 2016-07-20 /pmc/articles/PMC5001754/ /pubmed/27570659 Text en ©2016 AMIA - All rights reserved. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose
spellingShingle Articles
Homer, Mark L.
Palmer, Nathan P.
Bodenreider, Olivier
Cami, Aurel
Chadwick, Laura
Mandl, Kenneth D.
The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight
title The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight
title_full The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight
title_fullStr The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight
title_full_unstemmed The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight
title_short The Drug Data to Knowledge Pipeline: Large-Scale Claims Data Classification for Pharmacologic Insight
title_sort drug data to knowledge pipeline: large-scale claims data classification for pharmacologic insight
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001754/
https://www.ncbi.nlm.nih.gov/pubmed/27570659
work_keys_str_mv AT homermarkl thedrugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT palmernathanp thedrugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT bodenreiderolivier thedrugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT camiaurel thedrugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT chadwicklaura thedrugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT mandlkennethd thedrugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT homermarkl drugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT palmernathanp drugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT bodenreiderolivier drugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT camiaurel drugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT chadwicklaura drugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight
AT mandlkennethd drugdatatoknowledgepipelinelargescaleclaimsdataclassificationforpharmacologicinsight