Cargando…

Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness

With the growing unstructured data in healthcare and pharmaceutical, there has been a drastic adoption of natural language processing for generating actionable insights from text data sources. One of the key areas of our exploration is the Medical Information function within our organization. We rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Singhal, Shikha, Hegde, Bharat, Karmalkar, Prathamesh, Muhith, Justna, Gurulingappa, Harsha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8366288/
https://www.ncbi.nlm.nih.gov/pubmed/34409245
http://dx.doi.org/10.3389/frma.2021.683400
_version_ 1783738872371347456
author Singhal, Shikha
Hegde, Bharat
Karmalkar, Prathamesh
Muhith, Justna
Gurulingappa, Harsha
author_facet Singhal, Shikha
Hegde, Bharat
Karmalkar, Prathamesh
Muhith, Justna
Gurulingappa, Harsha
author_sort Singhal, Shikha
collection PubMed
description With the growing unstructured data in healthcare and pharmaceutical, there has been a drastic adoption of natural language processing for generating actionable insights from text data sources. One of the key areas of our exploration is the Medical Information function within our organization. We receive a significant amount of medical information inquires in the form of unstructured text. An enterprise-level solution must deal with medical information interactions via multiple communication channels which are always nuanced with a variety of keywords and emotions that are unique to the pharmaceutical industry. There is a strong need for an effective solution to leverage the contextual knowledge of the medical information business along with digital tenants of natural language processing (NLP) and machine learning to build an automated and scalable process that generates real-time insights on conversation categories. The traditional supervised learning methods rely on a huge set of manually labeled training data and this dataset is difficult to attain due to high labeling costs. Thus, the solution is incomplete without its ability to self-learn and improve. This necessitates techniques to automatically build relevant training data using a weakly supervised approach from textual inquiries across consumers, healthcare professionals, sales, and service providers. The solution has two fundamental layers of NLP and machine learning. The first layer leverages heuristics and knowledgebase to identify the potential categories and build an annotated training data. The second layer, based on machine learning and deep learning, utilizes the training data generated using the heuristic approach for identifying categories and sub-categories associated with verbatim. Here, we present a novel approach harnessing the power of weakly supervised learning combined with multi-class classification for improved categorization of medical information inquiries.
format Online
Article
Text
id pubmed-8366288
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-83662882021-08-17 Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness Singhal, Shikha Hegde, Bharat Karmalkar, Prathamesh Muhith, Justna Gurulingappa, Harsha Front Res Metr Anal Research Metrics and Analytics With the growing unstructured data in healthcare and pharmaceutical, there has been a drastic adoption of natural language processing for generating actionable insights from text data sources. One of the key areas of our exploration is the Medical Information function within our organization. We receive a significant amount of medical information inquires in the form of unstructured text. An enterprise-level solution must deal with medical information interactions via multiple communication channels which are always nuanced with a variety of keywords and emotions that are unique to the pharmaceutical industry. There is a strong need for an effective solution to leverage the contextual knowledge of the medical information business along with digital tenants of natural language processing (NLP) and machine learning to build an automated and scalable process that generates real-time insights on conversation categories. The traditional supervised learning methods rely on a huge set of manually labeled training data and this dataset is difficult to attain due to high labeling costs. Thus, the solution is incomplete without its ability to self-learn and improve. This necessitates techniques to automatically build relevant training data using a weakly supervised approach from textual inquiries across consumers, healthcare professionals, sales, and service providers. The solution has two fundamental layers of NLP and machine learning. The first layer leverages heuristics and knowledgebase to identify the potential categories and build an annotated training data. The second layer, based on machine learning and deep learning, utilizes the training data generated using the heuristic approach for identifying categories and sub-categories associated with verbatim. Here, we present a novel approach harnessing the power of weakly supervised learning combined with multi-class classification for improved categorization of medical information inquiries. Frontiers Media S.A. 2021-08-02 /pmc/articles/PMC8366288/ /pubmed/34409245 http://dx.doi.org/10.3389/frma.2021.683400 Text en Copyright © 2021 Singhal, Hegde, Karmalkar, Muhith and Gurulingappa. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Research Metrics and Analytics
Singhal, Shikha
Hegde, Bharat
Karmalkar, Prathamesh
Muhith, Justna
Gurulingappa, Harsha
Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness
title Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness
title_full Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness
title_fullStr Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness
title_full_unstemmed Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness
title_short Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness
title_sort weakly supervised learning for categorization of medical inquiries for customer service effectiveness
topic Research Metrics and Analytics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8366288/
https://www.ncbi.nlm.nih.gov/pubmed/34409245
http://dx.doi.org/10.3389/frma.2021.683400
work_keys_str_mv AT singhalshikha weaklysupervisedlearningforcategorizationofmedicalinquiriesforcustomerserviceeffectiveness
AT hegdebharat weaklysupervisedlearningforcategorizationofmedicalinquiriesforcustomerserviceeffectiveness
AT karmalkarprathamesh weaklysupervisedlearningforcategorizationofmedicalinquiriesforcustomerserviceeffectiveness
AT muhithjustna weaklysupervisedlearningforcategorizationofmedicalinquiriesforcustomerserviceeffectiveness
AT gurulingappaharsha weaklysupervisedlearningforcategorizationofmedicalinquiriesforcustomerserviceeffectiveness