Cargando…

Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison

OBJECTIVES: Automatic job coding tools were developed to reduce the laborious task of manually assigning job codes based on free-text job descriptions in census and survey data sources, including large occupational health studies. The objective of this study is to provide a case study of comparative...

Descripción completa

Detalles Bibliográficos
Autores principales: Wan, Wenxin, Ge, Calvin B, Friesen, Melissa C, Locke, Sarah J, Russ, Daniel E, Burstyn, Igor, Baker, Christopher J O, Adisesh, Anil, Lan, Qing, Rothman, Nathaniel, Huss, Anke, van Tongeren, Martie, Vermeulen, Roel, Peters, Susan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243927/
https://www.ncbi.nlm.nih.gov/pubmed/36734402
http://dx.doi.org/10.1093/annweh/wxad002
_version_ 1785054532297818112
author Wan, Wenxin
Ge, Calvin B
Friesen, Melissa C
Locke, Sarah J
Russ, Daniel E
Burstyn, Igor
Baker, Christopher J O
Adisesh, Anil
Lan, Qing
Rothman, Nathaniel
Huss, Anke
van Tongeren, Martie
Vermeulen, Roel
Peters, Susan
author_facet Wan, Wenxin
Ge, Calvin B
Friesen, Melissa C
Locke, Sarah J
Russ, Daniel E
Burstyn, Igor
Baker, Christopher J O
Adisesh, Anil
Lan, Qing
Rothman, Nathaniel
Huss, Anke
van Tongeren, Martie
Vermeulen, Roel
Peters, Susan
author_sort Wan, Wenxin
collection PubMed
description OBJECTIVES: Automatic job coding tools were developed to reduce the laborious task of manually assigning job codes based on free-text job descriptions in census and survey data sources, including large occupational health studies. The objective of this study is to provide a case study of comparative performance of job coding and JEM (Job-Exposure Matrix)-assigned exposures agreement using existing coding tools. METHODS: We compared three automatic job coding tools [AUTONOC, CASCOT (Computer-Assisted Structured Coding Tool), and LabourR], which were selected based on availability, coding of English free-text into coding systems closely related to the 1988 version of the International Standard Classification of Occupations (ISCO-88), and capability to perform batch coding. We used manually coded job histories from the AsiaLymph case-control study that were translated into English prior to auto-coding to assess their performance. We applied two general population JEMs to assess agreement at exposure level. Percent agreement and PABAK (Prevalence-Adjusted Bias-Adjusted Kappa) were used to compare the agreement of results from manual coders and automatic coding tools. RESULTS: The coding per cent agreement among the three tools ranged from 17.7 to 26.0% for exact matches at the most detailed 4-digit ISCO-88 level. The agreement was better at a more general level of job coding (e.g. 43.8–58.1% in 1-digit ISCO-88), and in exposure assignments (median values of PABAK coefficient ranging 0.69–0.78 across 12 JEM-assigned exposures). Based on our testing data, CASCOT was found to outperform others in terms of better agreement in both job coding (26% 4-digit agreement) and exposure assignment (median kappa 0.61). CONCLUSIONS: In this study, we observed that agreement on job coding was generally low for the three tools but noted a higher degree of agreement in assigned exposures. The results indicate the need for study-specific evaluations prior to their automatic use in general population studies, as well as improvements in the evaluated automatic coding tools.
format Online
Article
Text
id pubmed-10243927
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102439272023-06-07 Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison Wan, Wenxin Ge, Calvin B Friesen, Melissa C Locke, Sarah J Russ, Daniel E Burstyn, Igor Baker, Christopher J O Adisesh, Anil Lan, Qing Rothman, Nathaniel Huss, Anke van Tongeren, Martie Vermeulen, Roel Peters, Susan Ann Work Expo Health Original Articles OBJECTIVES: Automatic job coding tools were developed to reduce the laborious task of manually assigning job codes based on free-text job descriptions in census and survey data sources, including large occupational health studies. The objective of this study is to provide a case study of comparative performance of job coding and JEM (Job-Exposure Matrix)-assigned exposures agreement using existing coding tools. METHODS: We compared three automatic job coding tools [AUTONOC, CASCOT (Computer-Assisted Structured Coding Tool), and LabourR], which were selected based on availability, coding of English free-text into coding systems closely related to the 1988 version of the International Standard Classification of Occupations (ISCO-88), and capability to perform batch coding. We used manually coded job histories from the AsiaLymph case-control study that were translated into English prior to auto-coding to assess their performance. We applied two general population JEMs to assess agreement at exposure level. Percent agreement and PABAK (Prevalence-Adjusted Bias-Adjusted Kappa) were used to compare the agreement of results from manual coders and automatic coding tools. RESULTS: The coding per cent agreement among the three tools ranged from 17.7 to 26.0% for exact matches at the most detailed 4-digit ISCO-88 level. The agreement was better at a more general level of job coding (e.g. 43.8–58.1% in 1-digit ISCO-88), and in exposure assignments (median values of PABAK coefficient ranging 0.69–0.78 across 12 JEM-assigned exposures). Based on our testing data, CASCOT was found to outperform others in terms of better agreement in both job coding (26% 4-digit agreement) and exposure assignment (median kappa 0.61). CONCLUSIONS: In this study, we observed that agreement on job coding was generally low for the three tools but noted a higher degree of agreement in assigned exposures. The results indicate the need for study-specific evaluations prior to their automatic use in general population studies, as well as improvements in the evaluated automatic coding tools. Oxford University Press 2023-02-03 /pmc/articles/PMC10243927/ /pubmed/36734402 http://dx.doi.org/10.1093/annweh/wxad002 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the British Occupational Hygiene Society. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Articles
Wan, Wenxin
Ge, Calvin B
Friesen, Melissa C
Locke, Sarah J
Russ, Daniel E
Burstyn, Igor
Baker, Christopher J O
Adisesh, Anil
Lan, Qing
Rothman, Nathaniel
Huss, Anke
van Tongeren, Martie
Vermeulen, Roel
Peters, Susan
Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison
title Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison
title_full Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison
title_fullStr Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison
title_full_unstemmed Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison
title_short Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison
title_sort automated coding of job descriptions from a general population study: overview of existing tools, their application and comparison
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243927/
https://www.ncbi.nlm.nih.gov/pubmed/36734402
http://dx.doi.org/10.1093/annweh/wxad002
work_keys_str_mv AT wanwenxin automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT gecalvinb automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT friesenmelissac automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT lockesarahj automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT russdaniele automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT burstynigor automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT bakerchristopherjo automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT adiseshanil automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT lanqing automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT rothmannathaniel automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT hussanke automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT vantongerenmartie automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT vermeulenroel automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison
AT peterssusan automatedcodingofjobdescriptionsfromageneralpopulationstudyoverviewofexistingtoolstheirapplicationandcomparison