Cargando…

Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study

BACKGROUND: Isotretinoin, for treating cystic acne, increases the risk of miscarriage and fetal abnormalities when taken during pregnancy. The Health Canada–approved product monograph for isotretinoin includes pregnancy prevention guidelines. A recent study by the Canadian Network for Observational...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammadhassanzadeh, Hossein, Sketris, Ingrid, Traynor, Robyn, Alexander, Susan, Winquist, Brandace, Stewart, Samuel Alan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6996767/
https://www.ncbi.nlm.nih.gov/pubmed/31934872
http://dx.doi.org/10.2196/13296
_version_ 1783493571557457920
author Mohammadhassanzadeh, Hossein
Sketris, Ingrid
Traynor, Robyn
Alexander, Susan
Winquist, Brandace
Stewart, Samuel Alan
author_facet Mohammadhassanzadeh, Hossein
Sketris, Ingrid
Traynor, Robyn
Alexander, Susan
Winquist, Brandace
Stewart, Samuel Alan
author_sort Mohammadhassanzadeh, Hossein
collection PubMed
description BACKGROUND: Isotretinoin, for treating cystic acne, increases the risk of miscarriage and fetal abnormalities when taken during pregnancy. The Health Canada–approved product monograph for isotretinoin includes pregnancy prevention guidelines. A recent study by the Canadian Network for Observational Drug Effect Studies (CNODES) on the occurrence of pregnancy and pregnancy outcomes during isotretinoin therapy estimated poor adherence to these guidelines. Media uptake of this study was unknown; awareness of this uptake could help improve drug safety communication. OBJECTIVE: The aim of this study was to understand how the media present pharmacoepidemiological research using the CNODES isotretinoin study as a case study. METHODS: Google News was searched (April 25-May 6, 2016), using a predefined set of terms, for mention of the CNODES study. In total, 26 articles and 3 CNODES publications (original article, press release, and podcast) were identified. The article texts were cleaned (eg, advertisements and links removed), and the podcast was transcribed. A dictionary of 1295 unique words was created using natural language processing (NLP) techniques (term frequency-inverse document frequency, Porter stemming, and stop-word filtering) to identify common words and phrases. Similarity between the articles and reference publications was calculated using Euclidian distance; articles were grouped using hierarchical agglomerative clustering. Nine readability scales were applied to measure text readability based on factors such as number of words, difficult words, syllables, sentence counts, and other textual metrics. RESULTS: The top 5 dictionary words were pregnancy (250 appearances), isotretinoin (220), study (209), drug (201), and women (185). Three distinct clusters were identified: Clusters 2 (5 articles) and 3 (4 articles) were from health-related websites and media, respectively; Cluster 1 (18 articles) contained largely media sources; 2 articles fell outside these clusters. Use of the term isotretinoin versus Accutane (a brand name of isotretinoin), discussion of pregnancy complications, and assignment of responsibility for guideline adherence varied between clusters. For example, the term pregnanc appeared most often in Clusters 1 (14.6 average times per article) and 2 (11.4) and relatively infrequently in Cluster 3 (1.8). Average readability for all articles was high (eg, Flesch-Kincaid, 13; Gunning Fog, 15; SMOG Index, 10; Coleman Liau Index, 15; Linsear Write Index, 13; and Text Standard, 13). Readability increased from Cluster 2 (Gunning Fog of 16.9) to 3 (12.2). It varied between clusters (average 13th-15th grade) but exceeded the recommended health information reading level (grade 6th to 8th), overall. CONCLUSIONS: Media interpretation of the CNODES study varied, with differences in synonym usage and areas of focus. All articles were written above the recommended health information reading level. Analyzing media using NLP techniques can help determine drug safety communication effectiveness. This project is important for understanding how drug safety studies are taken up and redistributed in the media.
format Online
Article
Text
id pubmed-6996767
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-69967672020-02-20 Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study Mohammadhassanzadeh, Hossein Sketris, Ingrid Traynor, Robyn Alexander, Susan Winquist, Brandace Stewart, Samuel Alan JMIR Form Res Original Paper BACKGROUND: Isotretinoin, for treating cystic acne, increases the risk of miscarriage and fetal abnormalities when taken during pregnancy. The Health Canada–approved product monograph for isotretinoin includes pregnancy prevention guidelines. A recent study by the Canadian Network for Observational Drug Effect Studies (CNODES) on the occurrence of pregnancy and pregnancy outcomes during isotretinoin therapy estimated poor adherence to these guidelines. Media uptake of this study was unknown; awareness of this uptake could help improve drug safety communication. OBJECTIVE: The aim of this study was to understand how the media present pharmacoepidemiological research using the CNODES isotretinoin study as a case study. METHODS: Google News was searched (April 25-May 6, 2016), using a predefined set of terms, for mention of the CNODES study. In total, 26 articles and 3 CNODES publications (original article, press release, and podcast) were identified. The article texts were cleaned (eg, advertisements and links removed), and the podcast was transcribed. A dictionary of 1295 unique words was created using natural language processing (NLP) techniques (term frequency-inverse document frequency, Porter stemming, and stop-word filtering) to identify common words and phrases. Similarity between the articles and reference publications was calculated using Euclidian distance; articles were grouped using hierarchical agglomerative clustering. Nine readability scales were applied to measure text readability based on factors such as number of words, difficult words, syllables, sentence counts, and other textual metrics. RESULTS: The top 5 dictionary words were pregnancy (250 appearances), isotretinoin (220), study (209), drug (201), and women (185). Three distinct clusters were identified: Clusters 2 (5 articles) and 3 (4 articles) were from health-related websites and media, respectively; Cluster 1 (18 articles) contained largely media sources; 2 articles fell outside these clusters. Use of the term isotretinoin versus Accutane (a brand name of isotretinoin), discussion of pregnancy complications, and assignment of responsibility for guideline adherence varied between clusters. For example, the term pregnanc appeared most often in Clusters 1 (14.6 average times per article) and 2 (11.4) and relatively infrequently in Cluster 3 (1.8). Average readability for all articles was high (eg, Flesch-Kincaid, 13; Gunning Fog, 15; SMOG Index, 10; Coleman Liau Index, 15; Linsear Write Index, 13; and Text Standard, 13). Readability increased from Cluster 2 (Gunning Fog of 16.9) to 3 (12.2). It varied between clusters (average 13th-15th grade) but exceeded the recommended health information reading level (grade 6th to 8th), overall. CONCLUSIONS: Media interpretation of the CNODES study varied, with differences in synonym usage and areas of focus. All articles were written above the recommended health information reading level. Analyzing media using NLP techniques can help determine drug safety communication effectiveness. This project is important for understanding how drug safety studies are taken up and redistributed in the media. JMIR Publications 2020-01-14 /pmc/articles/PMC6996767/ /pubmed/31934872 http://dx.doi.org/10.2196/13296 Text en ©Hossein Mohammadhassanzadeh, Ingrid Sketris, Robyn Traynor, Susan Alexander, Brandace Winquist, Samuel Alan Stewart. Originally published in JMIR Formative Research (http://formative.jmir.org), 14.01.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on http://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Mohammadhassanzadeh, Hossein
Sketris, Ingrid
Traynor, Robyn
Alexander, Susan
Winquist, Brandace
Stewart, Samuel Alan
Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study
title Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study
title_full Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study
title_fullStr Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study
title_full_unstemmed Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study
title_short Using Natural Language Processing to Examine the Uptake, Content, and Readability of Media Coverage of a Pan-Canadian Drug Safety Research Project: Cross-Sectional Observational Study
title_sort using natural language processing to examine the uptake, content, and readability of media coverage of a pan-canadian drug safety research project: cross-sectional observational study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6996767/
https://www.ncbi.nlm.nih.gov/pubmed/31934872
http://dx.doi.org/10.2196/13296
work_keys_str_mv AT mohammadhassanzadehhossein usingnaturallanguageprocessingtoexaminetheuptakecontentandreadabilityofmediacoverageofapancanadiandrugsafetyresearchprojectcrosssectionalobservationalstudy
AT sketrisingrid usingnaturallanguageprocessingtoexaminetheuptakecontentandreadabilityofmediacoverageofapancanadiandrugsafetyresearchprojectcrosssectionalobservationalstudy
AT traynorrobyn usingnaturallanguageprocessingtoexaminetheuptakecontentandreadabilityofmediacoverageofapancanadiandrugsafetyresearchprojectcrosssectionalobservationalstudy
AT alexandersusan usingnaturallanguageprocessingtoexaminetheuptakecontentandreadabilityofmediacoverageofapancanadiandrugsafetyresearchprojectcrosssectionalobservationalstudy
AT winquistbrandace usingnaturallanguageprocessingtoexaminetheuptakecontentandreadabilityofmediacoverageofapancanadiandrugsafetyresearchprojectcrosssectionalobservationalstudy
AT stewartsamuelalan usingnaturallanguageprocessingtoexaminetheuptakecontentandreadabilityofmediacoverageofapancanadiandrugsafetyresearchprojectcrosssectionalobservationalstudy