Cargando…

Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration

BACKGROUND: Internet search volume for medical information, as tracked by Google Trends, has been used to demonstrate unexpected seasonality in the symptom burden of a variety of medical conditions. However, when more technical medical language is used (eg, diagnoses), we believe that this technique...

Descripción completa

Detalles Bibliográficos
Autores principales: Gillis, Timber, Garrison, Scott
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9987186/
https://www.ncbi.nlm.nih.gov/pubmed/37113451
http://dx.doi.org/10.2196/34464
_version_ 1784901328766500864
author Gillis, Timber
Garrison, Scott
author_facet Gillis, Timber
Garrison, Scott
author_sort Gillis, Timber
collection PubMed
description BACKGROUND: Internet search volume for medical information, as tracked by Google Trends, has been used to demonstrate unexpected seasonality in the symptom burden of a variety of medical conditions. However, when more technical medical language is used (eg, diagnoses), we believe that this technique is confounded by the cyclic, school year–driven internet search patterns of health care students. OBJECTIVE: This study aimed to (1) demonstrate that artificial “academic cycling” of Google Trends’ search volume is present in many health care terms, (2) demonstrate how signal processing techniques can be used to filter academic cycling out of Google Trends data, and (3) apply this filtering technique to some clinically relevant examples. METHODS: We obtained the Google Trends search volume data for a variety of academic terms demonstrating strong academic cycling and used a Fourier analysis technique to (1) identify the frequency domain fingerprint of this modulating pattern in one particularly strong example, and (2) filter that pattern out of the original data. After this illustrative example, we then applied the same filtering technique to internet searches for information on 3 medical conditions believed to have true seasonal modulation (myocardial infarction, hypertension, and depression), and all bacterial genus terms within a common medical microbiology textbook. RESULTS: Academic cycling explains much of the seasonal variation in internet search volume for many technically oriented search terms, including the bacterial genus term [“Staphylococcus”], for which academic cycling explained 73.8% of the variability in search volume (using the squared Spearman rank correlation coefficient, P<.001). Of the 56 bacterial genus terms examined, 6 displayed sufficiently strong seasonality to warrant further examination post filtering. This included (1) [“Aeromonas” + “Plesiomonas”] (nosocomial infections that were searched for more frequently during the summer), (2) [“Ehrlichia”] (a tick-borne pathogen that was searched for more frequently during late spring), (3) [“Moraxella”] and [“Haemophilus”] (respiratory infections that were searched for more frequently during late winter), (4) [“Legionella”] (searched for more frequently during midsummer), and (5) [“Vibrio”] (which spiked for 2 months during midsummer). The terms [“myocardial infarction”] and [“hypertension”] lacked any obvious seasonal cycling after filtering, whereas [“depression”] maintained an annual cycling pattern. CONCLUSIONS: Although it is reasonable to search for seasonal modulation of medical conditions using Google Trends’ internet search volume and lay-appropriate search terms, the variation in more technical search terms may be driven by health care students whose search frequency varies with the academic school year. When this is the case, using Fourier analysis to filter out academic cycling is a potential means to establish whether additional seasonality is present.
format Online
Article
Text
id pubmed-9987186
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-99871862023-04-26 Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration Gillis, Timber Garrison, Scott JMIR Infodemiology Original Paper BACKGROUND: Internet search volume for medical information, as tracked by Google Trends, has been used to demonstrate unexpected seasonality in the symptom burden of a variety of medical conditions. However, when more technical medical language is used (eg, diagnoses), we believe that this technique is confounded by the cyclic, school year–driven internet search patterns of health care students. OBJECTIVE: This study aimed to (1) demonstrate that artificial “academic cycling” of Google Trends’ search volume is present in many health care terms, (2) demonstrate how signal processing techniques can be used to filter academic cycling out of Google Trends data, and (3) apply this filtering technique to some clinically relevant examples. METHODS: We obtained the Google Trends search volume data for a variety of academic terms demonstrating strong academic cycling and used a Fourier analysis technique to (1) identify the frequency domain fingerprint of this modulating pattern in one particularly strong example, and (2) filter that pattern out of the original data. After this illustrative example, we then applied the same filtering technique to internet searches for information on 3 medical conditions believed to have true seasonal modulation (myocardial infarction, hypertension, and depression), and all bacterial genus terms within a common medical microbiology textbook. RESULTS: Academic cycling explains much of the seasonal variation in internet search volume for many technically oriented search terms, including the bacterial genus term [“Staphylococcus”], for which academic cycling explained 73.8% of the variability in search volume (using the squared Spearman rank correlation coefficient, P<.001). Of the 56 bacterial genus terms examined, 6 displayed sufficiently strong seasonality to warrant further examination post filtering. This included (1) [“Aeromonas” + “Plesiomonas”] (nosocomial infections that were searched for more frequently during the summer), (2) [“Ehrlichia”] (a tick-borne pathogen that was searched for more frequently during late spring), (3) [“Moraxella”] and [“Haemophilus”] (respiratory infections that were searched for more frequently during late winter), (4) [“Legionella”] (searched for more frequently during midsummer), and (5) [“Vibrio”] (which spiked for 2 months during midsummer). The terms [“myocardial infarction”] and [“hypertension”] lacked any obvious seasonal cycling after filtering, whereas [“depression”] maintained an annual cycling pattern. CONCLUSIONS: Although it is reasonable to search for seasonal modulation of medical conditions using Google Trends’ internet search volume and lay-appropriate search terms, the variation in more technical search terms may be driven by health care students whose search frequency varies with the academic school year. When this is the case, using Fourier analysis to filter out academic cycling is a potential means to establish whether additional seasonality is present. JMIR Publications 2022-07-19 /pmc/articles/PMC9987186/ /pubmed/37113451 http://dx.doi.org/10.2196/34464 Text en ©Timber Gillis, Scott Garrison. Originally published in JMIR Infodemiology (https://infodemiology.jmir.org), 19.07.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Infodemiology, is properly cited. The complete bibliographic information, a link to the original publication on https://infodemiology.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Gillis, Timber
Garrison, Scott
Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration
title Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration
title_full Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration
title_fullStr Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration
title_full_unstemmed Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration
title_short Confounding Effect of Undergraduate Semester–Driven “Academic" Internet Searches on the Ability to Detect True Disease Seasonality in Google Trends Data: Fourier Filter Method Development and Demonstration
title_sort confounding effect of undergraduate semester–driven “academic" internet searches on the ability to detect true disease seasonality in google trends data: fourier filter method development and demonstration
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9987186/
https://www.ncbi.nlm.nih.gov/pubmed/37113451
http://dx.doi.org/10.2196/34464
work_keys_str_mv AT gillistimber confoundingeffectofundergraduatesemesterdrivenacademicinternetsearchesontheabilitytodetecttruediseaseseasonalityingoogletrendsdatafourierfiltermethoddevelopmentanddemonstration
AT garrisonscott confoundingeffectofundergraduatesemesterdrivenacademicinternetsearchesontheabilitytodetecttruediseaseseasonalityingoogletrendsdatafourierfiltermethoddevelopmentanddemonstration