Cargando…

Query sampler: generating query sets for analyzing search engines using keyword research tools

Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual u...

Descripción completa

Detalles Bibliográficos
Autores principales: Schultheiß, Sebastian, Lewandowski, Dirk, von Mach, Sonja, Yagci, Nurce
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280501/
https://www.ncbi.nlm.nih.gov/pubmed/37346534
http://dx.doi.org/10.7717/peerj-cs.1421
_version_ 1785060808202387456
author Schultheiß, Sebastian
Lewandowski, Dirk
von Mach, Sonja
Yagci, Nurce
author_facet Schultheiß, Sebastian
Lewandowski, Dirk
von Mach, Sonja
Yagci, Nurce
author_sort Schultheiß, Sebastian
collection PubMed
description Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual user behavior. Therefore, the evidential value of the studies must be questioned. We address this problem by developing an approach (query sampler) to sample queries from commercial search engines, using keyword research tools designed to support search engine marketing. This allows us to generate large numbers of queries related to a given topic and derive information on how often each keyword is searched for, that is, the query volume. We empirically test our approach with queries from two published studies, and the results show that the number of queries and total search volume could be considerably expanded. Our approach has a wide range of applications for studies that seek to draw conclusions about social phenomena using search engine queries. The approach can be applied flexibly to different topics and is relatively straightforward to implement, as we provide the code for querying Google Ads API. Limitations are that the approach needs to be tested with a broader range of topics and thoroughly checked for problems with topic drift and the role of close variants provided by keyword research tools.
format Online
Article
Text
id pubmed-10280501
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-102805012023-06-21 Query sampler: generating query sets for analyzing search engines using keyword research tools Schultheiß, Sebastian Lewandowski, Dirk von Mach, Sonja Yagci, Nurce PeerJ Comput Sci Human-Computer Interaction Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual user behavior. Therefore, the evidential value of the studies must be questioned. We address this problem by developing an approach (query sampler) to sample queries from commercial search engines, using keyword research tools designed to support search engine marketing. This allows us to generate large numbers of queries related to a given topic and derive information on how often each keyword is searched for, that is, the query volume. We empirically test our approach with queries from two published studies, and the results show that the number of queries and total search volume could be considerably expanded. Our approach has a wide range of applications for studies that seek to draw conclusions about social phenomena using search engine queries. The approach can be applied flexibly to different topics and is relatively straightforward to implement, as we provide the code for querying Google Ads API. Limitations are that the approach needs to be tested with a broader range of topics and thoroughly checked for problems with topic drift and the role of close variants provided by keyword research tools. PeerJ Inc. 2023-06-07 /pmc/articles/PMC10280501/ /pubmed/37346534 http://dx.doi.org/10.7717/peerj-cs.1421 Text en ©2023 Schultheiß et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Human-Computer Interaction
Schultheiß, Sebastian
Lewandowski, Dirk
von Mach, Sonja
Yagci, Nurce
Query sampler: generating query sets for analyzing search engines using keyword research tools
title Query sampler: generating query sets for analyzing search engines using keyword research tools
title_full Query sampler: generating query sets for analyzing search engines using keyword research tools
title_fullStr Query sampler: generating query sets for analyzing search engines using keyword research tools
title_full_unstemmed Query sampler: generating query sets for analyzing search engines using keyword research tools
title_short Query sampler: generating query sets for analyzing search engines using keyword research tools
title_sort query sampler: generating query sets for analyzing search engines using keyword research tools
topic Human-Computer Interaction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280501/
https://www.ncbi.nlm.nih.gov/pubmed/37346534
http://dx.doi.org/10.7717/peerj-cs.1421
work_keys_str_mv AT schultheißsebastian querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools
AT lewandowskidirk querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools
AT vonmachsonja querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools
AT yagcinurce querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools