Cargando…
Query sampler: generating query sets for analyzing search engines using keyword research tools
Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual u...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280501/ https://www.ncbi.nlm.nih.gov/pubmed/37346534 http://dx.doi.org/10.7717/peerj-cs.1421 |
_version_ | 1785060808202387456 |
---|---|
author | Schultheiß, Sebastian Lewandowski, Dirk von Mach, Sonja Yagci, Nurce |
author_facet | Schultheiß, Sebastian Lewandowski, Dirk von Mach, Sonja Yagci, Nurce |
author_sort | Schultheiß, Sebastian |
collection | PubMed |
description | Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual user behavior. Therefore, the evidential value of the studies must be questioned. We address this problem by developing an approach (query sampler) to sample queries from commercial search engines, using keyword research tools designed to support search engine marketing. This allows us to generate large numbers of queries related to a given topic and derive information on how often each keyword is searched for, that is, the query volume. We empirically test our approach with queries from two published studies, and the results show that the number of queries and total search volume could be considerably expanded. Our approach has a wide range of applications for studies that seek to draw conclusions about social phenomena using search engine queries. The approach can be applied flexibly to different topics and is relatively straightforward to implement, as we provide the code for querying Google Ads API. Limitations are that the approach needs to be tested with a broader range of topics and thoroughly checked for problems with topic drift and the role of close variants provided by keyword research tools. |
format | Online Article Text |
id | pubmed-10280501 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102805012023-06-21 Query sampler: generating query sets for analyzing search engines using keyword research tools Schultheiß, Sebastian Lewandowski, Dirk von Mach, Sonja Yagci, Nurce PeerJ Comput Sci Human-Computer Interaction Search engine queries are the starting point for studies in different fields, such as health or political science. These studies usually aim to make statements about social phenomena. However, the queries used in the studies are often created rather unsystematically and do not correspond to actual user behavior. Therefore, the evidential value of the studies must be questioned. We address this problem by developing an approach (query sampler) to sample queries from commercial search engines, using keyword research tools designed to support search engine marketing. This allows us to generate large numbers of queries related to a given topic and derive information on how often each keyword is searched for, that is, the query volume. We empirically test our approach with queries from two published studies, and the results show that the number of queries and total search volume could be considerably expanded. Our approach has a wide range of applications for studies that seek to draw conclusions about social phenomena using search engine queries. The approach can be applied flexibly to different topics and is relatively straightforward to implement, as we provide the code for querying Google Ads API. Limitations are that the approach needs to be tested with a broader range of topics and thoroughly checked for problems with topic drift and the role of close variants provided by keyword research tools. PeerJ Inc. 2023-06-07 /pmc/articles/PMC10280501/ /pubmed/37346534 http://dx.doi.org/10.7717/peerj-cs.1421 Text en ©2023 Schultheiß et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Human-Computer Interaction Schultheiß, Sebastian Lewandowski, Dirk von Mach, Sonja Yagci, Nurce Query sampler: generating query sets for analyzing search engines using keyword research tools |
title | Query sampler: generating query sets for analyzing search engines using keyword research tools |
title_full | Query sampler: generating query sets for analyzing search engines using keyword research tools |
title_fullStr | Query sampler: generating query sets for analyzing search engines using keyword research tools |
title_full_unstemmed | Query sampler: generating query sets for analyzing search engines using keyword research tools |
title_short | Query sampler: generating query sets for analyzing search engines using keyword research tools |
title_sort | query sampler: generating query sets for analyzing search engines using keyword research tools |
topic | Human-Computer Interaction |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280501/ https://www.ncbi.nlm.nih.gov/pubmed/37346534 http://dx.doi.org/10.7717/peerj-cs.1421 |
work_keys_str_mv | AT schultheißsebastian querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools AT lewandowskidirk querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools AT vonmachsonja querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools AT yagcinurce querysamplergeneratingquerysetsforanalyzingsearchenginesusingkeywordresearchtools |