Cargando…

The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media

We examine gender bias in media by tallying the number of men and women quoted in news text, using the Gender Gap Tracker, a software system we developed specifically for this purpose. The Gender Gap Tracker downloads and analyzes the online daily publication of seven English-language Canadian news...

Descripción completa

Detalles Bibliográficos
Autores principales: Asr, Fatemeh Torabi, Mazraeh, Mohammad, Lopes, Alexandre, Gautam, Vasundhara, Gonzales, Junette, Rao, Prashanth, Taboada, Maite
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7845988/
https://www.ncbi.nlm.nih.gov/pubmed/33513175
http://dx.doi.org/10.1371/journal.pone.0245533
_version_ 1783644657375248384
author Asr, Fatemeh Torabi
Mazraeh, Mohammad
Lopes, Alexandre
Gautam, Vasundhara
Gonzales, Junette
Rao, Prashanth
Taboada, Maite
author_facet Asr, Fatemeh Torabi
Mazraeh, Mohammad
Lopes, Alexandre
Gautam, Vasundhara
Gonzales, Junette
Rao, Prashanth
Taboada, Maite
author_sort Asr, Fatemeh Torabi
collection PubMed
description We examine gender bias in media by tallying the number of men and women quoted in news text, using the Gender Gap Tracker, a software system we developed specifically for this purpose. The Gender Gap Tracker downloads and analyzes the online daily publication of seven English-language Canadian news outlets and enhances the data with multiple layers of linguistic information. We describe the Natural Language Processing technology behind this system, the curation of off-the-shelf tools and resources that we used to build it, and the parts that we developed. We evaluate the system in each language processing task and report errors using real-world examples. Finally, by applying the Tracker to the data, we provide valuable insights about the proportion of people mentioned and quoted, by gender, news organization, and author gender. Data collected between October 1, 2018 and September 30, 2020 shows that, in general, men are quoted about three times as frequently as women. While this proportion varies across news outlets and time intervals, the general pattern is consistent. We believe that, in a world with about 50% women, this should not be the case. Although journalists naturally need to quote newsmakers who are men, they also have a certain amount of control over who they approach as sources. The Gender Gap Tracker relies on the same principles as fitness or goal-setting trackers: By quantifying and measuring regular progress, we hope to motivate news organizations to provide a more diverse set of voices in their reporting.
format Online
Article
Text
id pubmed-7845988
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-78459882021-02-04 The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media Asr, Fatemeh Torabi Mazraeh, Mohammad Lopes, Alexandre Gautam, Vasundhara Gonzales, Junette Rao, Prashanth Taboada, Maite PLoS One Research Article We examine gender bias in media by tallying the number of men and women quoted in news text, using the Gender Gap Tracker, a software system we developed specifically for this purpose. The Gender Gap Tracker downloads and analyzes the online daily publication of seven English-language Canadian news outlets and enhances the data with multiple layers of linguistic information. We describe the Natural Language Processing technology behind this system, the curation of off-the-shelf tools and resources that we used to build it, and the parts that we developed. We evaluate the system in each language processing task and report errors using real-world examples. Finally, by applying the Tracker to the data, we provide valuable insights about the proportion of people mentioned and quoted, by gender, news organization, and author gender. Data collected between October 1, 2018 and September 30, 2020 shows that, in general, men are quoted about three times as frequently as women. While this proportion varies across news outlets and time intervals, the general pattern is consistent. We believe that, in a world with about 50% women, this should not be the case. Although journalists naturally need to quote newsmakers who are men, they also have a certain amount of control over who they approach as sources. The Gender Gap Tracker relies on the same principles as fitness or goal-setting trackers: By quantifying and measuring regular progress, we hope to motivate news organizations to provide a more diverse set of voices in their reporting. Public Library of Science 2021-01-29 /pmc/articles/PMC7845988/ /pubmed/33513175 http://dx.doi.org/10.1371/journal.pone.0245533 Text en © 2021 Asr et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Asr, Fatemeh Torabi
Mazraeh, Mohammad
Lopes, Alexandre
Gautam, Vasundhara
Gonzales, Junette
Rao, Prashanth
Taboada, Maite
The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
title The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
title_full The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
title_fullStr The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
title_full_unstemmed The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
title_short The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
title_sort gender gap tracker: using natural language processing to measure gender bias in media
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7845988/
https://www.ncbi.nlm.nih.gov/pubmed/33513175
http://dx.doi.org/10.1371/journal.pone.0245533
work_keys_str_mv AT asrfatemehtorabi thegendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT mazraehmohammad thegendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT lopesalexandre thegendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT gautamvasundhara thegendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT gonzalesjunette thegendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT raoprashanth thegendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT taboadamaite thegendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT asrfatemehtorabi gendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT mazraehmohammad gendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT lopesalexandre gendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT gautamvasundhara gendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT gonzalesjunette gendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT raoprashanth gendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia
AT taboadamaite gendergaptrackerusingnaturallanguageprocessingtomeasuregenderbiasinmedia