Cargando…

How to make causal inferences using texts

Text as data techniques offer a great promise: the ability to inductively discover measures that are useful for testing social science theories with large collections of text. Nearly all text-based causal inferences depend on a latent representation of the text, but we show that estimating this late...

Descripción completa

Detalles Bibliográficos
Autores principales: Egami, Naoki, Fong, Christian J., Grimmer, Justin, Roberts, Margaret E., Stewart, Brandon M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581481/
https://www.ncbi.nlm.nih.gov/pubmed/36260669
http://dx.doi.org/10.1126/sciadv.abg2652
Descripción
Sumario:Text as data techniques offer a great promise: the ability to inductively discover measures that are useful for testing social science theories with large collections of text. Nearly all text-based causal inferences depend on a latent representation of the text, but we show that estimating this latent representation from the data creates underacknowledged risks: we may introduce an identification problem or overfit. To address these risks, we introduce a split-sample workflow for making rigorous causal inferences with discovered measures as treatments or outcomes. We then apply it to estimate causal effects from an experiment on immigration attitudes and a study on bureaucratic responsiveness.