Cargando…

PHYSTAT seminar: From COVID-19 Testing to Election Prediction: How Small Are Our Big Data?

<!--HTML--><p>The term “Big Data” emphasizes data quantity, not quality. What will be the effective sample size when we take into account the deterioration of data quality because of, for example, the selection bias in COVID-19 testing or the non-response bias in 2016&nbsp;US Electio...

Descripción completa

Detalles Bibliográficos
Autor principal: Meng, Xiao-Li
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:http://cds.cern.ch/record/2776121
Descripción
Sumario:<!--HTML--><p>The term “Big Data” emphasizes data quantity, not quality. What will be the effective sample size when we take into account the deterioration of data quality because of, for example, the selection bias in COVID-19 testing or the non-response bias in 2016&nbsp;US Election polling results? This talk provides an answer to such questions, based on the concept of data defect index (ddi) developed in [1].&nbsp;It will also discuss briefly the application of ddi for 2020 US Election, as reported in [2].</p> <p><span><span><span><span><span><span><span><span><span><span><span><span><span><span><span style="color:#ffffff"><span><span><span style="color:black">[1]</span></span></span>&nbsp;</span><span style="color:#000000">Xiao-Li Meng (2018) “Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox,&nbsp;</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span><span style="color:#000000">and the 2016 US presidential election”, Annals of Applied Statistics&nbsp;<strong>12</strong>&nbsp;685.</span></p> <p><span><span><span><span><span><span><span><span><span><span><span><span><span><span><span style="color:#ffffff"><span><span><span style="color:black">[2] M. Isakov and S. Kuriwaki (2020) “Towards Principled Unskewing: Viewing 2020 Election Polls Through a Corrective Lens from 2016”, Harvard Data Science Review,&nbsp;<a href="https://hdsr.mitpress.mit.edu/pub/cnxbwum6/release/3">https://hdsr.mitpress.mit.edu/pub/cnxbwum6/release/3</a></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></p> <p><em><span><span><span><span><span><span><span><span><span><span><span><span><span><span><span style="color:#ffffff"><span style="color:black">&nbsp;Xiao-Li Meng is Professor of Statistics at Harvard. He was Chair of their Statistics Department, and is</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span><span><span><span><span><span><span><span><span><span><span><span><span><span><span><span style="color:#ffffff"><span style="color:black">&nbsp;the Founding Editor in Chief of the Harvard Data Science Review. His research interests include ‘Statistical</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span><span><span><span><span><span><span><span><span><span><span><span><span><span><span><span style="color:#ffffff"><span style="color:black">&nbsp;theory for data science’; and ‘Signal extraction and uncertainty estimates’. He is a very popular lecturer.</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></em></p> EP/IT Data Science Seminar: https://cern.zoom.us/j/98545267593?pwd=akZWdmlyK01zbjFQa0x2c2ZXWW9ydz09