Cargando…

The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap

Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller dataset...

Descripción completa

Detalles Bibliográficos
Autores principales: Mann, Richard P., Mushtaq, Faisal, White, Alan D., Mata-Cervantes, Gabriel, Pike, Tom, Coker, Dalton, Murdoch, Stuart, Hiles, Tim, Smith, Clare, Berridge, David, Hinchliffe, Suzanne, Hall, Geoff, Smye, Stephen, Wilkie, Richard M., Lodge, J. Peter A., Mon-Williams, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5130981/
https://www.ncbi.nlm.nih.gov/pubmed/27990415
http://dx.doi.org/10.3389/fpubh.2016.00248
Descripción
Sumario:Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller datasets to illustrate to the public how big data can produce tangible benefits in the long term. In order to highlight the immediate value of a small data approach, we produced a proof-of-concept model predicting hospital length of stay. The results demonstrate that existing small datasets can be used to create models that generate a reasonable prediction, facilitating health-care delivery. We propose that greater attention (and funding) needs to be directed toward the utilization of existing information resources in parallel with current efforts to create and exploit “big data.”