
The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap

Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller dataset...


Bibliographic Details
Main authors: Mann, Richard P., Mushtaq, Faisal, White, Alan D., Mata-Cervantes, Gabriel, Pike, Tom, Coker, Dalton, Murdoch, Stuart, Hiles, Tim, Smith, Clare, Berridge, David, Hinchliffe, Suzanne, Hall, Geoff, Smye, Stephen, Wilkie, Richard M., Lodge, J. Peter A., Mon-Williams, Mark
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2016
Subjects: Public Health
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5130981/
https://www.ncbi.nlm.nih.gov/pubmed/27990415
http://dx.doi.org/10.3389/fpubh.2016.00248
author Mann, Richard P.
Mushtaq, Faisal
White, Alan D.
Mata-Cervantes, Gabriel
Pike, Tom
Coker, Dalton
Murdoch, Stuart
Hiles, Tim
Smith, Clare
Berridge, David
Hinchliffe, Suzanne
Hall, Geoff
Smye, Stephen
Wilkie, Richard M.
Lodge, J. Peter A.
Mon-Williams, Mark
author_sort Mann, Richard P.
collection PubMed
description Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller datasets to illustrate to the public how big data can produce tangible benefits in the long term. In order to highlight the immediate value of a small data approach, we produced a proof-of-concept model predicting hospital length of stay. The results demonstrate that existing small datasets can be used to create models that generate a reasonable prediction, facilitating health-care delivery. We propose that greater attention (and funding) needs to be directed toward the utilization of existing information resources in parallel with current efforts to create and exploit “big data.”
format Online
Article
Text
id pubmed-5130981
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-51309812016-12-16 The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap Mann, Richard P. Mushtaq, Faisal White, Alan D. Mata-Cervantes, Gabriel Pike, Tom Coker, Dalton Murdoch, Stuart Hiles, Tim Smith, Clare Berridge, David Hinchliffe, Suzanne Hall, Geoff Smye, Stephen Wilkie, Richard M. Lodge, J. Peter A. Mon-Williams, Mark Front Public Health Public Health Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller datasets to illustrate to the public how big data can produce tangible benefits in the long term. In order to highlight the immediate value of a small data approach, we produced a proof-of-concept model predicting hospital length of stay. The results demonstrate that existing small datasets can be used to create models that generate a reasonable prediction, facilitating health-care delivery. We propose that greater attention (and funding) needs to be directed toward the utilization of existing information resources in parallel with current efforts to create and exploit “big data.” Frontiers Media S.A. 2016-12-01 /pmc/articles/PMC5130981/ /pubmed/27990415 http://dx.doi.org/10.3389/fpubh.2016.00248 Text en Copyright © 2016 Mann, Mushtaq, White, Mata-Cervantes, Pike, Coker, Murdoch, Hiles, Smith, Berridge, Hinchliffe, Hall, Smye, Wilkie, Lodge and Mon-Williams. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. 
No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Mann, Richard P.
Mushtaq, Faisal
White, Alan D.
Mata-Cervantes, Gabriel
Pike, Tom
Coker, Dalton
Murdoch, Stuart
Hiles, Tim
Smith, Clare
Berridge, David
Hinchliffe, Suzanne
Hall, Geoff
Smye, Stephen
Wilkie, Richard M.
Lodge, J. Peter A.
Mon-Williams, Mark
The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap
title The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap
title_sort problem with big data: operating on smaller datasets to bridge the implementation gap
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5130981/
https://www.ncbi.nlm.nih.gov/pubmed/27990415
http://dx.doi.org/10.3389/fpubh.2016.00248