Cargando…

Using Benford’s law to investigate Natural Hazard dataset homogeneity

Working with a large temporal dataset spanning several decades often represents a challenging task, especially when the record is heterogeneous and incomplete. The use of statistical laws could potentially overcome these problems. Here we apply Benford’s Law (also called the “First-Digit Law”) to th...

Descripción completa

Detalles Bibliográficos
Autores principales: Joannes-Boyau, Renaud, Bodin, Thomas, Scheffers, Anja, Sambridge, Malcolm, May, Simon Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4496784/
https://www.ncbi.nlm.nih.gov/pubmed/26156060
http://dx.doi.org/10.1038/srep12046
_version_ 1782380457792372736
author Joannes-Boyau, Renaud
Bodin, Thomas
Scheffers, Anja
Sambridge, Malcolm
May, Simon Matthias
author_facet Joannes-Boyau, Renaud
Bodin, Thomas
Scheffers, Anja
Sambridge, Malcolm
May, Simon Matthias
author_sort Joannes-Boyau, Renaud
collection PubMed
description Working with a large temporal dataset spanning several decades often represents a challenging task, especially when the record is heterogeneous and incomplete. The use of statistical laws could potentially overcome these problems. Here we apply Benford’s Law (also called the “First-Digit Law”) to the traveled distances of tropical cyclones since 1842. The record of tropical cyclones has been extensively impacted by improvements in detection capabilities over the past decades. We have found that, while the first-digit distribution for the entire record follows Benford’s Law prediction, specific changes such as satellite detection have had serious impacts on the dataset. The least-square misfit measure is used as a proxy to observe temporal variations, allowing us to assess data quality and homogeneity over the entire record, and at the same time over specific periods. Such information is crucial when running climatic models and Benford’s Law could potentially be used to overcome and correct for data heterogeneity and/or to select the most appropriate part of the record for detailed studies.
format Online
Article
Text
id pubmed-4496784
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-44967842015-07-13 Using Benford’s law to investigate Natural Hazard dataset homogeneity Joannes-Boyau, Renaud Bodin, Thomas Scheffers, Anja Sambridge, Malcolm May, Simon Matthias Sci Rep Article Working with a large temporal dataset spanning several decades often represents a challenging task, especially when the record is heterogeneous and incomplete. The use of statistical laws could potentially overcome these problems. Here we apply Benford’s Law (also called the “First-Digit Law”) to the traveled distances of tropical cyclones since 1842. The record of tropical cyclones has been extensively impacted by improvements in detection capabilities over the past decades. We have found that, while the first-digit distribution for the entire record follows Benford’s Law prediction, specific changes such as satellite detection have had serious impacts on the dataset. The least-square misfit measure is used as a proxy to observe temporal variations, allowing us to assess data quality and homogeneity over the entire record, and at the same time over specific periods. Such information is crucial when running climatic models and Benford’s Law could potentially be used to overcome and correct for data heterogeneity and/or to select the most appropriate part of the record for detailed studies. Nature Publishing Group 2015-07-09 /pmc/articles/PMC4496784/ /pubmed/26156060 http://dx.doi.org/10.1038/srep12046 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Joannes-Boyau, Renaud
Bodin, Thomas
Scheffers, Anja
Sambridge, Malcolm
May, Simon Matthias
Using Benford’s law to investigate Natural Hazard dataset homogeneity
title Using Benford’s law to investigate Natural Hazard dataset homogeneity
title_full Using Benford’s law to investigate Natural Hazard dataset homogeneity
title_fullStr Using Benford’s law to investigate Natural Hazard dataset homogeneity
title_full_unstemmed Using Benford’s law to investigate Natural Hazard dataset homogeneity
title_short Using Benford’s law to investigate Natural Hazard dataset homogeneity
title_sort using benford’s law to investigate natural hazard dataset homogeneity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4496784/
https://www.ncbi.nlm.nih.gov/pubmed/26156060
http://dx.doi.org/10.1038/srep12046
work_keys_str_mv AT joannesboyaurenaud usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity
AT bodinthomas usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity
AT scheffersanja usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity
AT sambridgemalcolm usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity
AT maysimonmatthias usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity