Cargando…
Using Benford’s law to investigate Natural Hazard dataset homogeneity
Working with a large temporal dataset spanning several decades often represents a challenging task, especially when the record is heterogeneous and incomplete. The use of statistical laws could potentially overcome these problems. Here we apply Benford’s Law (also called the “First-Digit Law”) to th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4496784/ https://www.ncbi.nlm.nih.gov/pubmed/26156060 http://dx.doi.org/10.1038/srep12046 |
_version_ | 1782380457792372736 |
---|---|
author | Joannes-Boyau, Renaud Bodin, Thomas Scheffers, Anja Sambridge, Malcolm May, Simon Matthias |
author_facet | Joannes-Boyau, Renaud Bodin, Thomas Scheffers, Anja Sambridge, Malcolm May, Simon Matthias |
author_sort | Joannes-Boyau, Renaud |
collection | PubMed |
description | Working with a large temporal dataset spanning several decades often represents a challenging task, especially when the record is heterogeneous and incomplete. The use of statistical laws could potentially overcome these problems. Here we apply Benford’s Law (also called the “First-Digit Law”) to the traveled distances of tropical cyclones since 1842. The record of tropical cyclones has been extensively impacted by improvements in detection capabilities over the past decades. We have found that, while the first-digit distribution for the entire record follows Benford’s Law prediction, specific changes such as satellite detection have had serious impacts on the dataset. The least-square misfit measure is used as a proxy to observe temporal variations, allowing us to assess data quality and homogeneity over the entire record, and at the same time over specific periods. Such information is crucial when running climatic models and Benford’s Law could potentially be used to overcome and correct for data heterogeneity and/or to select the most appropriate part of the record for detailed studies. |
format | Online Article Text |
id | pubmed-4496784 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-44967842015-07-13 Using Benford’s law to investigate Natural Hazard dataset homogeneity Joannes-Boyau, Renaud Bodin, Thomas Scheffers, Anja Sambridge, Malcolm May, Simon Matthias Sci Rep Article Working with a large temporal dataset spanning several decades often represents a challenging task, especially when the record is heterogeneous and incomplete. The use of statistical laws could potentially overcome these problems. Here we apply Benford’s Law (also called the “First-Digit Law”) to the traveled distances of tropical cyclones since 1842. The record of tropical cyclones has been extensively impacted by improvements in detection capabilities over the past decades. We have found that, while the first-digit distribution for the entire record follows Benford’s Law prediction, specific changes such as satellite detection have had serious impacts on the dataset. The least-square misfit measure is used as a proxy to observe temporal variations, allowing us to assess data quality and homogeneity over the entire record, and at the same time over specific periods. Such information is crucial when running climatic models and Benford’s Law could potentially be used to overcome and correct for data heterogeneity and/or to select the most appropriate part of the record for detailed studies. Nature Publishing Group 2015-07-09 /pmc/articles/PMC4496784/ /pubmed/26156060 http://dx.doi.org/10.1038/srep12046 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Joannes-Boyau, Renaud Bodin, Thomas Scheffers, Anja Sambridge, Malcolm May, Simon Matthias Using Benford’s law to investigate Natural Hazard dataset homogeneity |
title | Using Benford’s law to investigate Natural Hazard dataset homogeneity |
title_full | Using Benford’s law to investigate Natural Hazard dataset homogeneity |
title_fullStr | Using Benford’s law to investigate Natural Hazard dataset homogeneity |
title_full_unstemmed | Using Benford’s law to investigate Natural Hazard dataset homogeneity |
title_short | Using Benford’s law to investigate Natural Hazard dataset homogeneity |
title_sort | using benford’s law to investigate natural hazard dataset homogeneity |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4496784/ https://www.ncbi.nlm.nih.gov/pubmed/26156060 http://dx.doi.org/10.1038/srep12046 |
work_keys_str_mv | AT joannesboyaurenaud usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity AT bodinthomas usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity AT scheffersanja usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity AT sambridgemalcolm usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity AT maysimonmatthias usingbenfordslawtoinvestigatenaturalhazarddatasethomogeneity |