
Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake

Floods are the most common natural disaster in several countries throughout the world. Flooding has a major impact on people's lives and livelihoods. The impact of flood disasters on human lives can be mitigated by developing effective flood forecasting and prediction models. The majority of fl...


Bibliographic Details
Main authors: H V, Sreepathy; Rao, B Dinesh; J, Mohan Kumar; Rao, B Deepak
Format: Online Article Text
Language: English
Published: Elsevier 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10336497/
https://www.ncbi.nlm.nih.gov/pubmed/37448950
http://dx.doi.org/10.1016/j.mex.2023.102262
_version_ 1785071222103474176
author H V, Sreepathy
Rao, B Dinesh
J, Mohan Kumar
Rao, B Deepak
author_facet H V, Sreepathy
Rao, B Dinesh
J, Mohan Kumar
Rao, B Deepak
author_sort H V, Sreepathy
collection PubMed
description Floods are the most common natural disaster in several countries throughout the world. Flooding has a major impact on people's lives and livelihoods. The impact of flood disasters on human lives can be mitigated by developing effective flood forecasting and prediction models. The majority of flood prediction models do not take all flood-causing factors into account when they are designed. It is difficult to collect and handle some of these flood-causing variables since they are heterogeneous in nature. This paper presents a new big data architecture called Data Lake, which can ingest and store all important flood-causing heterogeneous data sources in their raw format for machine learning model creation. The statistical relevance of important flood producing factors on flood prediction outcome is determined utilizing inferential statistical approaches. The outcome of this research is to create flood warning systems that can alert the public and government officials so that they can make decisions in the event of a severe flood, reducing socioeconomic loss.
• Flood causing factors are from heterogeneous sources, so there is no big data architecture for handling variety of data sources.
• To provide data architectural solution using data lake for collecting and analysing heterogeneous flood causing factors.
• Uses inferential statistical approach to determine importance of different flood causing factors in design of efficient flood prediction models.
format Online
Article
Text
id pubmed-10336497
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-103364972023-07-13 Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake H V, Sreepathy Rao, B Dinesh J, Mohan Kumar Rao, B Deepak MethodsX Computer Science Floods are the most common natural disaster in several countries throughout the world. Flooding has a major impact on people's lives and livelihoods. The impact of flood disasters on human lives can be mitigated by developing effective flood forecasting and prediction models. The majority of flood prediction models do not take all flood-causing factors into account when they are designed. It is difficult to collect and handle some of these flood-causing variables since they are heterogeneous in nature. This paper presents a new big data architecture called Data Lake, which can ingest and store all important flood-causing heterogeneous data sources in their raw format for machine learning model creation. The statistical relevance of important flood producing factors on flood prediction outcome is determined utilizing inferential statistical approaches. The outcome of this research is to create flood warning systems that can alert the public and government officials so that they can make decisions in the event of a severe flood, reducing socioeconomic loss. • Flood causing factors are from heterogeneous sources, so there is no big data architecture for handling variety of data sources. • To provide data architectural solution using data lake for collecting and analysing heterogeneous flood causing factors. • Uses inferential statistical approach to determine importance of different flood causing factors in design of efficient flood prediction models. 
Elsevier 2023-06-22 /pmc/articles/PMC10336497/ /pubmed/37448950 http://dx.doi.org/10.1016/j.mex.2023.102262 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Computer Science
H V, Sreepathy
Rao, B Dinesh
J, Mohan Kumar
Rao, B Deepak
Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
title Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
title_full Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
title_fullStr Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
title_full_unstemmed Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
title_short Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
title_sort design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using data lake
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10336497/
https://www.ncbi.nlm.nih.gov/pubmed/37448950
http://dx.doi.org/10.1016/j.mex.2023.102262