Cargando…

A curated dataset for data-driven turbulence modelling

The recent surge in machine learning augmented turbulence modelling is a promising approach for addressing the limitations of Reynolds-averaged Navier-Stokes (RANS) models. This work presents the development of the first open-source dataset, curated and structured for immediate use in machine learni...

Descripción completa

Detalles Bibliográficos
Autores principales: McConkey, Ryley, Yee, Eugene, Lien, Fue-Sang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484471/
https://www.ncbi.nlm.nih.gov/pubmed/34593810
http://dx.doi.org/10.1038/s41597-021-01034-2
_version_ 1784577325461929984
author McConkey, Ryley
Yee, Eugene
Lien, Fue-Sang
author_facet McConkey, Ryley
Yee, Eugene
Lien, Fue-Sang
author_sort McConkey, Ryley
collection PubMed
description The recent surge in machine learning augmented turbulence modelling is a promising approach for addressing the limitations of Reynolds-averaged Navier-Stokes (RANS) models. This work presents the development of the first open-source dataset, curated and structured for immediate use in machine learning augmented corrective turbulence closure modelling. The dataset features a variety of RANS simulations with matching direct numerical simulation (DNS) and large-eddy simulation (LES) data. Four turbulence models are selected to form the initial dataset: k-ε, k-ε-ϕ(t)-f, k-ω, and k-ω SST. The dataset consists of 29 cases per turbulence model, for several parametrically sweeping reference DNS/LES cases: periodic hills, square duct, parametric bumps, converging-diverging channel, and a curved backward-facing step. At each of the 895,640 points, various RANS features with DNS/LES labels are available. The feature set includes quantities used in current state-of-the-art models, and additional fields which enable the generation of new feature sets. The dataset reduces effort required to train, test, and benchmark new corrective RANS models. The dataset is available at 10.34740/kaggle/dsv/2637500.
format Online
Article
Text
id pubmed-8484471
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84844712021-10-12 A curated dataset for data-driven turbulence modelling McConkey, Ryley Yee, Eugene Lien, Fue-Sang Sci Data Data Descriptor The recent surge in machine learning augmented turbulence modelling is a promising approach for addressing the limitations of Reynolds-averaged Navier-Stokes (RANS) models. This work presents the development of the first open-source dataset, curated and structured for immediate use in machine learning augmented corrective turbulence closure modelling. The dataset features a variety of RANS simulations with matching direct numerical simulation (DNS) and large-eddy simulation (LES) data. Four turbulence models are selected to form the initial dataset: k-ε, k-ε-ϕ(t)-f, k-ω, and k-ω SST. The dataset consists of 29 cases per turbulence model, for several parametrically sweeping reference DNS/LES cases: periodic hills, square duct, parametric bumps, converging-diverging channel, and a curved backward-facing step. At each of the 895,640 points, various RANS features with DNS/LES labels are available. The feature set includes quantities used in current state-of-the-art models, and additional fields which enable the generation of new feature sets. The dataset reduces effort required to train, test, and benchmark new corrective RANS models. The dataset is available at 10.34740/kaggle/dsv/2637500. Nature Publishing Group UK 2021-09-30 /pmc/articles/PMC8484471/ /pubmed/34593810 http://dx.doi.org/10.1038/s41597-021-01034-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) applies to the metadata files associated with this article.
spellingShingle Data Descriptor
McConkey, Ryley
Yee, Eugene
Lien, Fue-Sang
A curated dataset for data-driven turbulence modelling
title A curated dataset for data-driven turbulence modelling
title_full A curated dataset for data-driven turbulence modelling
title_fullStr A curated dataset for data-driven turbulence modelling
title_full_unstemmed A curated dataset for data-driven turbulence modelling
title_short A curated dataset for data-driven turbulence modelling
title_sort curated dataset for data-driven turbulence modelling
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484471/
https://www.ncbi.nlm.nih.gov/pubmed/34593810
http://dx.doi.org/10.1038/s41597-021-01034-2
work_keys_str_mv AT mcconkeyryley acurateddatasetfordatadriventurbulencemodelling
AT yeeeugene acurateddatasetfordatadriventurbulencemodelling
AT lienfuesang acurateddatasetfordatadriventurbulencemodelling
AT mcconkeyryley curateddatasetfordatadriventurbulencemodelling
AT yeeeugene curateddatasetfordatadriventurbulencemodelling
AT lienfuesang curateddatasetfordatadriventurbulencemodelling