Cargando…
Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology
Geo Fossils-I is a synthetic image dataset used as a solution for resolving the limited availability of geological datasets intended for image classification and object detection on 2D images of geological outcrops. The Geo Fossils-I dataset was created to train a custom image classification model f...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10293944/ https://www.ncbi.nlm.nih.gov/pubmed/37383796 http://dx.doi.org/10.1016/j.dib.2023.109188 |
_version_ | 1785063093285421056 |
---|---|
author | Nathanail, Athanasios |
author_facet | Nathanail, Athanasios |
author_sort | Nathanail, Athanasios |
collection | PubMed |
description | Geo Fossils-I is a synthetic image dataset used as a solution for resolving the limited availability of geological datasets intended for image classification and object detection on 2D images of geological outcrops. The Geo Fossils-I dataset was created to train a custom image classification model for geological fossil identification and inspire additional work in generating synthetic geological data with Stable Diffusion models. The Geo Fossils-I dataset was generated through a custom training process and the fine-tuning of a pre-trained Stable Diffusion model. Stable Diffusion is an advanced text-to-image model that can create highly realistic images based on textual input. An effective technique for instructing Stable Diffusion on novel concepts is the application of Dreambooth, a specialized form of fine-tuning. Dreambooth was used to generate new images of fossils or to modify existing ones per the provided textual description. The Geo Fossils-I dataset contains six different fossil types present in geological outcrops, each one being characteristic of a particular depositional environment. The dataset contains a total of 1200 fossil images equally spread among different fossil types such as ammonites, belemnites, corals, crinoids, leaf fossils, and trilobites. This dataset is the first set within a series to be compiled aiming to enrich the available resources with respect to 2D outcrop images allowing geoscientists to progress in the field of automated interpretation of depositional environments. |
format | Online Article Text |
id | pubmed-10293944 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-102939442023-06-28 Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology Nathanail, Athanasios Data Brief Data Article Geo Fossils-I is a synthetic image dataset used as a solution for resolving the limited availability of geological datasets intended for image classification and object detection on 2D images of geological outcrops. The Geo Fossils-I dataset was created to train a custom image classification model for geological fossil identification and inspire additional work in generating synthetic geological data with Stable Diffusion models. The Geo Fossils-I dataset was generated through a custom training process and the fine-tuning of a pre-trained Stable Diffusion model. Stable Diffusion is an advanced text-to-image model that can create highly realistic images based on textual input. An effective technique for instructing Stable Diffusion on novel concepts is the application of Dreambooth, a specialized form of fine-tuning. Dreambooth was used to generate new images of fossils or to modify existing ones per the provided textual description. The Geo Fossils-I dataset contains six different fossil types present in geological outcrops, each one being characteristic of a particular depositional environment. The dataset contains a total of 1200 fossil images equally spread among different fossil types such as ammonites, belemnites, corals, crinoids, leaf fossils, and trilobites. This dataset is the first set within a series to be compiled aiming to enrich the available resources with respect to 2D outcrop images allowing geoscientists to progress in the field of automated interpretation of depositional environments. Elsevier 2023-04-27 /pmc/articles/PMC10293944/ /pubmed/37383796 http://dx.doi.org/10.1016/j.dib.2023.109188 Text en © 2023 The Author. Published by Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Data Article Nathanail, Athanasios Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology |
title | Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology |
title_full | Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology |
title_fullStr | Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology |
title_full_unstemmed | Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology |
title_short | Geo Fossils-I: A synthetic dataset of 2D fossil images for computer vision applications on geology |
title_sort | geo fossils-i: a synthetic dataset of 2d fossil images for computer vision applications on geology |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10293944/ https://www.ncbi.nlm.nih.gov/pubmed/37383796 http://dx.doi.org/10.1016/j.dib.2023.109188 |
work_keys_str_mv | AT nathanailathanasios geofossilsiasyntheticdatasetof2dfossilimagesforcomputervisionapplicationsongeology |