Cargando…

ACHENY: A standard Chenopodiaceae image dataset for deep learning models

This paper contains datasets related to the “Efficient Deep Learning Models for Categorizing Chenopodiaceae in the wild” (Heidary-Sharifabad et al., 2021). There are about 1500 species of Chenopodiaceae that are spread worldwide and often are ecologically important. Biodiversity conservation of thes...

Descripción completa

Detalles Bibliográficos
Autores principales: Heidary-Sharifabad, Ahmad, Zarchi, Mohsen Sardari, Emadi, Sima, Zarei, Gholamreza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8529081/
https://www.ncbi.nlm.nih.gov/pubmed/34712755
http://dx.doi.org/10.1016/j.dib.2021.107478
_version_ 1784586391349362688
author Heidary-Sharifabad, Ahmad
Zarchi, Mohsen Sardari
Emadi, Sima
Zarei, Gholamreza
author_facet Heidary-Sharifabad, Ahmad
Zarchi, Mohsen Sardari
Emadi, Sima
Zarei, Gholamreza
author_sort Heidary-Sharifabad, Ahmad
collection PubMed
description This paper contains datasets related to the “Efficient Deep Learning Models for Categorizing Chenopodiaceae in the wild” (Heidary-Sharifabad et al., 2021). There are about 1500 species of Chenopodiaceae that are spread worldwide and often are ecologically important. Biodiversity conservation of these species is critical due to the destructive effects of human activities on them. For this purpose, identification and surveillance of Chenopodiaceae species in their natural habitat are necessary and can be facilitated by deep learning. The feasibility of applying deep learning algorithms to identify Chenopodiaceae species depends on access to the appropriate relevant dataset. Therefore, ACHENY dataset was collected from natural habitats of different bushes of Chenopodiaceae species, in real-world conditions from desert and semi-desert areas of the Yazd province of IRAN. This imbalanced dataset is compiled of 27,030 RGB color images from 30 Chenopodiaceae species, each species 300-1461 images. Imaging is performed from multiple bushes for each species, with different camera-to-target distances, viewpoints, angles, and natural sunlight in November and December. The collected images are not pre-processed, only are resized to 224 × 224 dimensions which can be used on some of the successful deep learning models and then were grouped into their respective class. The images in each class are separated by 10% for testing, 18% for validation, and 72% for training. Test images are often manually selected from plant bushes different from the training set. Then training and validation images are randomly separated from the remaining images in each category. The small-sized images with 64 × 64 dimensions also are included in ACHENY which can be used on some other deep models.
format Online
Article
Text
id pubmed-8529081
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-85290812021-10-27 ACHENY: A standard Chenopodiaceae image dataset for deep learning models Heidary-Sharifabad, Ahmad Zarchi, Mohsen Sardari Emadi, Sima Zarei, Gholamreza Data Brief Data Article This paper contains datasets related to the “Efficient Deep Learning Models for Categorizing Chenopodiaceae in the wild” (Heidary-Sharifabad et al., 2021). There are about 1500 species of Chenopodiaceae that are spread worldwide and often are ecologically important. Biodiversity conservation of these species is critical due to the destructive effects of human activities on them. For this purpose, identification and surveillance of Chenopodiaceae species in their natural habitat are necessary and can be facilitated by deep learning. The feasibility of applying deep learning algorithms to identify Chenopodiaceae species depends on access to the appropriate relevant dataset. Therefore, ACHENY dataset was collected from natural habitats of different bushes of Chenopodiaceae species, in real-world conditions from desert and semi-desert areas of the Yazd province of IRAN. This imbalanced dataset is compiled of 27,030 RGB color images from 30 Chenopodiaceae species, each species 300-1461 images. Imaging is performed from multiple bushes for each species, with different camera-to-target distances, viewpoints, angles, and natural sunlight in November and December. The collected images are not pre-processed, only are resized to 224 × 224 dimensions which can be used on some of the successful deep learning models and then were grouped into their respective class. The images in each class are separated by 10% for testing, 18% for validation, and 72% for training. Test images are often manually selected from plant bushes different from the training set. Then training and validation images are randomly separated from the remaining images in each category. The small-sized images with 64 × 64 dimensions also are included in ACHENY which can be used on some other deep models. Elsevier 2021-10-14 /pmc/articles/PMC8529081/ /pubmed/34712755 http://dx.doi.org/10.1016/j.dib.2021.107478 Text en © 2021 Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Heidary-Sharifabad, Ahmad
Zarchi, Mohsen Sardari
Emadi, Sima
Zarei, Gholamreza
ACHENY: A standard Chenopodiaceae image dataset for deep learning models
title ACHENY: A standard Chenopodiaceae image dataset for deep learning models
title_full ACHENY: A standard Chenopodiaceae image dataset for deep learning models
title_fullStr ACHENY: A standard Chenopodiaceae image dataset for deep learning models
title_full_unstemmed ACHENY: A standard Chenopodiaceae image dataset for deep learning models
title_short ACHENY: A standard Chenopodiaceae image dataset for deep learning models
title_sort acheny: a standard chenopodiaceae image dataset for deep learning models
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8529081/
https://www.ncbi.nlm.nih.gov/pubmed/34712755
http://dx.doi.org/10.1016/j.dib.2021.107478
work_keys_str_mv AT heidarysharifabadahmad achenyastandardchenopodiaceaeimagedatasetfordeeplearningmodels
AT zarchimohsensardari achenyastandardchenopodiaceaeimagedatasetfordeeplearningmodels
AT emadisima achenyastandardchenopodiaceaeimagedatasetfordeeplearningmodels
AT zareigholamreza achenyastandardchenopodiaceaeimagedatasetfordeeplearningmodels