Cargando…

Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning

This paper is concerned with extracting relevant terms from a text corpus on whisk(e)y. “Relevant” terms are usually contextually defined in their domain of use. Arguably, every domain has a specialized vocabulary used for describing things. For example, the field of Sensory Science, a sub-field of...

Descripción completa

Detalles Bibliográficos
Autores principales: Miller, Chreston, Hamilton, Leah, Lahne, Jacob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8303687/
https://www.ncbi.nlm.nih.gov/pubmed/34359502
http://dx.doi.org/10.3390/foods10071633
_version_ 1783727148098387968
author Miller, Chreston
Hamilton, Leah
Lahne, Jacob
author_facet Miller, Chreston
Hamilton, Leah
Lahne, Jacob
author_sort Miller, Chreston
collection PubMed
description This paper is concerned with extracting relevant terms from a text corpus on whisk(e)y. “Relevant” terms are usually contextually defined in their domain of use. Arguably, every domain has a specialized vocabulary used for describing things. For example, the field of Sensory Science, a sub-field of Food Science, investigates human responses to food products and differentiates “descriptive” terms for flavors from “ordinary”, non-descriptive language. Within the field, descriptors are generated through Descriptive Analysis, a method wherein a human panel of experts tastes multiple food products and defines descriptors. This process is both time-consuming and expensive. However, one could leverage existing data to identify and build a flavor language automatically. For example, there are thousands of professional and semi-professional reviews of whisk(e)y published on the internet, providing abundant descriptors interspersed with non-descriptive language. The aim, then, is to be able to automatically identify descriptive terms in unstructured reviews for later use in product flavor characterization. We created two systems to perform this task. The first is an interactive visual tool that can be used to tag examples of descriptive terms from thousands of whisky reviews. This creates a training dataset that we use to perform transfer learning using GloVe word embeddings and a Long Short-Term Memory deep learning model architecture. The result is a model that can accurately identify descriptors within a corpus of whisky review texts with a train/test accuracy of 99% and precision, recall, and F1-scores of 0.99. We tested for overfitting by comparing the training and validation loss for divergence. Our results show that the language structure for descriptive terms can be programmatically learned.
format Online
Article
Text
id pubmed-8303687
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83036872021-07-25 Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning Miller, Chreston Hamilton, Leah Lahne, Jacob Foods Article This paper is concerned with extracting relevant terms from a text corpus on whisk(e)y. “Relevant” terms are usually contextually defined in their domain of use. Arguably, every domain has a specialized vocabulary used for describing things. For example, the field of Sensory Science, a sub-field of Food Science, investigates human responses to food products and differentiates “descriptive” terms for flavors from “ordinary”, non-descriptive language. Within the field, descriptors are generated through Descriptive Analysis, a method wherein a human panel of experts tastes multiple food products and defines descriptors. This process is both time-consuming and expensive. However, one could leverage existing data to identify and build a flavor language automatically. For example, there are thousands of professional and semi-professional reviews of whisk(e)y published on the internet, providing abundant descriptors interspersed with non-descriptive language. The aim, then, is to be able to automatically identify descriptive terms in unstructured reviews for later use in product flavor characterization. We created two systems to perform this task. The first is an interactive visual tool that can be used to tag examples of descriptive terms from thousands of whisky reviews. This creates a training dataset that we use to perform transfer learning using GloVe word embeddings and a Long Short-Term Memory deep learning model architecture. The result is a model that can accurately identify descriptors within a corpus of whisky review texts with a train/test accuracy of 99% and precision, recall, and F1-scores of 0.99. We tested for overfitting by comparing the training and validation loss for divergence. Our results show that the language structure for descriptive terms can be programmatically learned. MDPI 2021-07-14 /pmc/articles/PMC8303687/ /pubmed/34359502 http://dx.doi.org/10.3390/foods10071633 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Miller, Chreston
Hamilton, Leah
Lahne, Jacob
Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning
title Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning
title_full Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning
title_fullStr Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning
title_full_unstemmed Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning
title_short Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning
title_sort sensory descriptor analysis of whisky lexicons through the use of deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8303687/
https://www.ncbi.nlm.nih.gov/pubmed/34359502
http://dx.doi.org/10.3390/foods10071633
work_keys_str_mv AT millerchreston sensorydescriptoranalysisofwhiskylexiconsthroughtheuseofdeeplearning
AT hamiltonleah sensorydescriptoranalysisofwhiskylexiconsthroughtheuseofdeeplearning
AT lahnejacob sensorydescriptoranalysisofwhiskylexiconsthroughtheuseofdeeplearning