Cargando…
Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning
This paper is concerned with extracting relevant terms from a text corpus on whisk(e)y. “Relevant” terms are usually contextually defined in their domain of use. Arguably, every domain has a specialized vocabulary used for describing things. For example, the field of Sensory Science, a sub-field of...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8303687/ https://www.ncbi.nlm.nih.gov/pubmed/34359502 http://dx.doi.org/10.3390/foods10071633 |
_version_ | 1783727148098387968 |
---|---|
author | Miller, Chreston Hamilton, Leah Lahne, Jacob |
author_facet | Miller, Chreston Hamilton, Leah Lahne, Jacob |
author_sort | Miller, Chreston |
collection | PubMed |
description | This paper is concerned with extracting relevant terms from a text corpus on whisk(e)y. “Relevant” terms are usually contextually defined in their domain of use. Arguably, every domain has a specialized vocabulary used for describing things. For example, the field of Sensory Science, a sub-field of Food Science, investigates human responses to food products and differentiates “descriptive” terms for flavors from “ordinary”, non-descriptive language. Within the field, descriptors are generated through Descriptive Analysis, a method wherein a human panel of experts tastes multiple food products and defines descriptors. This process is both time-consuming and expensive. However, one could leverage existing data to identify and build a flavor language automatically. For example, there are thousands of professional and semi-professional reviews of whisk(e)y published on the internet, providing abundant descriptors interspersed with non-descriptive language. The aim, then, is to be able to automatically identify descriptive terms in unstructured reviews for later use in product flavor characterization. We created two systems to perform this task. The first is an interactive visual tool that can be used to tag examples of descriptive terms from thousands of whisky reviews. This creates a training dataset that we use to perform transfer learning using GloVe word embeddings and a Long Short-Term Memory deep learning model architecture. The result is a model that can accurately identify descriptors within a corpus of whisky review texts with a train/test accuracy of 99% and precision, recall, and F1-scores of 0.99. We tested for overfitting by comparing the training and validation loss for divergence. Our results show that the language structure for descriptive terms can be programmatically learned. |
format | Online Article Text |
id | pubmed-8303687 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83036872021-07-25 Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning Miller, Chreston Hamilton, Leah Lahne, Jacob Foods Article This paper is concerned with extracting relevant terms from a text corpus on whisk(e)y. “Relevant” terms are usually contextually defined in their domain of use. Arguably, every domain has a specialized vocabulary used for describing things. For example, the field of Sensory Science, a sub-field of Food Science, investigates human responses to food products and differentiates “descriptive” terms for flavors from “ordinary”, non-descriptive language. Within the field, descriptors are generated through Descriptive Analysis, a method wherein a human panel of experts tastes multiple food products and defines descriptors. This process is both time-consuming and expensive. However, one could leverage existing data to identify and build a flavor language automatically. For example, there are thousands of professional and semi-professional reviews of whisk(e)y published on the internet, providing abundant descriptors interspersed with non-descriptive language. The aim, then, is to be able to automatically identify descriptive terms in unstructured reviews for later use in product flavor characterization. We created two systems to perform this task. The first is an interactive visual tool that can be used to tag examples of descriptive terms from thousands of whisky reviews. This creates a training dataset that we use to perform transfer learning using GloVe word embeddings and a Long Short-Term Memory deep learning model architecture. The result is a model that can accurately identify descriptors within a corpus of whisky review texts with a train/test accuracy of 99% and precision, recall, and F1-scores of 0.99. We tested for overfitting by comparing the training and validation loss for divergence. Our results show that the language structure for descriptive terms can be programmatically learned. MDPI 2021-07-14 /pmc/articles/PMC8303687/ /pubmed/34359502 http://dx.doi.org/10.3390/foods10071633 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Miller, Chreston Hamilton, Leah Lahne, Jacob Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning |
title | Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning |
title_full | Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning |
title_fullStr | Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning |
title_full_unstemmed | Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning |
title_short | Sensory Descriptor Analysis of Whisky Lexicons through the Use of Deep Learning |
title_sort | sensory descriptor analysis of whisky lexicons through the use of deep learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8303687/ https://www.ncbi.nlm.nih.gov/pubmed/34359502 http://dx.doi.org/10.3390/foods10071633 |
work_keys_str_mv | AT millerchreston sensorydescriptoranalysisofwhiskylexiconsthroughtheuseofdeeplearning AT hamiltonleah sensorydescriptoranalysisofwhiskylexiconsthroughtheuseofdeeplearning AT lahnejacob sensorydescriptoranalysisofwhiskylexiconsthroughtheuseofdeeplearning |