Cargando…
Identification of animal behavioral strategies by inverse reinforcement learning
Animals are able to reach a desired state in an environment by controlling various behavioral patterns. Identification of the behavioral strategy used for this control is important for understanding animals’ decision-making and is fundamental to dissect information processing done by the nervous sys...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5951592/ https://www.ncbi.nlm.nih.gov/pubmed/29718905 http://dx.doi.org/10.1371/journal.pcbi.1006122 |
_version_ | 1783323057254825984 |
---|---|
author | Yamaguchi, Shoichiro Naoki, Honda Ikeda, Muneki Tsukada, Yuki Nakano, Shunji Mori, Ikue Ishii, Shin |
author_facet | Yamaguchi, Shoichiro Naoki, Honda Ikeda, Muneki Tsukada, Yuki Nakano, Shunji Mori, Ikue Ishii, Shin |
author_sort | Yamaguchi, Shoichiro |
collection | PubMed |
description | Animals are able to reach a desired state in an environment by controlling various behavioral patterns. Identification of the behavioral strategy used for this control is important for understanding animals’ decision-making and is fundamental to dissect information processing done by the nervous system. However, methods for quantifying such behavioral strategies have not been fully established. In this study, we developed an inverse reinforcement-learning (IRL) framework to identify an animal’s behavioral strategy from behavioral time-series data. We applied this framework to C. elegans thermotactic behavior; after cultivation at a constant temperature with or without food, fed worms prefer, while starved worms avoid the cultivation temperature on a thermal gradient. Our IRL approach revealed that the fed worms used both the absolute temperature and its temporal derivative and that their behavior involved two strategies: directed migration (DM) and isothermal migration (IM). With DM, worms efficiently reached specific temperatures, which explains their thermotactic behavior when fed. With IM, worms moved along a constant temperature, which reflects isothermal tracking, well-observed in previous studies. In contrast to fed animals, starved worms escaped the cultivation temperature using only the absolute, but not the temporal derivative of temperature. We also investigated the neural basis underlying these strategies, by applying our method to thermosensory neuron-deficient worms. Thus, our IRL-based approach is useful in identifying animal strategies from behavioral time-series data and could be applied to a wide range of behavioral studies, including decision-making, in other organisms. |
format | Online Article Text |
id | pubmed-5951592 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-59515922018-05-25 Identification of animal behavioral strategies by inverse reinforcement learning Yamaguchi, Shoichiro Naoki, Honda Ikeda, Muneki Tsukada, Yuki Nakano, Shunji Mori, Ikue Ishii, Shin PLoS Comput Biol Research Article Animals are able to reach a desired state in an environment by controlling various behavioral patterns. Identification of the behavioral strategy used for this control is important for understanding animals’ decision-making and is fundamental to dissect information processing done by the nervous system. However, methods for quantifying such behavioral strategies have not been fully established. In this study, we developed an inverse reinforcement-learning (IRL) framework to identify an animal’s behavioral strategy from behavioral time-series data. We applied this framework to C. elegans thermotactic behavior; after cultivation at a constant temperature with or without food, fed worms prefer, while starved worms avoid the cultivation temperature on a thermal gradient. Our IRL approach revealed that the fed worms used both the absolute temperature and its temporal derivative and that their behavior involved two strategies: directed migration (DM) and isothermal migration (IM). With DM, worms efficiently reached specific temperatures, which explains their thermotactic behavior when fed. With IM, worms moved along a constant temperature, which reflects isothermal tracking, well-observed in previous studies. In contrast to fed animals, starved worms escaped the cultivation temperature using only the absolute, but not the temporal derivative of temperature. We also investigated the neural basis underlying these strategies, by applying our method to thermosensory neuron-deficient worms. Thus, our IRL-based approach is useful in identifying animal strategies from behavioral time-series data and could be applied to a wide range of behavioral studies, including decision-making, in other organisms. Public Library of Science 2018-05-02 /pmc/articles/PMC5951592/ /pubmed/29718905 http://dx.doi.org/10.1371/journal.pcbi.1006122 Text en © 2018 Yamaguchi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Yamaguchi, Shoichiro Naoki, Honda Ikeda, Muneki Tsukada, Yuki Nakano, Shunji Mori, Ikue Ishii, Shin Identification of animal behavioral strategies by inverse reinforcement learning |
title | Identification of animal behavioral strategies by inverse reinforcement learning |
title_full | Identification of animal behavioral strategies by inverse reinforcement learning |
title_fullStr | Identification of animal behavioral strategies by inverse reinforcement learning |
title_full_unstemmed | Identification of animal behavioral strategies by inverse reinforcement learning |
title_short | Identification of animal behavioral strategies by inverse reinforcement learning |
title_sort | identification of animal behavioral strategies by inverse reinforcement learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5951592/ https://www.ncbi.nlm.nih.gov/pubmed/29718905 http://dx.doi.org/10.1371/journal.pcbi.1006122 |
work_keys_str_mv | AT yamaguchishoichiro identificationofanimalbehavioralstrategiesbyinversereinforcementlearning AT naokihonda identificationofanimalbehavioralstrategiesbyinversereinforcementlearning AT ikedamuneki identificationofanimalbehavioralstrategiesbyinversereinforcementlearning AT tsukadayuki identificationofanimalbehavioralstrategiesbyinversereinforcementlearning AT nakanoshunji identificationofanimalbehavioralstrategiesbyinversereinforcementlearning AT moriikue identificationofanimalbehavioralstrategiesbyinversereinforcementlearning AT ishiishin identificationofanimalbehavioralstrategiesbyinversereinforcementlearning |