Optimistic Value Iteration
Markov decision processes are widely used for planning and verification in settings that combine controllable or adversarial choices with probabilistic behaviour. The standard analysis algorithm, value iteration, only provides lower bounds on infinite-horizon probabilities and rewards. Two “sound” v...
Main authors: | Hartmanns, Arnd; Kaminski, Benjamin Lucien |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | 2020 |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7363440/ http://dx.doi.org/10.1007/978-3-030-53291-8_26 |
Similar items
- Optimistic biases in observational learning of value
  by: Nicolle, A., et al.
  Published: (2011)
- Portrait of an optimist
  by: Sette, Alessandro
  Published: (2017)
- Once an optimist, always an optimist? Studying cognitive judgment bias in mice
  by: Bračić, Marko, et al.
  Published: (2022)
- Replicating [Formula: see text] with Prolonged Retrials: An Experimental Report
  by: Budde, Carlos E., et al.
  Published: (2021)
- Laughing Rats Are Optimistic
  by: Rygula, Rafal, et al.
  Published: (2012)