English
 
User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Free Energy and the Generalized Optimality Equations for Sequential Decision Making

Ortega, P., & Braun, D. (2012). Free Energy and the Generalized Optimality Equations for Sequential Decision Making. In 10th European Workshop on Reinforcement Learning (EWRL 2012) (pp. 1-10).

Item is

Basic

show hide
Item Permalink: http://hdl.handle.net/11858/00-001M-0000-0013-B6C2-2 Version Permalink: http://hdl.handle.net/21.11116/0000-0001-8EA2-8
Genre: Conference Paper

Files

show Files

Locators

show
hide
Description:
-

Creators

show
hide
 Creators:
Ortega, PA1, 2, Author              
Braun, DA1, 2, Author              
Affiliations:
1Research Group Sensorimotor Learning and Decision-Making, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497809              
2Max Planck Institute for Biological Cybernetics, Max Planck Society, Spemannstrasse 38, 72076 Tübingen, DE, ou_1497794              

Content

show
hide
Free keywords: -
 Abstract: The free energy functional has recently been proposed as a variational principle for bounded rational decision-making, since it instantiates a natural trade-off between utility gains and information processing costs that can be axiomatically derived. Here we apply the free energy principle to general decision trees that include both adversarial and stochastic environments. We derive generalized sequential optimality equations that not only include the Bellman optimality equations as a limit case, but also lead to well-known decision-rules such as Expectimax, Minimax and Expectiminimax. We show how these decision-rules can be derived from a single free energy principle that assigns a resource parameter to each node in the decision tree. These resource parameters express a concrete computational cost that can be measured as the amount of samples that are needed from the distribution that belongs to each node. The free energy principle therefore provides the normative basis for generalized optimality equations that account for both adversarial and stochastic environments.

Details

show
hide
Language(s):
 Dates: 2012-07
 Publication Status: Published in print
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: BibTex Citekey: OrtegaB2012_2
 Degree: -

Event

show
hide
Title: 10th European Workshop on Reinforcement Learning (EWRL 2012)
Place of Event: Edinburgh, Scotland
Start-/End Date: -

Legal Case

show

Project information

show

Source 1

show
hide
Title: 10th European Workshop on Reinforcement Learning (EWRL 2012)
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1 - 10 Identifier: -