English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Approximate Planning from Better Bounds on Q

Sezener, C., & Dayan, P. (2017). Approximate Planning from Better Bounds on Q. Poster presented at 3rd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2017), Ann Arbor, MI, USA.

Item is

Files

show Files

Locators

show
hide
Description:
-
OA-Status:

Creators

show
hide
 Creators:
Sezener, CE, Author
Dayan, P1, Author           
Affiliations:
1External Organizations, ou_persistent22              

Content

show
hide
Free keywords: -
 Abstract: Planning problems are often solved approximately using simulation based methods such as Monte
Carlo Tree Search (MCTS). Indeed, UCT, perhaps the most popular MCTS algorithm, lies at the heart of
many successful applications. However, UCT is fundamentally inefficient as a planning algorithm, since it
is not focused exclusively on the value of the action that is ultimately chosen. Accordingly, even as simple
a modification to UCT as accounting for myopic information values at the root of the search tree can result
in significant performance improvements. Here, we propose a method that extends value of information-
like computations to arbitrarily many nodes of the search tree for simple acyclic MDPs. We demonstrate significant performance improvements over other planning algorithms.

Details

show
hide
Language(s):
 Dates: 2017-06
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: -
 Degree: -

Event

show
hide
Title: 3rd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2017)
Place of Event: Ann Arbor, MI, USA
Start-/End Date: 2017-06-11 - 2017-06-14

Legal Case

show

Project information

show

Source 1

show
hide
Title: 3rd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2017)
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: T72 Start / End Page: 86 Identifier: -