Approximate Planning from Better Bounds on Q

Sezener, CE; Dayan, P

Local TagsRelease HistoryDetailsSummary

Approximate Planning from Better Bounds on Q

Sezener, C., & Dayan, P. (2017). Approximate Planning from Better Bounds on Q. Poster presented at 3rd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2017), Ann Arbor, MI, USA.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0004-DAE0-9 Version Permalink: https://hdl.handle.net/21.11116/0000-0004-DAE1-8

Genre: Poster

Files

show Files

Locators

show

hide

Locator:
http://rldm.org/wp-content/uploads/2017/06/RLDM17AbstractsBooklet.pdf (Abstract) Open Access status unknown

Description:
-

OA-Status:

Creators

show

hide

Creators:
Sezener, CE, Author
Dayan, P¹, Author

Affiliations:
1External Organizations, ou_persistent22

Content

show

hide

Free keywords: -

Abstract: Planning problems are often solved approximately using simulation based methods such as Monte
Carlo Tree Search (MCTS). Indeed, UCT, perhaps the most popular MCTS algorithm, lies at the heart of
many successful applications. However, UCT is fundamentally inefficient as a planning algorithm, since it
is not focused exclusively on the value of the action that is ultimately chosen. Accordingly, even as simple
a modification to UCT as accounting for myopic information values at the root of the search tree can result
in significant performance improvements. Here, we propose a method that extends value of information-
like computations to arbitrarily many nodes of the search tree for simple acyclic MDPs. We demonstrate significant performance improvements over other planning algorithms.

Details

show

hide

Language(s):

Dates: Published Online: 2017-06

Publication Status: Published online

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: -

Degree: -

Event

show

hide

Title: 3rd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2017)

Place of Event: Ann Arbor, MI, USA

Start-/End Date: 2017-06-11 - 2017-06-14

Legal Case

show

Project information

show

Source 1

show

hide

Title: 3rd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2017)

Source Genre: Proceedings

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: - Sequence Number: T72 Start / End Page: 86 Identifier: -