The Robot Routing Problem for Collecting Aggregate Stochastic Rewards

Dimitrova, Rayna; Gavran, Ivan; Majumdar, Rupak; Prabhu, Vinayak; Soudjani, Sadegh

Local TagsRelease HistoryDetailsSummary

The Robot Routing Problem for Collecting Aggregate Stochastic Rewards

Dimitrova, R., Gavran, I., Majumdar, R., Prabhu, V., & Soudjani, S. (2017). The Robot Routing Problem for Collecting Aggregate Stochastic Rewards. Retrieved from http://arxiv.org/abs/1704.05303.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0000-ED2C-5 Version Permalink: https://hdl.handle.net/21.11116/0000-0000-ED2D-4

Genre: Paper

Files

show Files

hide Files

:

arXiv:1704.05303.pdf (Preprint), 741KB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-0000-ED2E-3

Name:
arXiv:1704.05303.pdf

Description:
File downloaded from arXiv at 2018-03-23 09:03 Full version of the CONCUR (28th International Conference on Concurrency Theory) 2017 paper

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/help/license

Locators

show

Creators

show

hide

Creators:
Dimitrova, Rayna¹, Author
Gavran, Ivan¹, Author
Majumdar, Rupak¹, Author
Prabhu, Vinayak¹, Author
Soudjani, Sadegh¹, Author

Affiliations:
1Group R. Majumdar, Max Planck Institute for Software Systems, Max Planck Society, ou_2105292

Content

show

hide

Free keywords: cs.SY,Computer Science, Computational Complexity, cs.CC,Computer Science, Data Structures and Algorithms, cs.DS,Computer Science, Computer Science and Game Theory, cs.GT,Mathematics, Optimization and Control, math.OC

Abstract: We propose a new model for formalizing reward collection problems on graphs with dynamically generated rewards which may appear and disappear based on a stochastic model. The *robot routing problem* is modeled as a graph whose nodes are stochastic processes generating potential rewards over discrete time. The rewards are generated according to the stochastic process, but at each step, an existing reward disappears with a given probability. The edges in the graph encode the (unit-distance) paths between the rewards' locations. On visiting a node, the robot collects the accumulated reward at the node at that time, but traveling between the nodes takes time. The optimization question asks to compute an optimal (or epsilon-optimal) path that maximizes the expected collected rewards. We consider the finite and infinite-horizon robot routing problems. For finite-horizon, the goal is to maximize the total expected reward, while for infinite horizon we consider limit-average objectives. We study the computational and strategy complexity of these problems, establish NP-lower bounds and show that optimal strategies require memory in general. We also provide an algorithm for computing epsilon-optimal infinite paths for arbitrary epsilon > 0.

Details

show

hide

Language(s): eng - English

Dates: Created: 2017-04-18Modified: 2017-07-17Published Online: 2017

Publication Status: Published online

Pages: 20 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1704.05303
URI: http://arxiv.org/abs/1704.05303
BibTex Citekey: Dimitrova_arXiv1704.05303

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show