English
 
User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Conference Paper

Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic

MPS-Authors
/persons/resource/persons217894

Gu,  Shixiang
Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society;

Locator

Link
(Any fulltext)

Fulltext (public)
There are no public fulltexts available
Supplementary Material (public)
There is no public supplementary material available
Citation

Gu, S., Lillicrap, T., Ghahramani, Z., Turner, R. E., & Levine, S. (2017). Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic. In Proceedings International Conference on Learning Representations 2017. Amherst, MA: OpenReviews.net. Retrieved from https://openreview.net/pdf?id=SJ3rcZcxl.


Cite as: http://hdl.handle.net/21.11116/0000-0001-1ED0-3
Abstract
There is no abstract available