Improving Generalization for Temporal Difference Learning: The Successor 
Representation

Dayan, P

doi:10.1162/neco.1993.5.4.613

Datensatz

DATENSATZ AKTIONENEXPORT

Zur Ablage hinzufügen

Lokale TagsFreigabegeschichteDetailsÜbersicht

Freigegeben

Zeitschriftenartikel

Improving Generalization for Temporal Difference Learning: The Successor Representation

MPG-Autoren

Es sind keine MPG-Autoren in der Publikation vorhanden

Externe Ressourcen

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.1993.5.4.613
(Verlagsversion)

Volltexte (beschränkter Zugriff)

Für Ihren IP-Bereich sind aktuell keine Volltexte freigegeben.

Volltexte (frei zugänglich)

Es sind keine frei zugänglichen Volltexte in PuRe verfügbar

Ergänzendes Material (frei zugänglich)

Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar

Zitation

Dayan, P. (1993). Improving Generalization for Temporal Difference Learning: The Successor Representation. Neural computation, 5(4), 613-624. doi:10.1162/neco.1993.5.4.613.

Zitierlink: https://hdl.handle.net/21.11116/0000-0002-D707-4

Zusammenfassung

Estimation of returns over time, the focus of temporal difference (TD) algorithms, imposes particular constraints on good function approximators or representations. Appropriate generalization between states is determined by how similar their successors are, and representations should follow suit. This paper shows how TD machinery can be used to learn such representations, and illustrates, using a navigation task, the appropriately distributed nature of the result.