Meta-cognitive planning for learning representations

Shen, T; Dayan, P; Bányai, M

doi:10.32470/CCN.2023.1242-0

アイテム詳細

登録内容を編集ファイル形式で保存

一時保存へ追加

タグ情報を表示リリース履歴を表示詳細要約

公開

会議論文

Meta-cognitive planning for learning representations

MPS-Authors

/persons/resource/persons269427

Shen, T
Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society;

/persons/resource/persons217460

Dayan, P
Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society;

/persons/resource/persons245576

Bányai, M
Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society;

External Resource

https://2023.ccneuro.org/view_paper.php?PaperNum=1242
(要旨)

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

フルテキスト (公開)

公開されているフルテキストはありません

付随資料 (公開)

There is no public supplementary material available

引用

Shen, T., Dayan, P., & Bányai, M. (2023). Meta-cognitive planning for learning representations. In 2023 Conference on Cognitive Computational Neuroscience (pp. 425-428). doi:10.32470/CCN.2023.1242-0.

引用: https://hdl.handle.net/21.11116/0000-000D-4DAA-E

要旨

Learning representations that facilitate generalisation underlies efficient decision making in machines and animals alike. However, the representation at each point during the acquisition of competent behaviour needs to support not only the decisions at hand, but also the continuing improvement of the agent’s policy. Thus, the normative solution to the representation learning problem requires planning. We formalise this as meta-cognitive decision making in a model-based reinforcement learning agent that maintains its belief about the environment in an approximately Bayesian way. We use simple contextual bandits to show that representational planning confers an advantage in environments in which generalisation across contexts is possible and for suitable temporal horizons. Our model allows for detailed analyses of the relationships between the inductive biases of a learning agent, the characteristics of the environment, and the detailed dynamics of representational updates. This work contributes to a greater understanding of heuristic solutions in machine learning and the construction of abstractions by humans.