日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細


公開

会議論文

Meta-cognitive planning for learning representations

MPS-Authors
/persons/resource/persons269427

Shen,  T
Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society;

/persons/resource/persons217460

Dayan,  P       
Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society;

/persons/resource/persons245576

Bányai,  M       
Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society;

Fulltext (restricted access)
There are currently no full texts shared for your IP range.
フルテキスト (公開)
公開されているフルテキストはありません
付随資料 (公開)
There is no public supplementary material available
引用

Shen, T., Dayan, P., & Bányai, M. (2023). Meta-cognitive planning for learning representations. In 2023 Conference on Cognitive Computational Neuroscience (pp. 425-428). doi:10.32470/CCN.2023.1242-0.


引用: https://hdl.handle.net/21.11116/0000-000D-4DAA-E
要旨
Learning representations that facilitate generalisation underlies efficient decision making in machines and animals alike. However, the representation at each point during the acquisition of competent behaviour needs to support not only the decisions at hand, but also the continuing improvement of the agent’s policy. Thus, the normative solution to the representation learning problem requires planning. We formalise this as meta-cognitive decision making in a model-based reinforcement learning agent that maintains its belief about the environment in an approximately Bayesian way. We use simple contextual bandits to show that representational planning confers an advantage in environments in which generalisation across contexts is possible and for suitable temporal horizons. Our model allows for detailed analyses of the relationships between the inductive biases of a learning agent, the characteristics of the environment, and the detailed dynamics of representational updates. This work contributes to a greater understanding of heuristic solutions in machine learning and the construction of abstractions by humans.