日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細


公開

学術論文

Realizing a deep reinforcement learning agent discovering real-time feedback control strategies for a quantum system

MPS-Authors

Landgraf,  Jonas
Marquardt Division, Max Planck Institute for the Science of Light, Max Planck Society;
Physics Department, University of Erlangen-Nuremberg, Staudtstraße 5, 91058 Erlangen, Germany;

Fösel,  Thomas
Marquardt Division, Max Planck Institute for the Science of Light, Max Planck Society;
Physics Department, University of Erlangen-Nuremberg, Staudtstraße 5, 91058 Erlangen, Germany;

/persons/resource/persons201125

Marquardt,  Florian
Marquardt Division, Max Planck Institute for the Science of Light, Max Planck Society;
Physics Department, University of Erlangen-Nuremberg, Staudtstraße 5, 91058 Erlangen, Germany;

External Resource
There are no locators available
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
フルテキスト (公開)

s41467-023-42901-3.pdf
(全文テキスト(全般)), 2MB

付随資料 (公開)
引用

Reuer, K., Landgraf, J., Fösel, T., O'Sullivan, J., Beltrán, L., Akin, A., Norris, G. J., Remm, A., Kerschbaum, M., Besse, J.-C., Marquardt, F., Wallraff, A., & Eichler, C. (2023). Realizing a deep reinforcement learning agent discovering real-time feedback control strategies for a quantum system. Nature Communications, 14:. doi:10.1038/s41467-023-42901-3.


引用: https://hdl.handle.net/21.11116/0000-000B-5DBE-8
要旨
To realize the full potential of quantum technologies, finding good strategies to control quantum information processing devices in real time becomes increasingly important. Usually these strategies require a precise understanding of the device itself, which is generally not available. Model-free reinforcement learning circumvents this need by discovering control strategies from scratch without relying on an accurate description of the quantum system. Furthermore, important tasks like state


preparation, gate teleportation and error correction need feedback at time scales much shorter than the coherence time, which for superconducting circuits is in the microsecond range. Developing and training a deep reinforcement learning agent able to operate in this real-time feedback regime has been an open challenge. Here, we have implemented such an agent in the form of a latency-optimized deep neural network on a field-programmable gate array (FPGA). We demonstrate its use to efficiently initialize a superconducting qubit into a target state. To train the agent, we use


model-free reinforcement learning that is based solely on measurement data. We study the agent’s performance for strong and weak measurements, and for three-level readout, and compare with simple strategies based on thresholding. This demonstration motivates further research towards adoption of


reinforcement learning for real-time feedback control of quantum devices and more generally any physical system requiring learnable low-latency feedback control.