Brantner, B., & Kraus, M. (2023). Generalizing Adam to Manifolds For Efficiently Training Transformers. Talk presented at European Conference on Numerical Mathematics and Advanced Applications (ENUMATH 2023). Lisbon. 2023-09-04 - 2023-09-08.