Singh, Sidak Pal Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society; External Organizations;
https://proceedings.neurips.cc/paper_files/paper/2022/hash/ae0cba715b60c4052359b3d52a2cff7f-Abstract-Conference.html (Publisher version)
https://openreview.net/forum?id=FxVH7iToXS (Any fulltext)
https://doi.org/10.48550/arXiv.2206.03126 (Preprint)
Noci, L., Anagnostidis, S., Biggio, L., Orvieto, A., Singh, S. P., & Lucchi, A. (2022). Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, & A. Oh (Eds.), Advances in Neural Information Processing Systems 35 (pp. 27198-27211). Red Hook, NY: Curran Associates, Inc. Retrieved from https://proceedings.neurips.cc/paper_files/paper/2022/hash/ae0cba715b60c4052359b3d52a2cff7f-Abstract-Conference.html.