RMNP: Row-Momentum Normalized Preconditioning
ICML 2026 · Why Muon's orthogonalization helps neural networks — and how it provably collapses to a single row-wise normalization.