Announcement_16
Four papers SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning, Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play, GEM: A Gym for Generalist LLMs and Scaling Agent Learning via Experience Synthesis accepted at ICLR 2026.