Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14, 2025 • 28
YuLan-Mini Resources Collection Pre-Training & post-training resources for YuLan-Mini • 29 items • Updated May 7, 2025 • 3
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30, 2025 • 59
YuLan-Mini Resources Collection Pre-Training & post-training resources for YuLan-Mini • 29 items • Updated May 7, 2025 • 3