ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30, 2025 • 143
GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning Paper • 2510.20548 • Published Oct 23, 2025 • 3
Reasoning Efficiency Research Collection Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs • 3 items • Updated 16 days ago • 11
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 177
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability Paper • 2508.07050 • Published Aug 9, 2025 • 117