Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe Paper • 2603.21972 • Published 1 day ago • 2
Repurposing Synthetic Data for Fine-grained Search Agent Supervision Paper • 2510.24694 • Published Oct 28, 2025 • 25
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization Paper • 2509.13313 • Published Sep 16, 2025 • 80
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Paper • 2509.13305 • Published Sep 16, 2025 • 91
MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval Paper • 2509.07666 • Published Sep 6, 2025
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published Jul 3, 2025 • 126