Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 18 days ago • 50
MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning Paper • 2603.12266 • Published Mar 12 • 19