SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 5 days ago • 241
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published Dec 31, 2025 • 119
Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting Paper • 2505.14059 • Published May 20, 2025 • 3
Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning Paper • 2505.14677 • Published May 20, 2025 • 15