Visual-ERM: Reward Modeling for Visual Equivalence
Paper • 2603.13224 • Published • 21
None defined yet.
TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size
Rethinking the Trust Region in LLM Reinforcement Learning