Xiaoqian Wu

PandaQQ

·

enlighten0707

AI & ML interests

None yet

Organizations

Collections 6

Grounded Reinforcement Learning for Visual Reasoning
Paper • 2505.23678 • Published May 29 • 2

Sherlock: Self-Correcting Reasoning in Vision-Language Models
Paper • 2505.22651 • Published May 28 • 48

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
Paper • 2506.09985 • Published Jun 11 • 29

TTRL: Test-Time Reinforcement Learning
Paper • 2504.16084 • Published Apr 22 • 120

Learning to Reason under Off-Policy Guidance
Paper • 2504.14945 • Published Apr 21 • 88

RM-R1: Reward Modeling as Reasoning
Paper • 2505.02387 • Published May 5 • 79

models 0

None public yet

datasets 0

None public yet
