Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xiaoqian Wu's picture

Xiaoqian Wu

PandaQQ
·
  • enlighten0707

AI & ML interests

None yet

Organizations

Shanghai Jiao Tong University's profile picture

Collections 6

reasoning
  • Grounded Reinforcement Learning for Visual Reasoning

    Paper • 2505.23678 • Published May 29 • 2
  • Sherlock: Self-Correcting Reasoning in Vision-Language Models

    Paper • 2505.22651 • Published May 28 • 48
  • V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

    Paper • 2506.09985 • Published Jun 11 • 29
RL
  • TTRL: Test-Time Reinforcement Learning

    Paper • 2504.16084 • Published Apr 22 • 120
  • Learning to Reason under Off-Policy Guidance

    Paper • 2504.14945 • Published Apr 21 • 88
  • RM-R1: Reward Modeling as Reasoning

    Paper • 2505.02387 • Published May 5 • 79
reasoning
  • Grounded Reinforcement Learning for Visual Reasoning

    Paper • 2505.23678 • Published May 29 • 2
  • Sherlock: Self-Correcting Reasoning in Vision-Language Models

    Paper • 2505.22651 • Published May 28 • 48
  • V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

    Paper • 2506.09985 • Published Jun 11 • 29
RL
  • TTRL: Test-Time Reinforcement Learning

    Paper • 2504.16084 • Published Apr 22 • 120
  • Learning to Reason under Off-Policy Guidance

    Paper • 2504.14945 • Published Apr 21 • 88
  • RM-R1: Reward Modeling as Reasoning

    Paper • 2505.02387 • Published May 5 • 79
View 6 collections

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs