Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yaoyao Qian's picture
2 8 6

Yaoyao Qian

FreaxRuby
andyoung's profile picture
·
https://h-freax.github.io/
  • RubyFreax
  • h-freax
  • rubyfreax

AI & ML interests

None yet

Organizations

Northeastern University 's profile picture

upvoted a paper 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81
upvoted a paper 3 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 89
upvoted a paper 6 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30, 2025 • 50
upvoted a paper 7 months ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Paper • 2506.01881 • Published Jun 2, 2025 • 6
upvoted a paper 9 months ago

TextArena

Paper • 2504.11442 • Published Apr 15, 2025 • 30
upvoted an article 11 months ago
view article
Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025
•
270
upvoted 2 papers over 1 year ago

ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter

Paper • 2407.11298 • Published Jul 16, 2024 • 6

Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

Paper • 2406.11740 • Published Jun 17, 2024 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs