Yaoyao Qian's picture

2 8 6

Yaoyao Qian

FreaxRuby

·

https://h-freax.github.io/

AI & ML interests

None yet

Organizations

upvoted a paper 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

upvoted a paper 3 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 89

upvoted a paper 6 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30, 2025 • 50

upvoted a paper 7 months ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Paper • 2506.01881 • Published Jun 2, 2025 • 6

upvoted a paper 9 months ago

TextArena

Paper • 2504.11442 • Published Apr 15, 2025 • 30

upvoted an article 11 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

270

upvoted 2 papers over 1 year ago

ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter

Paper • 2407.11298 • Published Jul 16, 2024 • 6

Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

Paper • 2406.11740 • Published Jun 17, 2024 • 1