Privileged On-Policy Exploration

Team
classroom

Recent Activity


Team members: Yuxiao Qu, Wen Ye

wendyxwz 
updated a dataset about 1 month ago

CMU-POPE/trial

Viewer • Updated Jan 9 • 10 • 23
CohenQu 
authored 3 papers 12 months ago

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Paper • 2407.18219 • Published Jul 25, 2024 • 3

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning

Paper • 2310.18247 • Published Oct 27, 2023

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 48
CohenQu 
authored a paper over 1 year ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 30