Privileged On-Policy Exploration

Team

classroom

Recent Activity

wendyxwz

updated a dataset about 1 month ago

CMU-POPE/trial

Viewer • Updated Jan 9 • 10 • 23

CohenQu

authored 3 papers 12 months ago

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Paper • 2407.18219 • Published Jul 25, 2024 • 3

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning

Paper • 2310.18247 • Published Oct 27, 2023

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 48

CohenQu

authored a paper over 1 year ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 30