arxiv:2404.03214
Walid Bousselham
WalidBouss
AI & ML interests
Computer Vision, Multi-modal learning and Zero-shot adaptation.
Recent Activity
upvoted
an
article
9 days ago
Deriving the PPO Loss from First Principles
upvoted
a
paper
about 1 month ago
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing
upvoted
a
paper
about 1 month ago
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models