Shenzhi Wang
shenzhi-wang
AI & ML interests
Large Language Model, Reinforcement Learning, and AI Agents
Recent Activity
authored a paper 1 day ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models authored a paper 1 day ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning submitted a paper 1 day ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning