Yiwei Chen's picture

3

Yiwei Chen

YiweiChen

·

AI & ML interests

None yet

Organizations

upvoted a paper 3 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 134

upvoted a paper 9 months ago

FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning

Paper • 2504.00487 • Published Apr 1, 2025 • 18

upvoted a collection about 1 year ago

SimNPO-Unlearned Models

This collection hosts the SimNPO-unlearned models over TOFU, MUSE, and WMDP unlearning benchmarks. • 7 items • Updated Aug 8, 2025 • 2