jiangyuhao's picture

4 7

jiangyuhao

JYuhao88

·

JYuhao88

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 52

upvoted 2 papers 4 months ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 75

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

upvoted an article about 1 year ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Jul 27, 2024

•

34