2 5 1

Zhou

xinyu04

AI & ML interests

None yet

Recent Activity

submitted a paper 7 days ago

Efficient RLVR Training via Weighted Mutual Information Data Selection

upvoted a paper 7 days ago

Efficient RLVR Training via Weighted Mutual Information Data Selection

authored a paper 7 days ago

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

View all activity

Organizations

submitted a paper to Daily Papers 7 days ago

Efficient RLVR Training via Weighted Mutual Information Data Selection

Paper • 2603.01907 • Published 8 days ago • 14

upvoted a paper 7 days ago

Efficient RLVR Training via Weighted Mutual Information Data Selection

Paper • 2603.01907 • Published 8 days ago • 14

authored 3 papers 7 days ago

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

Paper • 2405.16287 • Published May 25, 2024 • 11

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

Efficient RLVR Training via Weighted Mutual Information Data Selection

Paper • 2603.01907 • Published 8 days ago • 14

upvoted a paper 15 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

New activity in LARK-Lab/CodeScalerPair-51K about 1 month ago

Update README.md

#1 opened about 1 month ago by

xinyu04

upvoted a paper 4 months ago

The Station: An Open-World Environment for AI-Driven Discovery

Paper • 2511.06309 • Published Nov 9, 2025 • 37

liked a Space 6 months ago

AI Deadlines

⚡

676

Track upcoming AI conference deadlines

upvoted a paper 7 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 118

updated a model about 1 year ago

xinyu04/ppo-LunarLander-v2

Reinforcement Learning • Updated Mar 5, 2025

published a model about 1 year ago

xinyu04/ppo-LunarLander-v2

Reinforcement Learning • Updated Mar 5, 2025

upvoted a paper almost 2 years ago

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

Paper • 2405.16287 • Published May 25, 2024 • 11

Zhou

AI & ML interests

Recent Activity

Organizations

xinyu04's activity

Update README.md

AI Deadlines