Xu Yutian

AmeliaSanchezed

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 6 days ago • 99

liked a model 4 days ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated 4 days ago • 1.45k • 861

liked 2 datasets 6 days ago

lavita/medical-qa-shared-task-v1-toy

Viewer • Updated Jul 20, 2023 • 64 • 524k • 29

yangwang825/91e3d56b0a-part013

Updated 17 minutes ago • 3.75k • 2

upvoted a paper 7 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 11 days ago • 316

upvoted a paper 8 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 16 days ago • 361

upvoted a paper 10 days ago

Friends and Grandmothers in Silico: Localizing Entity Cells in Language Models

Paper • 2604.01404 • Published 18 days ago • 5

liked a dataset 10 days ago

gsdf/EasyNegative

Viewer • Updated Feb 12, 2023 • 3 • 26.8k • 1.17k

liked a dataset 14 days ago

meryyllebr543/mix

Preview • Updated 9 days ago • 39.6k • 3

liked a dataset 15 days ago

OpenSQZ/AutoMathText-V2

Viewer • Updated 17 days ago • 15.2B • 98.2k • 76

liked a model 17 days ago

BAAI/bge-m3

liked a model 18 days ago

orkungedik/turkish-spell-based

Updated about 1 hour ago • 1.36k • 2

upvoted a paper 18 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 29 days ago • 338

upvoted a paper 27 days ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 424

upvoted 2 papers about 1 month ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 308

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 152

liked a model about 1 month ago

LocoreMind/LocoTrainer-4B

Text Generation • 4B • Updated Mar 14 • 277 • 56
