4 13 6

Xinyi Wan

ufotalent

AI & ML interests

ML Sys

Recent Activity

upvoted a paper 16 days ago

In-Context Reinforcement Learning for Tool Use in Large Language Models

upvoted a paper about 2 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

authored a paper about 2 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

View all activity

Organizations

upvoted a paper 16 days ago

In-Context Reinforcement Learning for Tool Use in Large Language Models

Paper • 2603.08068 • Published 19 days ago • 41

upvoted a paper about 2 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37

authored 3 papers about 2 months ago

upvoted a paper about 2 months ago

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published Jan 27 • 8

submitted a paper to Daily Papers about 2 months ago

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published Jan 27 • 8

upvoted 2 papers 6 months ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26, 2025 • 70

Variational Reasoning for Language Models

Paper • 2509.22637 • Published Sep 26, 2025 • 69

upvoted a paper 7 months ago

Understanding Tool-Integrated Reasoning

Paper • 2508.19201 • Published Aug 26, 2025 • 32

liked a dataset 9 months ago

lkevinzc/math-collection

Viewer • Updated Feb 24, 2025 • 7.39k • 4 • 1

upvoted a paper 10 months ago

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19, 2025 • 36

authored a paper 12 months ago

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization

Paper • 2503.01328 • Published Mar 3, 2025 • 16

upvoted a paper about 1 year ago

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization

Paper • 2503.01328 • Published Mar 3, 2025 • 16

commented a paper about 1 year ago

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization

Paper • 2503.01328 • Published Mar 3, 2025 • 16 •

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.75k

The ultimate guide to training LLM on large GPU Clusters

published an article about 1 year ago

Article

双流并行(DualPipe) 没有双流会更好

Feb 28, 2025

•

upvoted an article about 1 year ago

Article

DualPipe could be better without the Dual

Feb 28, 2025

•

published an article about 1 year ago

Article

DualPipe could be better without the Dual

Feb 28, 2025

•

upvoted a collection over 1 year ago

🔱 Sailor2 Language Models

Collection

Sailing in South-East Asia with Inclusive Multilingual LLMs • 32 items • Updated 26 days ago • 30

Xinyi Wan

AI & ML interests

Recent Activity

Organizations

ufotalent's activity

The Ultra-Scale Playbook

双流并行(DualPipe) 没有双流会更好

DualPipe could be better without the Dual

DualPipe could be better without the Dual