Papers
updated
Unified Multimodal Understanding and Generation Models: Advances,
Challenges, and Opportunities
Paper
•
2505.02567
•
Published
•
80
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware
Representations
Paper
•
2505.18125
•
Published
•
112
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Paper
•
2505.17612
•
Published
•
81
One RL to See Them All: Visual Triple Unified Reinforcement Learning
Paper
•
2505.18129
•
Published
•
61
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning
Attention
Paper
•
2506.13585
•
Published
•
273
Scaling Test-time Compute for LLM Agents
Paper
•
2506.12928
•
Published
•
63
Reinforcement Pre-Training
Paper
•
2506.08007
•
Published
•
263
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical
Understanding and Reasoning
Paper
•
2506.07044
•
Published
•
113
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical
Reasoning
Paper
•
2506.09513
•
Published
•
101
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
Paper
•
2506.06395
•
Published
•
133
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper
•
2505.24726
•
Published
•
277