SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 4 days ago • 19
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 4 days ago • 19
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 4 days ago • 19
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 4 days ago • 19
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 4 days ago • 19
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents Paper • 2512.20092 • Published 16 days ago • 8
Bridging the Long-Term Gap: A Memory-Active Policy for Multi-Session Task-Oriented Dialogue Paper • 2505.20231 • Published May 26, 2025
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning Paper • 2508.19996 • Published Aug 27, 2025
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents Paper • 2512.20092 • Published 16 days ago • 8
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction Paper • 2509.07403 • Published Sep 9, 2025 • 34
Electrocardiogram Instruction Tuning for Report Generation Paper • 2403.04945 • Published Mar 7, 2024 • 2
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models Paper • 2404.02657 • Published Apr 3, 2024 • 2
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Paper • 2301.13741 • Published Jan 31, 2023 • 1
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models Paper • 2406.13035 • Published Jun 18, 2024 • 3
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Paper • 2407.13623 • Published Jul 18, 2024 • 56
PhyX: Does Your Model Have the "Wits" for Physical Reasoning? Paper • 2505.15929 • Published May 21, 2025 • 49
The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs Paper • 2507.07562 • Published Jul 10, 2025 • 1
MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents Paper • 2508.21475 • Published Aug 29, 2025 • 2
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers Paper • 2305.17455 • Published May 27, 2023