view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 20 days ago • 79
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 490
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Paper • 2602.03773 • Published Feb 3 • 12
view article Article Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments Jan 20 • 12
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 109
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR Paper • 2602.05261 • Published Feb 5 • 52
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods +3 Jan 18, 2024 • 80
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 307