Your LLM Knows the Future: Uncovering Its Multi-Token Prediction Potential Paper • 2507.11851 • Published Jul 16, 2025 • 1
Bone: Block Affine Transformation as Parameter Efficient Fine-tuning Methods for Large Language Models Paper • 2409.15371 • Published Sep 19, 2024 • 3
Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 Mar 5 • 50
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 501