Small-reasoning-models
HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 61.3k • 893
Qwen/Qwen3-4B-Base Text Generation • 4B • Updated Jul 26, 2025 • 265k • 79
Fine-tuning
Text-to-LoRA: Instant Transformer Adaption Paper • 2506.06105 • Published Jun 6, 2025 • 2
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published Jun 19, 2025 • 130
ChatDOC/OCRFlux-3B Image-to-Text • 4B • Updated Jul 9, 2025 • 189k • 364
AI Labs - FT Datasets
Datasets that we want to use for experimenting with fine-tuning
Salesforce/ReasoningJudgeBench Viewer • Updated Jun 7, 2025 • 1.48k • 316 • 5
allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 277k • 313
bobox/OpenbookQA-4ST Viewer • Updated Jul 9, 2024 • 9.28k • 35 • 3
Salesforce/wikitext Viewer • Updated Jan 4, 2024 • 3.71M • 835k • 633