smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-3e6 Updated about 8 hours ago
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5 Updated 1 day ago
smcleish/Recurrent-TinyLlama-3T-train-recurrence-4-single-phase Text Generation • 0.8B • Updated Nov 11, 2025 • 5
smcleish/Recurrent-TinyLlama-3T-train-recurrence-4-two-phase Text Generation • 0.8B • Updated Nov 11, 2025 • 4
smcleish/Recurrent-OLMo-2-0425-train-recurrence-4 Text Generation • 1B • Updated Nov 11, 2025 • 9 • 1
smcleish/Recurrent-OLMo-2-0425-train-recurrence-32 Text Generation • 1B • Updated Nov 11, 2025 • 17 • 2
smcleish/Recurrent-TinyLlama-3T-train-recurrence-32 Text Generation • 0.8B • Updated Nov 11, 2025 • 19 • 1
smcleish/Recurrent-TinyLlama-3T-train-recurrence-8 Text Generation • 0.8B • Updated Nov 11, 2025 • 10
smcleish/Recurrent-TinyLlama-3T-train-recurrence-16 Text Generation • 0.8B • Updated Nov 11, 2025 • 8 • 1