MTP-LM Collection Models to accompany "Multi-Token Prediction via Self-Distillation" (arxiv:2602.06019) • 4 items • Updated 25 days ago • 3
MTP-LM Collection Models to accompany "Multi-Token Prediction via Self-Distillation" (arxiv:2602.06019) • 4 items • Updated 25 days ago • 3
jwkirchenbauer/debug_metamath_full_rand_k2-8_ex_valk_baseline_latest Text Generation • 8B • Updated Feb 9 • 3