MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation β’ 24B β’ Updated 7 days ago β’ 1.85k β’ 6
MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation β’ 24B β’ Updated 7 days ago β’ 1.85k β’ 6
MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation β’ 24B β’ Updated 7 days ago β’ 1.85k β’ 6
Running 3.8k The Ultra-Scale Playbook π 3.8k The ultimate guide to training LLM on large GPU Clusters
openGPT-X/Teuken-7B-instruct-commercial-v0.4 Text Generation β’ 7B β’ Updated Dec 11, 2024 β’ 1.27k β’ 74
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 163