-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • 685B • Updated • 71.9k • • 915 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 295 • 53 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 50.5k • • 901 -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • 685B • Updated • 10.2k • 582
DeepSeek
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 1.23M • • 12.9k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.19k • 937 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 351k • • 734 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 2.52M • • 1.47k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 10.5k • 652 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 9.54k • 146 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 3.93k • 89 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.3k • 80
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
models for paper expert-specialized fine-tuning
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 37.9k • 554 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 46.3k • 459 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 9.57k • 143 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • 1B • Updated • 629k • 150
-
Chat with DeepSeek-VL2-small
🌍567Generate responses using images and text input
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 63.5k • 237 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 7.24k • 169 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • 27B • Updated • 3.53k • 371
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 97.4k • 673 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 90.5k • 80 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 7.42k • 97 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 221k • • 509
DeepSeek-VL model series
DeepSeek LLM series
DeepSeek MoE series
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • 685B • Updated • 71.9k • • 915 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 295 • 53 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 50.5k • • 901 -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • 685B • Updated • 10.2k • 582
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 1.23M • • 12.9k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.19k • 937 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 351k • • 734 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 2.52M • • 1.47k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 10.5k • 652 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 9.54k • 146 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 3.93k • 89 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.3k • 80
-
Chat with DeepSeek-VL2-small
🌍567Generate responses using images and text input
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 63.5k • 237 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 7.24k • 169 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • 27B • Updated • 3.53k • 371
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 97.4k • 673 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 90.5k • 80 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 7.42k • 97 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 221k • • 509
models for paper expert-specialized fine-tuning
DeepSeek-VL model series
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 37.9k • 554 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 46.3k • 459 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 9.57k • 143 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • 1B • Updated • 629k • 150
DeepSeek LLM series
DeepSeek MoE series