leran1995 commited on
Commit
4367424
·
verified ·
1 Parent(s): 3aa07cc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -1
README.md CHANGED
@@ -23,11 +23,17 @@ tags:
23
  # Introduction
24
  Nanbeige4-3B-Thinking-2511 is an enhanced iteration over our previous Nanbeige4-3B-Thinking-2510.
25
  Through advanced distillation techniques and reinforcement learning (RL) optimization, we have effectively scaled the model’s reasoning capacity, resulting in superior performance across a broad range of benchmarks.
26
- Notably, Nanbeige4-3B-Thinking-2511 achieves state-of-the-art (SOTA) results among models smaller than 32B parameters on Arena-Hard-V2 and BFCL-V4.
 
27
  This marks a major milestone in delivering powerful, efficient reasoning performance at a compact scale.
28
 
29
  <div align="center">
30
 
 
 
 
 
 
31
  <img src="figures/performance_2511.png">
32
  </div>
33
 
 
23
  # Introduction
24
  Nanbeige4-3B-Thinking-2511 is an enhanced iteration over our previous Nanbeige4-3B-Thinking-2510.
25
  Through advanced distillation techniques and reinforcement learning (RL) optimization, we have effectively scaled the model’s reasoning capacity, resulting in superior performance across a broad range of benchmarks.
26
+ On math and science reasoning benchmarks, Nanbeige4-3B-Thinking-2511 outperforms Qwen3-4B-Thinking-2507, Qwen3-8B-Thinking-2504, and Qwen3-14B-Thinking-2504 with a significant margin.
27
+ Besides, Nanbeige4-3B-Thinking-2511 achieves state-of-the-art (SOTA) results among models smaller than 32B parameters on Arena-Hard-V2 and BFCL-V4.
28
  This marks a major milestone in delivering powerful, efficient reasoning performance at a compact scale.
29
 
30
  <div align="center">
31
 
32
+ <img src="figures/performance_reasoning.png">
33
+ </div>
34
+
35
+ <div align="center">
36
+
37
  <img src="figures/performance_2511.png">
38
  </div>
39