You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Model Card for gemma-3-tw-270m-thinking

gemma-3-tw-270m-thinkinggemma-3-tw-270m-itthinking(推理)版本:在指令微調版的基礎上加入帶 <think>...</think> 思考段落的訓練資料,使模型能在回答前先輸出思考過程,提升其在多步推理、條件判斷等任務上的穩定度,同時維持 270M 級的小尺寸。

⚠️ 規格重點: 本模型為 270M 參數 SLM、純文本單模態,回應前段為 <think>...</think> 推理區段,後段為最終答案。

Model Details

小規模語言模型在多步推理任務上常顯吃力。本模型嘗試把「先思考、再回答」的格式直接寫入 SFT 訓練資料,讓 270M 級的小模型也能模仿大型 reasoning model 的回答結構。雖然受限於模型容量、推理深度仍有上限,但對於日常條件判斷、簡單規則推理、結構化輸出已能提供顯著改善。

核心特點 (Key Features)

  1. 270M 級的 reasoning 能力:保留小尺寸優勢,同時透過 <think> 訓練格式取得可解釋的推理步驟。
  2. 端側可部署:適用於需要 reasoning 步驟、又不能上雲的場景。
  3. 可下游微調:作為小型 reasoning chatbot、教學助理等應用的基底。

Model Description

Model Sources

Citation

@misc{gemma_3_tw_270m_thinking,
  title        = {gemma-3-tw-270m-thinking: A Reasoning-style Lightweight Traditional Chinese Model for Taiwan},
  author       = {Huang, Liang Hsun},
  year         = {2025},
  howpublished = {\url{https://huggingface.co/lianghsun/gemma-3-tw-270m-thinking}}
}

Acknowledge

  • 特此感謝 APMIC 的算力支援。

Model Card Authors

Huang Liang Hsun

Model Card Contact

Huang Liang Hsun

Downloads last month
-
Safetensors
Model size
0.4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for lianghsun/gemma-3-tw-270m-thinking

Finetuned
(5)
this model

Collection including lianghsun/gemma-3-tw-270m-thinking