STiFLeR7
/

Phi2-GPTQ

Text Generation

text-generation-inference

8-bit precision

Model card Files Files and versions

STiFLeR7 commited on Apr 4

Commit

0056b5d

·

verified ·

1 Parent(s): a15c650

Update README.md

Files changed (1) hide show

README.md +18 -2

README.md CHANGED Viewed

@@ -1,3 +1,19 @@
 # 🧠 Phi-2 GPTQ (Quantized)
 This repository provides a 4-bit GPTQ quantized version of the **Phi-2** model by Microsoft, optimized for efficient inference using `gptqmodel`.
@@ -30,8 +46,8 @@ This model is ready-to-use with the Hugging Face `transformers` library.
 ## 📖 References
-- Microsoft Phi-2: https://huggingface.co/microsoft/phi-2
-- GPTQModel: https://github.com/ModelCoud/GPTQModel
 - Transformers: https://github.com/huggingface/transformers
 ## ⚖️ License

+---
+license: apache-2.0
+tags:
+  - gptq
+  - quantized
+  - causal-lm
+  - transformers
+  - pytorch
+  - phi-2
+  - text-generation
+library_name: transformers
+pipeline_tag: text-generation
+base_model: microsoft/phi-2
+inference: true
+---
 # 🧠 Phi-2 GPTQ (Quantized)
 This repository provides a 4-bit GPTQ quantized version of the **Phi-2** model by Microsoft, optimized for efficient inference using `gptqmodel`.
 ## 📖 References
+- Microsoft Phi-2: https://huggingface.co/microsoft/phi-2
+- GPTQModel: https://github.com/ModelCoud/GPTQModel
 - Transformers: https://github.com/huggingface/transformers
 ## ⚖️ License