inference-optimization
/

Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_BLOCK-down_proj-all

compressed-tensors

Model card Files Files and versions

Llama-3.1-8B-Instruct-Mixed-NVFP4-FP8_BLOCK-down_proj-all

6.87 GB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

krishnateja95's picture

Upload folder using huggingface_hub

d10e0c8 verified 6 months ago