Image-Text-to-Text
Transformers
Safetensors
PyTorch
llama4
facebook
meta
llama
conversational
text-generation-inference
Instructions to use sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8")
model = AutoModelForImageTextToText.from_pretrained("sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8 with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8",
        "messages": [
            {
                "role": "user",
                "content": [
                    { "type": "text", "text": "Describe this image in one sentence." },
                    { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } }
                ]
            }
        ]
    }'
Use Docker
docker model run hf.co/sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8
- SGLang
How to use sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8 with SGLang:
Install from pip and serve model
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8",
        "messages": [
            {
                "role": "user",
                "content": [
                    { "type": "text", "text": "Describe this image in one sentence." },
                    { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } }
                ]
            }
        ]
    }'
Use Docker images
docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8" \
        --host 0.0.0.0 \
        --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8",
        "messages": [
            {
                "role": "user",
                "content": [
                    { "type": "text", "text": "Describe this image in one sentence." },
                    { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } }
                ]
            }
        ]
    }'
- Docker Model Runner
How to use sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8 with Docker Model Runner:
docker model run hf.co/sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8
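The curl calls in the vLLM and SGLang entries above can also be issued from Python. The following is a minimal sketch using only the standard library, not an official client: `build_chat_payload` and `chat` are hypothetical helper names, the base URL is whatever host and port the server was started on (8000 in the vLLM example, 30000 in the SGLang examples), and the reply is read from `choices[0].message.content` on the assumption that the server returns the usual OpenAI-compatible response layout.

```python
import json
import urllib.request

MODEL_ID = "sureshnam9/Llama-4-Maverick-17B-128E-Instruct-FP8"


def build_chat_payload(prompt, image_url=None, model=MODEL_ID):
    """Build the same JSON body that the curl examples send (hypothetical helper)."""
    content = [{"type": "text", "text": prompt}]
    if image_url is not None:
        content.append({"type": "image_url", "image_url": {"url": image_url}})
    return {"model": model, "messages": [{"role": "user", "content": content}]}


def chat(base_url, payload, timeout=600):
    """POST the payload to <base_url>/v1/chat/completions and return the reply text."""
    request = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # Assumption: OpenAI-compatible response layout.
    return body["choices"][0]["message"]["content"]
```

With a server running locally, `chat("http://localhost:8000", build_chat_payload("Describe this image in one sentence.", "<image URL>"))` would send the same request as the vLLM curl example; swap the port to 30000 for SGLang.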
Add files using upload-large-folder tool
- model-00001-of-00055.safetensors +3 -0
- model-00004-of-00055.safetensors +3 -0
- model-00006-of-00055.safetensors +3 -0
- model-00008-of-00055.safetensors +3 -0
- model-00010-of-00055.safetensors +3 -0
- model-00012-of-00055.safetensors +3 -0
- model-00015-of-00055.safetensors +3 -0
- model-00017-of-00055.safetensors +3 -0
- model-00019-of-00055.safetensors +3 -0
- model-00021-of-00055.safetensors +3 -0
- model-00022-of-00055.safetensors +3 -0
- model-00023-of-00055.safetensors +3 -0
- model-00026-of-00055.safetensors +3 -0
- model-00028-of-00055.safetensors +3 -0
- model-00029-of-00055.safetensors +3 -0
- model-00030-of-00055.safetensors +3 -0
- model-00032-of-00055.safetensors +3 -0
- model-00034-of-00055.safetensors +3 -0
- model-00037-of-00055.safetensors +3 -0
- model-00039-of-00055.safetensors +3 -0
- model-00041-of-00055.safetensors +3 -0
- model-00043-of-00055.safetensors +3 -0
- model-00045-of-00055.safetensors +3 -0
- model-00048-of-00055.safetensors +3 -0
- model-00050-of-00055.safetensors +3 -0
- model-00052-of-00055.safetensors +3 -0
model-00001-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:efce05b3a22ed37598d8d0786e20a2b9c29a21453292b54e801e327e0d06a4bb
+size 21474836664
model-00004-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2dc9c29d4efc0ab450b399eb5fd90be0e184ed90e7b135391b2b81ecb2b9ed03
+size 21474836664
model-00006-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e45a1cd01cedd92a2508f198777dee67db92e903ae54b58423a40383af4d665c
+size 21474836664
model-00008-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c131d4dcf230c04bbb610072204124f3c878a25018cd00adfdd95e2f601a9d09
+size 21474836664
model-00010-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1dafa9b11c218e910782365699492314b45791fe2b9ba7dc605f3f1da6114487
+size 21474836664
model-00012-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:aa35f27320d3b3c6646368cb7767fd6ccdddbcb7c866ef2a2b75eb98fcf69f3c
+size 21474836664
model-00015-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2183591307c18de9dd98d8ac9f5976887e38897a6eb842b8f44949c14fdb4b36
+size 21474836664
model-00017-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6150f5bdca4aa7a318116c33ad5b8addc5f90fdb334699d09eb02d64b9bbd087
+size 21474836664
model-00019-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2ce669fab9a87344f4b9bc93c179ddedbf862ada5ce0ecdf3e76b8e6184d36e6
+size 21474836664
model-00021-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:36a251af6912ff38985ec53fb4435ca0bf41a068ff1bc9f296d2d9f93e80f7b9
+size 21474836664
model-00022-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:25079d1f765578e7b89a33534d2272b4259af1c475a2f1da38df40e1943c4c1f
+size 10737418416
model-00023-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fb0678efd225f6f2a6af27345633ef65f87a57f3b243bfa2863c2a273d1e2e2b
+size 21474836664
model-00026-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:97469ddd53051a4c610d890b0c602eeceebb81aafc56d56d5a43c4ad89d40e67
+size 21474836664
model-00028-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b296784dc758a56e0355c22d858eedc6a0c000988954203fc8c3340a7ce445ec
+size 21474836664
model-00029-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:883dc2ddfad309ab7667d132fba4d148a3e297fe559b11489a6b7a40148aa5e2
+size 10737418416
model-00030-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:57217a11fd9d497fa6d5e77807017771860ee814a3a1bc3591ad975740f594eb
+size 21474836664
model-00032-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f4f607e7980e3df4069d33944b6110e98c714df567f579b95ef24bcfe30a968e
+size 21474836664
model-00034-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:321f076a6becaf984a2e91ed288c839679b7eb4403441756a7b7a84dff99d7ec
+size 21474836664
model-00037-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b00d916897359fd9843e65ad89d0d107a032ba0996faf8277ff05e67134a9160
+size 21474836664
model-00039-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d57f9c108f654dee9bb7a653562698136d016b847f198e376718d0763b9c524b
+size 21474836664
model-00041-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:53b223b71f8d43168e47ba99511dc87c977d356720e01c2a52be0fc07d9edf96
+size 21474836664
model-00043-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4a6eb5d215f243e2b7c179270a5aae1f4a8a83aaf5de5781f6c73e0f1adbb48f
+size 21474836664
model-00045-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:56da1e23a9025d3192cbf6f7263aca43fa02354a3a0bd26797816b9f9a5f118b
+size 21474836664
model-00048-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c93d2f986787607cfc6ff50590bf5a95502ba8992c2ebca40bc99fac8f254f55
+size 21474836664
model-00050-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d6f5abe291c5be3d46631cd140dd82ac3beb8fe4448ebdb5487cc8108f20abd2
+size 21474836664
model-00052-of-00055.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bbe4aa790be7dc38630216f5dbf4f0b1659f31ab66aeaeb8f03ea96cc293f92a
+size 21474836664
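Each file added in this commit is a Git LFS pointer with three fields: `version`, `oid` (a SHA-256 digest), and `size` in bytes. As a rough sketch of how those fields could be used to check a downloaded shard, the Python below parses a pointer and compares it against a local file. `parse_lfs_pointer` and `verify_file` are hypothetical helper names introduced for illustration; the pointer text is the one recorded for model-00001-of-00055.safetensors, and the local file path is left for the caller to supply.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict (hypothetical helper)."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and SHA-256 digest."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    # Cheap check first: a size mismatch avoids hashing ~20 GB for nothing.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so the shard never has to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents for model-00001-of-00055.safetensors, as recorded in this commit.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:efce05b3a22ed37598d8d0786e20a2b9c29a21453292b54e801e327e0d06a4bb
size 21474836664
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

Calling `verify_file("<local path to the shard>", pointer)` would then report whether the download is complete and uncorrupted. The size comparison runs before hashing because it is nearly free and catches truncated downloads immediately.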