Instructions to use prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1") model = AutoModelForImageTextToText.from_pretrained("prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1
- SGLang
How to use prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1 with Docker Model Runner:
docker model run hf.co/prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1
Qwen3-VL-8B-Thinking-abliterated
Qwen3-VL-8B-Thinking-abliterated is an abliterated (v1.0) variant of Qwen3-VL-8B-Thinking, designed for Abliterated Reasoning and Captioning. This model produces detailed captions and reasoning outputs across a wide range of visual and multimodal contexts, including complex, sensitive, or nuanced content. It supports diverse aspect ratios, resolutions, and prompt conditions while maintaining reasoning integrity and descriptive precision.
Key Highlights
Abliterated / Uncensored Captioning Fine-tuned to bypass conventional content filters while preserving factual, descriptive, and reasoning-rich outputs.
High-Fidelity Reasoning and Descriptions Generates comprehensive captions and reasoning for general, artistic, technical, abstract, and low-context images.
Robust Across Aspect Ratios Performs consistently on wide, tall, square, panoramic, and irregular image dimensions.
Variational Detail Control Produces outputs ranging from concise summaries to fine-grained, high-context reasoning and descriptions.
Foundation on Qwen3-VL-8B-Thinking Architecture Built upon the Qwen3-VL-8B-Thinking model’s advanced multimodal reasoning and instruction-following capabilities.
Multilingual Output Capability Defaults to English but can be adapted to other languages via prompt engineering.
Quick Start with Transformers
from transformers import Qwen3VLForConditionalGeneration, AutoProcessor
from qwen_vl_utils import process_vision_info
import torch
model = Qwen3VLForConditionalGeneration.from_pretrained(
"prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1",
torch_dtype="auto",
device_map="auto"
)
processor = AutoProcessor.from_pretrained("prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1")
messages = [
{
"role": "user",
"content": [
{
"type": "image",
"image": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VL/assets/demo.jpeg",
},
{"type": "text", "text": "Provide a detailed caption and reasoning for this image."},
],
}
]
text = processor.apply_chat_template(
messages, tokenize=False, add_generation_prompt=True
)
image_inputs, video_inputs = process_vision_info(messages)
inputs = processor(
text=[text],
images=image_inputs,
videos=video_inputs,
padding=True,
return_tensors="pt",
).to("cuda")
generated_ids = model.generate(**inputs, max_new_tokens=128)
generated_ids_trimmed = [
out_ids[len(in_ids):] for in_ids, out_ids in zip(inputs.input_ids, generated_ids)
]
output_text = processor.batch_decode(
generated_ids_trimmed,
skip_special_tokens=True,
clean_up_tokenization_spaces=False
)
print(output_text)
Intended Use
This model is suited for:
- Generating detailed, uncensored captions and reasoning for general-purpose, artistic, or research-oriented datasets.
- Research in content moderation, red-teaming, and generative safety analysis.
- Enabling descriptive captioning and reasoning for datasets typically excluded from mainstream models.
- Creative applications such as visual storytelling, art description, and multimodal reasoning exploration.
- Captioning and reasoning for images with non-standard or stylized visual structures.
Limitations
- May generate explicit, sensitive, or offensive content depending on prompts and image input.
- Not suitable for production systems that require strict content moderation.
- Output style, tone, and reasoning depth may vary based on input phrasing.
- Accuracy may fluctuate for abstract, synthetic, or highly stylized visuals.
- Downloads last month
- 171
Model tree for prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1
Base model
Qwen/Qwen3-VL-8B-Thinking