Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

mistralai
/
Voxtral-Small-24B-2507

Audio-Text-to-Text
Safetensors
vllm
voxtral
Model card Files Files and versions
xet
Community
26
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

vLLM + Voxtral Setup Issue

#26 opened about 2 months ago by
susovantelecmi

How to turn on speaker diarization ?

1
#24 opened 3 months ago by
megabob

Jarvis

#23 opened 4 months ago by
Krishnasahu

Fine tunning

3
#22 opened 4 months ago by
rorosese

English Translation

βž• 1
#21 opened 5 months ago by
Sastrvvs

why consolidated.safetensors is required?

3
#20 opened 5 months ago by
Hansen-Wu

Quantised Version

πŸ”₯ 1
1
#19 opened 5 months ago by
steee

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:3!

#17 opened 5 months ago by
hoseongahn

ValueError: There is no module or parameter named 'mm_whisper_embeddings' in LlamaForCausalLM

#15 opened 6 months ago by
dingyuansheng

Update README.md

#14 opened 6 months ago by
arafapollo

Really appreciate the work you put into this.🀍

πŸ”₯ 3
#12 opened 6 months ago by
deep-div

Add support to llama.cpp

πŸ‘ 6
#11 opened 6 months ago by
wraps

Improve model card: Update library, add paper link, abstract summary, and refine tags

#10 opened 6 months ago by
nielsr

Does this model support streaming ASR recognition, or are there any plans to open-source a streaming model?

4
#8 opened 6 months ago by
Qoboty

Large audio files

1
#7 opened 6 months ago by
nherve
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs