Kreshnik's Collections

Voice

updated about 21 hours ago

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 61.5k • 2.27k

  • FastVLM WebGPU 🍎

    Configuration error • Featured • 445

    Real-time video captioning powered by FastVLM


  • openbmb/VoxCPM-0.5B

    Text-to-Speech • Updated Sep 19, 2025 • 531 • 768

  • MiMo-Audio-Chat 💬

    Running on CPU Upgrade • 78

    Chat with Xiaomi MiMo-Audio using voice


  • FlashLabs/Chroma-4B

    Any-to-Any • Updated Jan 28 • 1.17k • 342

  • numind/NuMarkdown-8B-Thinking

    Image-to-Text • Updated Nov 13, 2025 • 108k • 449

  • CohereLabs/cohere-transcribe-03-2026

    Automatic Speech Recognition • Updated 1 day ago • 50.5k • 597