| --- |
| license: cc-by-nc-4.0 |
| pipeline_tag: text-to-speech |
| library_name: f5-tts |
| datasets: |
| - amphion/Emilia-Dataset |
| --- |
| |
| Download [F5-TTS](https://huggingface.co/SWivid/F5-TTS/tree/main/F5TTS_Base) or [E2 TTS](https://huggingface.co/SWivid/E2-TTS/tree/main/E2TTS_Base) and place under ckpts/ |
| ``` |
| ckpts/ |
| F5TTS_v1_Base/ |
| model_1250000.safetensors |
| F5TTS_Base/ |
| model_1200000.safetensors |
| E2TTS_Base/ |
| model_1200000.safetensors |
| ``` |
| Github: https://github.com/SWivid/F5-TTS |
| Paper: [F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching](https://huggingface.co/papers/2410.06885) |