TTS WebUI
by Various
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, S
Added 1 June 2026
Overview
TTS WebUI is a single Gradio and React interface that bundles extensions for numerous text-to-speech and audio generation models including ACE-Step, OmniVoice, Piper TTS, GPT-SoVITS, and many more. It allows users to load and switch between supported models locally, providing a unified environment for inference and experimentation.
Best for
Developers and audio enthusiasts who want a single hub to experiment with diverse TTS and audio generation models
Use cases
- Run multiple TTS models without separate installations or configurations
- Compare synthesized speech outputs from different engines side by side
- Integrate local TTS generation into custom pipelines via the web API
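
As a rough illustration of the pipeline use case, the sketch below posts text to a locally running server and saves the returned audio. The address, route, and JSON field names (`BASE_URL`, `ROUTE`, `text`, `model`) are placeholders assumed for this example, not TTS WebUI's documented API; check your own installation for the actual endpoint and payload shape.

```python
import json
import urllib.request

# Hypothetical values: replace with the address and route your install exposes.
BASE_URL = "http://localhost:7770"
ROUTE = "/api/generate"


def build_tts_request(text, model, base_url=BASE_URL, route=ROUTE):
    """Build (but do not send) a JSON POST request for a local TTS server.

    The payload keys "text" and "model" are assumptions for illustration.
    """
    payload = json.dumps({"text": text, "model": model}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + route,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, model, out_path, timeout=120):
    """Send the request and write the raw response bytes to out_path."""
    req = build_tts_request(text, model)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        audio = resp.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

Keeping request construction separate from the network call lets a pipeline validate or log the payload before anything is sent.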
Notes
3,153 stars on GitHub. Last updated 2026-05-14. Licensed MIT.
Pros
- Supports a wide range of popular and niche TTS and audio models in one tool
- Open source under the MIT license, with over 3,000 GitHub stars and a recent update (2026-05-14)
- Customizable through extensions and runs locally with no cloud dependency
Cons
- Requires local GPU and significant setup for resource-intensive models
- Not all model-specific advanced features are exposed through the UI
- Setup can be complex for users without command-line experience
Indexed from awesome-generative-ai and enriched against its public facts.