P Apps and SaaS Productivity low

TTS WebUI

Name: TTS WebUI
Availability: InStock
Author: Various

by Various

A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, S

Visit Various Submit your build →

Apps

TTS WebUI

Added 1 June 2026

#ace-step #ai #audio-generation #cosyvoice #generative-ai #generator #gradio #music

Overview

TTS WebUI is a single Gradio and React interface that bundles extensions for numerous text-to-speech and audio generation models including ACE-Step, OmniVoice, Piper TTS, GPT-SoVITS, and many more. It allows users to load and switch between supported models locally, providing a unified environment for inference and experimentation.

Best for

Best for
Developers and audio enthusiasts who want a single hub to experiment with diverse TTS and audio generation models

Use cases

Run multiple TTS models without separate installations or configurations
Compare synthesized speech outputs from different engines side by side
Integrate local TTS generation into custom pipelines via the web API

Notes

3,153 stars on GitHub. Last updated 2026-05-14. Licensed MIT.

Use cases

Run multiple TTS models without separate installations or configurations
Compare synthesized speech outputs from different engines side by side
Integrate local TTS generation into custom pipelines via the web API

Pros

Supports a wide range of popular and niche TTS and audio models in one tool
Open source and actively maintained with over 3,000 GitHub stars
Customizable through extensions and runs locally with no cloud dependency

Cons

Requires local GPU and significant setup for resource-intensive models
Not all model-specific advanced features are exposed through the UI
Setup can be complex for users without command-line experience

Indexed from awesome-generative-ai and enriched against its public facts.

Pros

Supports a wide range of popular and niche TTS and audio models in one tool
Open source and actively maintained with over 3,000 GitHub stars
Customizable through extensions and runs locally with no cloud dependency

Cons

Requires local GPU and significant setup for resource-intensive models
Not all model-specific advanced features are exposed through the UI
Setup can be complex for users without command-line experience

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Uses1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Pairs with1entry

P Apps Productivity low

Open WebUI

Various

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

★ 139,558 updated 1mo ago

Pairs with2entries

P Apps Productivity one click

ElevenLabs

AI voice generation, cloning, and conversational voice agents. The default voice layer for the AI ecosystem.

P Apps Productivity low

TorToiSe

Various

A multi-voice TTS system trained with an emphasis on quality

★ 14,852 updated 1y ago

Alternatives2entries

P Apps Productivity low

Resemble AI

Various

Resemble AI helps enterprises generate secure voice AI, verify proper usage, and detect deepfakes instantly. Available on-prem or via cloud. Built for enterprise scale with gover

P Apps Productivity low

WellSaid

Various

Create professional-quality voice overs in any dialect or production style with our secure AI voices. Try WellSaid’s text-to-speech AI voices for free today.

← Back to Apps and SaaS Submit your own entry →