whisper

★ 100,318 updated 1mo ago

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Alternative to1entry

Faster Whisper

Community

Faster Whisper transcription with CTranslate2

★ 23,312 updated 7mo ago

Used by28entries

A Agents Autonomous low

gpt-migrate

Community

Easily migrate your codebase from one language or framework to another.

clanker-records/crompton-network

Various

Machine-native listening platform for C.W.A.'s Straight Outta Crompton. Your agent can listen. For real.

★ 0 updated 1mo ago

eviscerations/whisper-windows-mcp

Various

Windows-native MCP server for local audio transcription — GPU accelerated via Vulkan, works with Claude Desktop

★ 0 updated 1mo ago

mediar-ai/screenpipe

Various

YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure

★ 19,049 updated 1mo ago

mohitbadwal/ringback

Various

Let your AI agent call your phone and talk to you — MCP servers for live, interruptible voice calls + tiered alerts, using free self-hosted pieces (pjsua2 + whisper.cpp + Linphone)

★ 3 updated 1mo ago

samson-art/transcriptor-mcp

Various

An MCP server (stdio + HTTP/SSE) that fetches video transcripts/subtitles via yt-dlp, with pagination for large responses. Supports YouTube, Twitter/X, Instagram, TikTok, Twitch, V

★ 10 updated 1mo ago

transcribe-app/mcp-transcribe

Various

Add transcription tools to your AI-powered assistants.

★ 6 updated 3mo ago

waxberry-dev/live-translate-mcp

Various

MCP server for local speech translation (EN ↔ 中文) via Whisper + Claude + Piper

★ 1 updated 1mo ago

AudioGPT

Community

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

★ 10,179 updated 2y ago

langchain_yt_tools

Community

Langchain tools to search/extract/transcribe text transcripts of Youtube videos. Some of this has been integrated into LangChain main branch

★ 76 updated 3y ago

Marvin

Community

an ambient intelligence library

★ 6,162 updated 2mo ago

Off Grid

Community

The Swiss Army Knife of Offline AI. Chat, Speak, and Generate Images - Privacy First, Zero Internet. Download an LLM and use it on your mobile device. No data ever leaves your phon

★ 2,335 updated 1mo ago

Pipecat

Community

Open Source framework for voice and multimodal conversational AI

★ 12,588 updated 1mo ago

P Apps Productivity one click

whisper-ctranslate2

Community

Whisper command line client compatible with original OpenAI client based on CTranslate2.

★ 1,309 updated 5mo ago

Fireflies

Fireflies.ai

AI meeting assistant. Records, transcribes, summarises, and pipes the output to your stack.

P Apps Productivity one click

Granola

AI notepad for meetings. Take your own notes, Granola enhances them after the call with the audio context.

P Apps Productivity one click

Krisp

AI noise cancellation, meeting transcription, and accent translation. The audio layer for the modern call.

Limitless

Various

Go beyond your mind’s limitations: Personalized AI powered by what you’ve seen, said, and heard.

Loopin AI

Various

loopinhq.com

Magnific

Various

The complete platform of creative AI tools for image, video, and audio generation. Create anything from campaigns, product shots to filmmaking. Be Magnific. #magnific

Otter.ai

Various

Otter AI Meeting Agent supports real-time transcription, live chat, automated summaries, insights, and action items.

PyGPT

Various

PyGPT is an open‑source desktop AI assistant for Windows, macOS and Linux. Chat, agents, web search, run Python, TTS/STT, plugins, long‑term memory.

Read AI

Various

Read AI, the fastest growing AI meeting assistant, ever, delivers real-time transcription, smart summaries, and enables AI search and discovery across all your content including

RunThisLLM

Various

Find out exactly what hardware you need to run any local LLM, image, video, or audio AI model. 275+ models with full build specs and performance estimates.

Teleprompter

Various

An on-device AI for your meetings that listens to you and makes charismatic quote suggestions.

★ 335 updated 3y ago

Vibe Transcribe

Various

Local-first transcription for audio and video with AI summaries, multilingual support, and privacy-focused processing.

Wispr Flow

Various

Flow makes writing quick and clear with seamless voice dictation. It is the fastest, smartest way to type with your voice.

YouTube Summary with ChatGPT

Various

Use ChatGPT to summarize YouTube videos.

Powers1entry

video-edit-mcp

Various

🐍 🏠 🍎 🪟 - Comprehensive video and audio editing MCP server with advanced operations including trimming, merging, effects, overlays, format conversion, audio processing, YouTube

★ 22 updated 11mo ago

Pairs with4entries

bark

Community

🔊 Text-Prompted Generative Audio Model

★ 39,142 updated 1y ago

CTranslate2

Community

Fast inference engine for Transformer models

★ 4,507 updated 1mo ago

simulate-sdk

Community

Enterprise Grade, Voice AI simulation SDK for testing your AI Agents

★ 59 updated 2mo ago

Affogato

Various

Create AI product video ads in seconds. Generate TikTok, Reels & Shorts that sell — no crew, no editing, just studio-quality ads fast.

Alternatives1entry