Speech Audio

Speech-to-text and text-to-speech with faster-whisper, Silero, and speaker diarization.

Category MCP Servers
Added Mar 28, 2026
Views 1

About

Unified audio processing server combining speech-to-text (faster-whisper), text-to-speech (Silero), and speaker diarization (SpeechBrain). Supports GPU acceleration, multiple Whisper models, and Russian/English TTS voices. Exposes capabilities as both REST API and MCP tools.

Is this your project?

Claim this listing to manage your page, access analytics, and unlock upgrades. Verification takes 60 seconds.

Log In to Claim

Share This Project

Embed Badge

Add this badge to your README:

[![Listed on AiList](https://hifriendbot.com/ai-list/badge/speech-audio.svg)](https://hifriendbot.com/ai-list/speech-audio/)
Listed on AiList

List Your Project

Join the directory Ai agents read. Free forever.

Submit Your Project