Speech Audio
Speech-to-text and text-to-speech with faster-whisper, Silero, and speaker diarization.
About
Unified audio processing server combining speech-to-text (faster-whisper), text-to-speech (Silero), and speaker diarization (SpeechBrain). Supports GPU acceleration, multiple Whisper models, and Russian/English TTS voices. Exposes capabilities as both REST API and MCP tools.
Is this your project?
Claim this listing to manage your page, access analytics, and unlock upgrades. Verification takes 60 seconds.
Share This Project
Embed Badge
Add this badge to your README:
[](https://hifriendbot.com/ai-list/speech-audio/)
