Voice App Development Tools
89 tools found
Showing 31–60 of 89 tools
Optimised Whisper implementation for faster speech transcription.
Real-time speech recognition API with custom model training support.
Speech AI API with transcription, LeMUR, and audio intelligence features.
AI-powered speech recognition with 50+ languages and real-time streaming.
Speech recognition API with human-in-the-loop accuracy options.
Speech-to-text API with translation and audio intelligence capabilities.
Python bindings for PortAudio for audio I/O in voice applications.
Python library for speech recognition with multiple engine backends.
Python text-to-speech conversion library that works offline.
SDK for adding speech capabilities to apps on any platform.
On-device wake word detection engine for privacy-preserving voice apps.
End-to-end on-device voice AI platform with privacy-first processing.
Open-source wake word detection using audio embeddings.
Pre-trained speech models for STT, TTS, and VAD in multiple languages.
Toolkit for building conversational AI with speech and NLP models.
Speech recognition toolkit widely used in research and production.
Open-source speech-to-text engine based on the DeepSpeech architecture.
Offline speech recognition toolkit for 20+ languages with a small footprint.
Open-source speech synthesiser with support for 100+ languages.
General multilingual speech synthesis system from the University of Edinburgh.
Microsoft open-source SDK for building conversational voice chatbots.
Framework for chaining LLMs with speech models for voice applications.
API for building human-like voice AI agents for phone and web.
Programmable AI phone agents for automated voice calls at scale.
Voice AI infrastructure for building real-time voice assistants.
Platform for building and testing AI voice agents for customer service.
Open-source library for building voice bots and real-time audio agents.
Open-source framework for voice and multimodal conversational AI agents.
Open-source WebRTC infrastructure for real-time voice and video agents.
Programmable voice API for adding phone calling to any application.