
    voicera_mono_repository

    COSS-India

    Framework for building real-time voice AI agents with telephony integration, speech processing, multi-provider support, and automated knowledge retrieval.

    About voicera_mono_repository

    Voicera is an open-source framework that bridges the gap between prototyping a voice AI agent and deploying it on real phone lines. It provides unified building blocks for real-time speech-to-text, text-to-speech, and LLM-powered conversational agents, with an emphasis on developer experience.

    For the Non-Technical Reader

    Imagine you are building a flight simulator for customer service. Usually, just to test a voice AI, you have to buy a phone number, set up complex wiring, and pay for every minute of testing. Voicera changes this by being "browser-first": you can build and talk to your AI assistant directly in your web browser for free before you ever connect it to a real phone line. It also features a "knowledge base": upload a PDF, and the AI can answer questions based on that document, which makes it a plug-and-play option for automated support.

    For the Technical Reader

    Voicera is a provider-agnostic mono-repository that leverages Pipecat AI for WebSocket audio streaming and orchestration. The architecture is designed for low-latency, real-time interactions with built-in barge-in / interruption handling. Key technical specifications include:

    • Provider Agnostic: Supports LLM providers such as OpenAI, Anthropic, and Grok (xAI), as well as self-hosted models served through vLLM (Qwen).
    • STT/TTS Versatility: Integrates with Deepgram, Cartesia, ElevenLabs, and specialized Indic-language providers like Sarvam, AI4Bharat, and Bhashini (supporting 22+ languages).
    • RAG Pipeline: Automatic PDF chunking and embedding into a vector store for retrieval-augmented generation during live calls.
    • Deployment: Fully self-hostable via Docker Compose, featuring multi-tenant organization scoping.
    • Telephony: Native integration with Plivo and Vobiz for inbound/outbound calling and recording.
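
    The RAG Pipeline item above (chunk a document, embed the chunks, retrieve the best match at query time) can be illustrated with a minimal sketch. This is not Voicera's code: every name here (chunk_text, embed, InMemoryVectorStore, build_store) is hypothetical, the chunk sizes are arbitrary, PDF text extraction is assumed to have already happened, and a toy word-count vector stands in for a real embedding model and vector database.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=40, overlap=10):
    """Split text into overlapping word windows (sizes are illustrative)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def embed(text):
    """Toy embedding: lowercase word counts. Assumption: a real pipeline
    would call an embedding model here instead."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class InMemoryVectorStore:
    """Hypothetical stand-in for a vector database."""

    def __init__(self):
        self._items = []  # list of (chunk, vector) pairs

    def add(self, chunk):
        self._items.append((chunk, embed(chunk)))

    def search(self, query, top_k=1):
        q = embed(query)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [chunk for chunk, _ in ranked[:top_k]]


def build_store(document_text, chunk_size=40, overlap=10):
    """Chunk the extracted document text and index every chunk."""
    store = InMemoryVectorStore()
    for chunk in chunk_text(document_text, chunk_size, overlap):
        store.add(chunk)
    return store
```

    In a live call, the retrieved chunks would be added to the LLM prompt before the agent answers. The overlap between windows is a common design choice so that a sentence split across a chunk boundary still appears whole in at least one chunk.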

    Why It Matters

    The project represents a shift toward sovereign voice AI. By offering an MIT-licensed, self-hostable stack, Voicera allows enterprises to bypass the high costs and privacy concerns of proprietary