VoxCPM

    Tech
    TTS
    Open Source

    Open-source text-to-speech platform that generates expressive, multilingual voices.

    About VoxCPM

    VoxCPM: Open Source TTS for Creative & Expressive Voices

    VoxCPM is an open-source text-to-speech platform for generating creative, expressive, high-fidelity voices. It supports over 30 languages and offers text-based control over voice style, so users can produce seamless multilingual speech, expressive audiobooks, and distinctive game character voices.

    Key Features

    • Multilingual Capabilities: Supports over 30 languages with stable timbre and seamless switching between languages.
    • High-Fidelity Audio: Generates 48 kHz audio for professional-grade sound quality.
    • Controllable Voice Creativity: Offers text-based control for highly customizable voice styles and character performances.
    • Expressive Intonation: Produces natural rhythms and rich emotional layers, including specific emotions and non-verbal sounds.
    • Open Source Ecosystem: Free and open-source models available on GitHub and Hugging Face, supporting model fine-tuning and collaborative development.
    • Easy Deployment: Streamlined setup process for quick integration into various creative workflows.
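
    When integrating generated speech into a pipeline, it can be useful to confirm that an output file really carries the 48 kHz sample rate listed above. The following is a minimal sketch using only Python's standard `wave` module. It does not call VoxCPM itself: the sine tone stands in for generated speech, and the helper names `write_test_tone` and `describe_wav` are hypothetical, chosen for illustration.

```python
import io
import math
import struct
import wave

SAMPLE_RATE_HZ = 48_000  # the sample rate cited in the feature list


def write_test_tone(buffer, seconds=0.5, freq_hz=440.0):
    """Write a mono 16-bit sine tone at 48 kHz (a stand-in for generated speech)."""
    n_frames = int(SAMPLE_RATE_HZ * seconds)
    frames = b"".join(
        struct.pack(
            "<h",  # little-endian signed 16-bit sample
            int(0.5 * 32767 * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE_HZ)),
        )
        for i in range(n_frames)
    )
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(SAMPLE_RATE_HZ)
        wav.writeframes(frames)


def describe_wav(source):
    """Return (sample_rate_hz, channels, duration_seconds) for a WAV path or file object."""
    with wave.open(source, "rb") as wav:
        rate = wav.getframerate()
        return rate, wav.getnchannels(), wav.getnframes() / rate


if __name__ == "__main__":
    buf = io.BytesIO()
    write_test_tone(buf)
    buf.seek(0)  # rewind before reading the header back
    print(describe_wav(buf))
```

    In practice, `describe_wav` would be pointed at the path of a file produced by the model rather than at an in-memory test tone. Note that `wave` reads uncompressed PCM WAV only, so other output formats would need a different reader.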

    Use Cases

    • Creating expressive audiobooks with natural rhythm and intonation
    • Developing unique voice profiles and emotional performances for game NPCs
    • Producing vibrant, multilingual podcasts and announcer broadcasts

    Getting Started

    • Website: https://voxcpm.com/en/
    • Demo: Try the interactive voice generation demo directly on the homepage.
    • Source Access: Download models from GitHub or Hugging Face to begin deployment.

    VoxCPM gives creators and developers open-source access to expressive, controllable, multilingual speech synthesis.