Amazon Nova Sonic
Real-time, expressive speech-to-speech model for conversational AI applications.

About Amazon Nova Sonic
Amazon Nova Sonic: Real-Time Speech-to-Speech AI on AWS
Amazon Nova Sonic is a state-of-the-art speech-to-speech AI model available through Amazon Bedrock. It enables real-time, human-like voice conversations with low latency and industry-leading price performance. Nova Sonic can understand streaming speech in various speaking styles and generate expressive, adaptive responses that mirror the prosody and emotion of the input speech.
The model supports both masculine and feminine expressive voices in multiple English accents (including American and British), making it suitable for a wide range of conversational AI applications such as customer support automation, outbound marketing, voice-enabled personal assistants, and interactive education or language learning.
Nova Sonic is accessed via a bidirectional streaming API in Amazon Bedrock, enabling two-way, low-latency communication essential for interactive voice experiences. It also includes responsible AI features like built-in content moderation and watermarking for safety and compliance.
Key Features
Real-Time Speech-to-Speech:
Enables instant, natural voice conversations between users and AI.Expressive, Adaptive Voices:
Supports multiple English accents and both masculine and feminine voice styles.Bidirectional Streaming API:
Two-way streaming for seamless, low-latency interaction.Fast and Cost-Effective:
Optimized for high performance and affordable operation at scale.Knowledge Grounding:
Can integrate with enterprise knowledge bases for accurate, context-aware responses.Tool Use & Agentic Workflows:
Supports function calling and integration into complex agent workflows.Responsible AI:
Built-in content moderation and watermarking for safety and compliance.
Use Cases
Customer service call automation
Voice-enabled business assistants and agents
Outbound marketing campaigns
Interactive education and language learning tools
Real-time language practice for non-native speakers
Model Selection
Amazon Nova Sonic
Speech-to-speech model with expressive, adaptive voice generation.
Getting Started
Official Overview: Amazon Nova Sonic on AWS
API Access: Use via Amazon Bedrock’s bidirectional streaming API
Tutorial: Step-by-step video guide (see page for details)
Documentation: Access through Amazon Bedrock documentation
Responsible AI: Learn about built-in protections and compliance on the product page
Amazon Nova Sonic empowers developers and enterprises to build advanced, conversational voice AI solutions that feel natural, responsive, and safe-unlocking new possibilities for customer engagement, automation, and interactive learning.