LLM Chat · realtime · voice · multilingual

Nova 2 Sonic

By Amazon · Released 2026

Amazon Nova 2 Sonic is Amazon's flagship speech-to-speech voice model (voice agent only). It offers 16 expressive voices across 8 languages, including Hindi and Indian English, with natural turn-taking and function calling. It is the default voice-agent model.

Powered by Amazon · Realtime speech-to-speech

Context window: 32K

Parameters: Not disclosed

Max output: N/A

Category: LLM Chat

Overview

Amazon Nova 2 Sonic is a native speech-to-speech foundation model: a single model that understands speech and generates speech directly, rather than bolting text-to-speech onto a separate language model. One connection listens, reasons, and speaks with latency low enough for natural live conversation. That includes human-like turn-taking (the model detects when the caller has finished a thought) and graceful handling of interruptions without dropping context. On CallMissed it is the default voice model: create a session via `/v1/voice/sessions` with `llm_model` set to `nova-sonic-2`, or leave `llm_model` unset to get the default. It is not available on `/v1/chat/completions`; it is available only to voice agents over WebSocket.

Nova 2 Sonic ships 16 expressive voices across eight languages: English (US, UK, India, and Australia), Hindi, Spanish, French, Italian, German, and Portuguese. Two of the voices (Tiffany and Matthew) are polyglot: each is a single voice persona that can switch languages mid-conversation without sounding like a different speaker. That suits multilingual support lines where a caller code-switches between, say, Hindi and English. The model is robust to background noise and to a range of accents, and it supports asynchronous function calling so tools can run while the assistant keeps talking.
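The idea of asynchronous function calling, where a tool runs while the assistant keeps talking, can be illustrated with a small `asyncio` sketch. This is only a generic concurrency pattern, not the Nova 2 Sonic or CallMissed API: the function names `lookup_appointment_slots` and `converse`, the filler phrases, and the date are all hypothetical.

```python
import asyncio
from typing import List


async def lookup_appointment_slots(date: str) -> List[str]:
    """Hypothetical tool: stands in for a slow backend lookup."""
    await asyncio.sleep(0.05)
    return [f"{date} 10:00", f"{date} 14:30"]


async def converse() -> List[str]:
    """Start the tool in the background and keep 'speaking' until it finishes."""
    transcript: List[str] = []

    # Kick off the tool call without waiting for its result.
    tool_task = asyncio.create_task(lookup_appointment_slots("2026-03-01"))

    # The assistant keeps talking while the tool runs concurrently.
    for filler in ("Let me check the calendar.", "One moment."):
        transcript.append(f"assistant: {filler}")
        await asyncio.sleep(0.01)

    # Only now block on the tool result and speak it.
    slots = await tool_task
    transcript.append(f"assistant: I have {', '.join(slots)}.")
    return transcript


transcript = asyncio.run(converse())
for line in transcript:
    print(line)
```

In a real voice session the "speaking" would be streamed audio rather than appended strings; the point is only that the tool call does not block the conversation loop.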

Pricing on CallMissed is $4.00 per million input tokens and $15.00 per million output tokens (speech). That is dramatically cheaper than the older gpt-realtime class while delivering native speech-to-speech quality, which is why Nova 2 Sonic is the platform default for voice agents. Budget for continuous audio: minutes of conversation accumulate tokens faster than text-only chat, so pilot with recorded calls to estimate monthly spend before enabling toll-free numbers.

Use Nova 2 Sonic for phone bots, voice assistants, appointment booking, customer support automation, and any hands-free workflow where a single unified model is simpler than chaining separate STT, LLM, and TTS providers. You trade some flexibility (mixing your favorite STT + text LLM + TTS) for operational simplicity and lower latency. For Indian-language telephony specifically, the en-IN and Hindi voices plus polyglot code-switching make it a strong default.

Limitations: voice-pipeline only (no text chat completions endpoint), and like all realtime models it depends on client-side audio capture quality. For batch transcription after the fact, use a dedicated STT model instead. CallMissed runs Nova 2 Sonic through AWS Bedrock; if AWS credentials are not configured in a region, the platform automatically falls back to the standard STT→LLM→TTS pipeline so calls still connect.
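The fallback rule described above can be sketched as a small selection function. This is an illustration of the stated behavior, not CallMissed's actual implementation: `RegionConfig`, `select_pipeline`, the region names, and the `"stt-llm-tts"` label are hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RegionConfig:
    """Hypothetical per-region settings."""
    name: str
    aws_credentials_configured: bool


def select_pipeline(region: RegionConfig) -> str:
    """Prefer Nova 2 Sonic; otherwise fall back to the chained pipeline."""
    if region.aws_credentials_configured:
        return "nova-sonic-2"
    # No AWS credentials in this region: use STT -> LLM -> TTS so calls still connect.
    return "stt-llm-tts"


print(select_pipeline(RegionConfig("region-a", True)))
print(select_pipeline(RegionConfig("region-b", False)))
```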

Pricing

Metric                Price
Input / 1M tokens     ₹400.0000
Output / 1M tokens    ₹1500.0000

1 credit = ₹1 = $0.01 USD. Prices shown are from the provider; CallMissed passes them through with a ~35% markup.
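As a rough budgeting aid, the arithmetic behind these rates can be written out as follows. The rates and the credit conversion come from this page; the token volumes are made up for illustration, since the page does not say how many tokens a minute of audio consumes. The `markup` parameter is an assumption: the page does not state whether the listed rates already include the ~35% markup, so it defaults to zero.

```python
# Rates from this page; the token volumes used below are hypothetical.
INPUT_USD_PER_MILLION = 4.00
OUTPUT_USD_PER_MILLION = 15.00
CREDITS_PER_USD = 100  # derived from "1 credit = $0.01 USD"


def estimate_cost_usd(input_tokens: int, output_tokens: int, markup: float = 0.0) -> float:
    """Estimate spend in USD; markup is a fraction (e.g. 0.35) and is an assumption."""
    base = (
        (input_tokens / 1_000_000) * INPUT_USD_PER_MILLION
        + (output_tokens / 1_000_000) * OUTPUT_USD_PER_MILLION
    )
    return base * (1.0 + markup)


def usd_to_credits(usd: float) -> float:
    """Convert USD to platform credits."""
    return usd * CREDITS_PER_USD


# Example: 2M input tokens and 0.5M output tokens in a month.
monthly_usd = estimate_cost_usd(2_000_000, 500_000)
print(f"${monthly_usd:.2f} = {usd_to_credits(monthly_usd):.0f} credits")  # $15.50 = 1550 credits
```

Measuring real token counts from a handful of recorded calls and feeding them into a function like this gives a more reliable monthly estimate than guessing.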

Highlights

  • Native speech-to-speech
  • 16 voices · 8 languages
  • Hindi + Indian English
  • Polyglot voices
  • Default voice model

Technical details

  • Model id: nova-sonic-2
  • Voice-agent WebSocket only
  • Natural turn-taking + barge-in

Strengths

  • Native speech-to-speech
  • Multilingual incl. Hindi
  • Low latency
  • Cost-efficient

Limitations

  • Not available on chat completions
  • Voice surface only

Use cases

Voice agents · Phone bots · Multilingual support lines · Appointment booking

API example

# Create a voice session with llm_model=nova-sonic-2 via POST /v1/voice/sessions

Endpoint: WebSocket /v1/voice/sessions · Model ID: nova-sonic-2
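A minimal sketch of what the session-creation request might look like is shown here. It only builds the request and does not send it. The path `/v1/voice/sessions`, the `llm_model` field, and the `nova-sonic-2` id come from this page; the base URL, the bearer-token header, and the JSON body shape are assumptions, and the WebSocket handshake that follows is not shown because the page does not describe it.

```python
import json
import urllib.request
from typing import Optional

# Hypothetical base URL; the real host is not given on this page.
BASE_URL = "https://api.example.invalid"


def build_session_request(
    api_key: str, llm_model: Optional[str] = "nova-sonic-2"
) -> urllib.request.Request:
    """Build (but do not send) a POST /v1/voice/sessions request."""
    body = {}
    if llm_model is not None:
        # Omitting llm_model relies on the platform default (nova-sonic-2).
        body["llm_model"] = llm_model
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/voice/sessions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_session_request("YOUR_API_KEY")
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing `llm_model=None` produces an empty body, which corresponds to leaving the model unset and accepting the default.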

Try Nova 2 Sonic now

Get 1000 free API credits on signup. No credit card required.