| Output $/MTok | $1.00 |
|---|---|
| Input $/MTok | $1.00 |
| Cached input $/MTok | — |
| Context window | 128,000 |
| Output cap | 8,000 |
| Modalities | in: text / out: text |
| Tool use | — |
| Structured output | ✓ |
| Family | Sonar |
| Knowledge cutoff | — |
| Verified | 2026-05-19 |
| Source | provider page ↗ |
| $/1M chars | $50.00 | $50.00 | $160.00 |
|---|---|---|---|
| Voice quality | neural | neural | neural |
| Voice cloning | ✓ included | ✓ included | — |
| Voice count | — | — | — |
| Languages | 32+ | 15+ | 40+ |
| SSML support | — | — | ✓ |
| TTFB | 75ms | 90ms | — |
| Output formats | mp3_44100_128, pcm_16000, wav_44100, opus_48000_128, ulaw_8000, alaw_8000 | — | — |
| Verified | 2026-05-05stale 32d | 2026-05-19 | 2026-05-19 |
| Source | provider page ↗ | provider page ↗ | provider page ↗ |