| Output $/MTok | $4.40 |
|---|---|
| Input $/MTok | $1.10 |
| Cached input $/MTok | $0.2750 |
| Context window | 200,000 |
| Output cap | 100,000 |
| Modalities | in: text, image / out: text |
| Tool use | ✓ |
| Structured output | ✓ |
| Family | o-series |
| Knowledge cutoff | 2024-06-01 |
| Verified | 2026-05-17 |
| Source | provider page ↗ |

| $/1M chars | $160.00 | $30.00 |
|---|---|---|
| Voice quality | neural | neural |
| Voice cloning | — | — |
| Voice count | — | — |
| Languages | 40+ | en, es, de, fr, nl, it, ja |
| SSML support | ✓ | — |
| TTFB | — | — |
| Output formats | — | wav, mp3, linear16, mulaw, alaw, opus |
| Verified | 2026-05-19 | 2026-05-19 |
| Source | provider page ↗ | provider page ↗ |
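
The per-unit rates in the two tables can be turned into a rough cost estimate. The sketch below is illustrative only: the function names and the `first_column` / `second_column` labels are hypothetical (the tables do not name the two text-to-speech offerings), and it assumes cached input tokens are a subset of the total input tokens billed at the cached rate instead of the standard input rate.

```python
# Rates copied from the tables above, in USD.
INPUT_PER_MTOK = 1.10          # Input $/MTok
CACHED_INPUT_PER_MTOK = 0.275  # Cached input $/MTok
OUTPUT_PER_MTOK = 4.40         # Output $/MTok

# Hypothetical labels: the source table does not name its two columns.
TTS_PER_MCHAR = {"first_column": 160.00, "second_column": 30.00}


def llm_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request to the language model.

    Assumption: cached_tokens is the part of input_tokens that is
    billed at the cached rate; the remainder is billed at the full rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_MTOK
        + cached_tokens * CACHED_INPUT_PER_MTOK
        + output_tokens * OUTPUT_PER_MTOK
    )
    return total / 1_000_000


def tts_cost(characters: int, rate_per_mchar: float) -> float:
    """Estimate the USD cost of synthesizing `characters` characters."""
    return characters * rate_per_mchar / 1_000_000


if __name__ == "__main__":
    print(f"LLM request: ${llm_cost(20_000, 2_000, cached_tokens=15_000):.4f}")
    for label, rate in TTS_PER_MCHAR.items():
        print(f"TTS ({label}), 500k chars: ${tts_cost(500_000, rate):.2f}")
```

Actual invoices may differ from this estimate (for example through minimum charges, rounding, or tiered pricing), so check any result against the provider pages listed as sources.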