| Output $/MTok | $10.00 | $8.00 |
|---|---|---|
| Input $/MTok | $1.25 | $2.00 |
| Cached input $/MTok | $0.1250 | $0.5000 |
| Context window | 400,000 | 1,047,576 |
| Output cap | 128,000 | 32,768 |
| Modalities | in: text, image / out: text | in: text, image / out: text |
| Tool use | ✓ | ✓ |
| Structured output | ✓ | ✓ |
| Family | GPT-5 | GPT-4.1 |
| Knowledge cutoff | 2024-09-30 | 2024-06-01 |
| Verified | 2026-05-17 | 2026-05-17 |
| Source | provider page ↗ | provider page ↗ |
| $/1M chars | $50.00 |
|---|---|
| Voice quality | neural |
| Voice cloning | ✓ included |
| Voice count | — |
| Languages | 42+ |
| SSML support | — |
| TTFB | 90ms |
| Output formats | raw/pcm_f32le, raw/pcm_s16le, raw/pcm_mulaw, raw/pcm_alaw, wav, mp3 |
| Verified | 2026-05-05stale 32d |
| Source | provider page ↗ |