| Output $/MTok | $2.50 |
|---|---|
| Input $/MTok | $0.30 |
| Cached input $/MTok | $0.0300 |
| Context window | 1,048,576 |
| Output cap | 65,536 |
| Modalities | in: text, image, audio, video / out: text |
| Tool use | ✓ |
| Structured output | ✓ |
| Family | Gemini 2.5 |
| Knowledge cutoff | 2025-01-31 |
| Verified | 2026-07-02 |
| Source | provider page ↗ |
| $/1M chars | $50.00 | $50.00 | $19.50 |
|---|---|---|---|
| Voice quality | neural | neural | neural |
| Voice cloning | ✓ included | ✓ included | — |
| Voice count | — | — | — |
| Languages | 42+ | en, fr, de, es, pt, zh, ja, ko | en, hi |
| SSML support | ✓ | — | — |
| TTFB | 90ms | 90ms | 200ms |
| Output formats | raw/pcm_f32le, raw/pcm_s16le, raw/pcm_mulaw, raw/pcm_alaw, wav, mp3 | — | pcm, mp3, wav, ulaw, alaw |
| Verified | 2026-07-02 | 2026-07-02 | 2026-07-02 |
| Source | provider page ↗ | provider page ↗ | provider page ↗ |