Gemini 3 Flash

by Google · chat

High-speed thinking model with a 1M-token context window and configurable reasoning effort, suited to agentic workflows, multi-turn chat, and coding.

Pricing

Input: $0.475/M tokens · Output: $2.79/M tokens
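
As a rough illustration of how these per-million rates translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost_usd` and the token counts are hypothetical. It assumes the listed rates apply flat to every token, with no tiering, caching discounts, or separate charge for reasoning tokens, none of which the listing specifies.

```python
# Rates copied from the listing, in USD per one million tokens.
INPUT_USD_PER_M = 0.475
OUTPUT_USD_PER_M = 2.79


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate one request's cost under flat per-token pricing (an assumption)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_USD_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_M / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.5f}")
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens comes to roughly $0.0103.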

Capabilities

Vision, Function Calling, JSON Mode, Streaming, Reasoning

Context: 1049K tokens

Max output: 66K tokens
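
A caller might check a request against these two limits before sending it. The sketch below is illustrative only: `fits_limits` is a hypothetical helper, "K" is assumed to mean 1,000 tokens, and the prompt and the generated output are assumed to share the context window. The listing confirms neither assumption.

```python
# Limits copied from the listing; "K" is assumed to mean 1,000 tokens.
CONTEXT_LIMIT_TOKENS = 1_049_000
MAX_OUTPUT_TOKENS = 66_000


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if the request stays within both listed limits.

    Assumes prompt and output tokens count against the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_LIMIT_TOKENS


if __name__ == "__main__":
    print(fits_limits(1_000_000, 4_000))   # within both limits
    print(fits_limits(1_000_000, 66_000))  # prompt plus output exceeds the context limit
```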

Routes: 1/1 healthy

Performance

TTFT: 2652ms · Latency: 17535ms · Throughput: 9.5 tok/s
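
For back-of-the-envelope planning, the TTFT and throughput figures can be combined into a rough response-time estimate. This is a sketch under a simple assumed model (time to first token plus steady-state generation at the listed rate); the listing does not define how its Latency figure is measured, and `estimate_response_seconds` is a hypothetical name.

```python
# Figures copied from the listing.
TTFT_MS = 2652
THROUGHPUT_TOK_PER_S = 9.5


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate wall-clock time as TTFT plus tokens / throughput (assumed model)."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_MS / 1000 + output_tokens / THROUGHPUT_TOK_PER_S


if __name__ == "__main__":
    # Hypothetical 500-token response.
    print(f"{estimate_response_seconds(500):.1f} s")
```

Under this model a 500-token response would take about 55 seconds, so the listed throughput, if representative, dominates the total time for long outputs.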