Gemini 3.1 Flash-Lite (Preview)

by Google · chat

Frontier-class performance rivaling larger models at a fraction of the cost.

Pricing

Input: $0.2375/M tokens · Output: .395/M tokens

Capabilities

Vision, Function Calling, JSON Mode, Streaming, Reasoning

Context: 1049K tokens

Max output: 66K tokens

Routes: 1/1 healthy

Performance

TTFT: 1060ms · Latency: 4211ms · Throughput: 182.4 tok/s