Google DeepMind
Multimodal · 2025-01-15

Gemini 2.0 Flash

Gemini 2.0 Flash is Google's ultra-fast multimodal model with a 1M-token context window.

#fast #cheap #multimodal

Overview

Gemini 2.0 Flash is Google's latency-optimized multimodal model. It is designed for real-time applications where every millisecond counts.

Unique Factor

Industry-leading latency for a model with a 1M-token context window.

Key Capabilities

Sub-second latency
1M context
Native vision/audio

Benchmarks

MMLU Score: 86%
HumanEval (Coding): 85%
GPQA Diamond: 76%
MATH Benchmark: 83%

Top Use Cases

Customer Support Agents

Providing instant, multimodal help to customers.

Example: “Explain this billing error to the user based on the screenshot.”
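
A prompt like this pairs text with an image in a single request. The sketch below only builds the JSON body and does not send it. The field names (`contents`, `parts`, `inline_data`) and the model identifier `gemini-2.0-flash` are assumptions based on the publicly documented Gemini REST format, and `build_request_body` is a hypothetical helper, so check the current API reference before relying on any of them.

```python
import base64
import json

# Assumed model identifier; confirm the exact name in the official docs.
MODEL_NAME = "gemini-2.0-flash"


def build_request_body(prompt: str, image_bytes: bytes,
                       mime_type: str = "image/png") -> dict:
    """Build a text-plus-image request body (hypothetical helper)."""
    # Binary image data is base64-encoded so it can travel inside JSON.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "contents": [
            {
                "parts": [
                    {"text": prompt},
                    {"inline_data": {"mime_type": mime_type, "data": encoded}},
                ]
            }
        ]
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real screenshot file.
    screenshot = b"placeholder-image-bytes"
    body = build_request_body(
        "Explain this billing error to the user based on the screenshot.",
        screenshot,
    )
    print(json.dumps(body, indent=2))
```

Keeping payload construction separate from the network call makes the request easy to inspect and test before any API key is involved.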

Technical Specs

Context: 1,000,000 tokens
Params: unknown
License: Proprietary
Arch: Transformer

API Pricing

Input: $0.10 / 1M tokens

Output: $0.40 / 1M tokens

✓ Free tier available
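
To make the per-token rates concrete, here is a minimal cost estimate using the prices listed on this page. The function name `estimate_cost_usd` and the token counts in the example are illustrative assumptions. The sketch ignores the free tier and any pricing tiers or discounts the provider may apply.

```python
# Rates as listed on this page, in USD per one million tokens.
INPUT_USD_PER_MILLION = 0.10
OUTPUT_USD_PER_MILLION = 0.40


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a full 1M-token prompt and a 50k-token reply.
    cost = estimate_cost_usd(1_000_000, 50_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Output tokens cost four times as much as input tokens at these rates, so long generated replies can dominate the bill even when the prompt is large.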

Developer

Google's AI research lab, creator of the Gemini family of long-context multimodal LLMs.

Previous Version

Gemini 1.5 Flash