← Back to Directory
Google DeepMind
Multimodal2024-02-15

Gemini 1.5 Pro

Gemini 1.5 Pro features a massive 2M token context window and strong multimodal performance.

#long-context#multimodal#enterprise

Overview

Gemini 1.5 Pro is Google's high-performance multimodal model, famous for its massive context window of up to 2 million tokens. It excels in long-form understanding and multimodal analysis.

Unique Factor

The 2 million token context window — the largest in the industry at release.

Key Capabilities

2M context window
Video understanding
Native multimodal

Benchmarks

MMLU Score
85.9%
HumanEval (Coding)
84.1%
GPQA Diamond
78%
MATH Benchmark
82%

Top Use Cases

Analyze long videos

Search and summarize content across hours of video footage.

Example: “Find the part in this 1-hour video where they discuss pricing.

Learn to Master This Model

Take our free structured Gemini course — from basics to advanced techniques.

Gemini Course

Technical Specs

Context2,000,000 tokens
Paramsunknown
LicenseProprietary
ArchMoE

API Pricing

$3.5 / 1M input tokens

Output: $10.5 / 1M tokens

✓ Free tier available
Access API

Developer

Google's AI research lab — creators of Gemini, the highest context-window multimodal LLM.

Prompt Library

Browse Coding Prompts

📋

Previous Version

Gemini 1 0 Pro