Multimodal · 2024-02-15

Gemini 1.5 Pro
Gemini 1.5 Pro features a massive 2M token context window and strong multimodal performance.
Overview
Gemini 1.5 Pro is Google's high-performance multimodal model, best known for its context window of up to 2 million tokens. It excels at long-context understanding and multimodal analysis.
Unique Factor
A context window of up to 2 million tokens, the largest offered by a major commercial model when it became available.
Key Capabilities
● 2M context window
● Video understanding
● Native multimodal
Benchmarks
MMLU Score: 85.9%
HumanEval (Coding): 84.1%
GPQA Diamond: 78%
MATH Benchmark: 82%
Top Use Cases
Analyze long videos
Search and summarize content across hours of video footage.
Example: “Find the part in this 1-hour video where they discuss pricing.”
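To get a rough sense of how much footage fits in a single request, the arithmetic can be sketched as below. This is a minimal estimate, not a documented formula: the rate of 300 tokens per second of video is an illustrative assumption (actual token costs depend on sampling rate, resolution, and audio), and the function names are hypothetical. Only the 2,000,000-token limit comes from this page.

```python
# Rough budget check: does a video of a given length fit in the context window?

# ASSUMPTION: average token cost of one second of video (frames plus audio).
# Illustrative figure only; it is not stated on this page.
ASSUMED_TOKENS_PER_SECOND = 300

# Context window size as listed under Technical Specs.
CONTEXT_WINDOW_TOKENS = 2_000_000


def estimate_video_tokens(duration_seconds: int,
                          tokens_per_second: int = ASSUMED_TOKENS_PER_SECOND) -> int:
    """Estimate the token cost of a video from its duration."""
    return duration_seconds * tokens_per_second


def fits_in_context(duration_seconds: int,
                    prompt_tokens: int = 0,
                    tokens_per_second: int = ASSUMED_TOKENS_PER_SECOND) -> bool:
    """Return True if the video plus the text prompt stays within the window."""
    total = estimate_video_tokens(duration_seconds, tokens_per_second) + prompt_tokens
    return total <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    one_hour = 60 * 60
    print(estimate_video_tokens(one_hour))  # 1080000
    print(fits_in_context(one_hour))        # True
    print(fits_in_context(2 * one_hour))    # False
```

Under this assumed rate, a 1-hour video uses a little over half of the window, while two hours would exceed it, so longer footage would need to be split or sampled more sparsely.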
Technical Specs
Context: 2,000,000 tokens
Params: unknown
License: Proprietary
Arch: MoE
Developer
Google's AI research lab, creators of Gemini, a multimodal LLM family with an exceptionally large context window.
Previous Version
Gemini 1.0 Pro