← All Comparisons

o3 vs Gemini 3.1 Pro

A detailed comparison of o3 (OpenAI) and Gemini 3.1 Pro (Google) across pricing, performance, and features.

Pricing Comparison

Metric	o3	Gemini 3.1 Pro	Difference
Input / 1M tokens	$0.40	$2.00	+400%
Output / 1M tokens	$1.60	$12.00	+650%
Context window	200K	1M	—
Max output	100K	64K	—

Benchmark Comparison

Benchmark	o3	Gemini 3.1 Pro
MMLU-Pro	87%	91%
HumanEval	94.5%	95%
GPQA	79.2%	94.3%

Capabilities

Capability	o3	Gemini 3.1 Pro
audio	✗	✓
code	✓	✓
reasoning	✓	✓
text	✓	✓
tool-use	✓	✓
vision	✓	✓

o3 Strengths

✓Recently repriced — now very cheap
✓Excellent logical reasoning
✓200K context window

o3 Weaknesses

✗Slower due to reasoning overhead
✗Overkill for simple tasks

Gemini 3.1 Pro Strengths

✓#1 on 12 of 18 tracked benchmarks
✓94.3% GPQA Diamond — highest of any model
✓Same price as Gemini 3 Pro (free upgrade)
✓1M context with configurable thinking levels

Gemini 3.1 Pro Weaknesses

✗Still in preview
✗Context-tiered pricing ($4/$18 above 200K)

Quick Verdict

Best value: o3 is the more affordable option at $0.4/$1.6 per 1M tokens.

Higher benchmarks: Gemini 3.1 Pro scores higher on average across available benchmarks (93.4% avg).

Larger context: Gemini 3.1 Pro supports 1M tokens.

Choose o3 if cost matters most. Choose Gemini 3.1 Pro if you need the best possible quality for complex tasks.

More Comparisons

o3 vs Claude Opus 4.6 o3 vs Claude Sonnet 4.6 o3 vs Claude Sonnet 4.5 o3 vs Claude Haiku 4.5 o3 vs GPT-5.3 Codex o3 vs GPT-5.2 Codex