Best Mistral Medium 3 Alternatives

Mistral Medium 3 by Mistral is a mid-tier model priced at $0.4 input / $2 output per 1M tokens. It's already affordable, but you might want different strengths or features.

Mistral Medium 3 (Mistral, Mid-Tier)

Input: $0.4/1M
Output: $2/1M
Context: 128K
Max Output: 16K
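
To make the per-1M-token prices concrete, here is a minimal sketch of how a single request's cost follows from them. The function name `request_cost` and the example token counts are illustrative assumptions; only the $0.4 and $2 prices come from this page.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float = 0.40,
                 output_price_per_m: float = 2.00) -> float:
    """Return the USD cost of one request at per-1M-token prices.

    Defaults are the Mistral Medium 3 prices listed on this page.
    """
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Hypothetical request: 20,000 prompt tokens and 1,500 completion tokens.
cost = request_cost(input_tokens=20_000, output_tokens=1_500)
print(f"${cost:.4f}")  # prints "$0.0110"
```

Passing another model's prices as the last two arguments gives a like-for-like comparison for the same request.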

Why Switch from Mistral Medium 3?

Lags behind on reasoning benchmarks: seven of the eight top alternatives below score higher on MMLU-Pro
Smaller community and tooling ecosystem

Top Alternatives

#1 o3 (OpenAI, Reasoning)

17% cheaper on combined input + output price, higher benchmark scores, 100K max output.

Input: $0.4/1M (same price)
Output: $1.6/1M (20% cheaper)
Context: 200K
Max Output: 100K

MMLU-Pro: 87% (+11.0%), HumanEval: 94.5% (+7.5%)
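
The price labels on these cards appear to be plain percentage differences against Mistral Medium 3 ($0.4 in, $2 out). The headline figure (for example "17% cheaper" for o3) matches the same formula applied to the sum of input and output prices. That reading is an assumption inferred from the numbers, not something the page states. The helper names below are hypothetical, and rounding of exact halves may differ from the page.

```python
def pct_change(alt: float, base: float) -> float:
    """Percentage difference of alt relative to base (negative means cheaper)."""
    return (alt - base) / base * 100.0


def price_label(alt: float, base: float) -> str:
    """Format a price difference the way the cards on this page do."""
    change = pct_change(alt, base)
    if abs(change) < 0.5:
        return "Same price"
    direction = "more" if change > 0 else "cheaper"
    return f"{abs(change):.0f}% {direction}"


BASE_IN, BASE_OUT = 0.40, 2.00  # Mistral Medium 3, USD per 1M tokens

print(price_label(0.40, BASE_IN))                       # o3 input
print(price_label(1.60, BASE_OUT))                      # o3 output
print(price_label(0.40 + 1.60, BASE_IN + BASE_OUT))     # o3 combined
```

The three calls reproduce "Same price", "20% cheaper", and "17% cheaper" for o3. The combined figure weights input and output tokens equally, so a workload dominated by one side will see a different saving.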

#2 GLM-4.7 (Zhipu AI, Mid-Tier)

Higher benchmark scores, 128K max output, adds reasoning.

Input: $0.6/1M (50% more)
Output: $2.2/1M (10% more)
Context: 200K
Max Output: 128K

MMLU-Pro: 84.3% (+8.3%), HumanEval: not listed

#3 Llama 4 Maverick (Meta, Open Source)

Dramatically cheaper (52% less), higher benchmark scores, 1M context (about 8x larger).

Input: $0.31/1M (23% cheaper)
Output: $0.85/1M (57% cheaper)
Context: 1M
Max Output: 32K

MMLU-Pro: 80.5% (+4.5%), HumanEval: 90.2% (+3.2%)

#4 Claude Sonnet 4.6 (Anthropic, Mid-Tier)

Higher benchmark scores, adds reasoning.

Input: $3/1M (650% more)
Output: $15/1M (650% more)
Context: 200K
Max Output: 16K

MMLU-Pro: 86% (+10.0%), HumanEval: 94% (+7.0%)

#5 Claude Sonnet 4.5 (Anthropic, Mid-Tier)

Higher benchmark scores, adds reasoning.

Input: $3/1M (650% more)
Output: $15/1M (650% more)
Context: 200K
Max Output: 16K

MMLU-Pro: 84.5% (+8.5%), HumanEval: 93% (+6.0%)

#6 Llama 4 Scout (Meta, Open Source)

Dramatically cheaper (66% less), comparable performance, 10M context (about 78x larger).

Input: $0.18/1M (55% cheaper)
Output: $0.63/1M (69% cheaper)
Context: 10M
Max Output: 32K

MMLU-Pro: 74.2% (-1.8%), HumanEval: 86% (-1.0%)
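
The context multipliers ("8x", "78x") can be reproduced by dividing context sizes, assuming the K and M suffixes are decimal (1K = 1,000 tokens). With binary units, 10M against 128K would come out at 80x, so the decimal reading is the one that matches this page. The helper names here are hypothetical.

```python
UNITS = {"K": 1_000, "M": 1_000_000}  # decimal units, an assumption


def parse_tokens(size: str) -> int:
    """Convert a label such as '128K' or '10M' into a token count."""
    return int(float(size[:-1]) * UNITS[size[-1].upper()])


def context_ratio(alt: str, base: str = "128K") -> float:
    """How many times larger the alt context window is than the base."""
    return parse_tokens(alt) / parse_tokens(base)


for label in ("200K", "1M", "10M"):
    print(label, f"{context_ratio(label):.1f}x")
```

For 1M and 10M this gives 7.8x and 78.1x, which round to the 8x and 78x shown on the cards.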

#7 Gemini 2.5 Pro (Google, Mid-Tier)

Higher benchmark scores, 1M context (about 8x larger), 66K max output.

Input: $1.25/1M (213% more)
Output: $10/1M (400% more)
Context: 1M
Max Output: 66K

MMLU-Pro: 87.5% (+11.5%), HumanEval: 93.5% (+6.5%)

#8 o4-mini (OpenAI, Reasoning)

Higher benchmark scores, 100K max output, adds reasoning.

Input: $1.1/1M (175% more)
Output: $4.4/1M (120% more)
Context: 200K
Max Output: 100K

MMLU-Pro: 85% (+9.0%), HumanEval: 93.5% (+6.5%)

Full Comparison Table

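| Model | Input $/1M | Output $/1M | Context | MMLU-Pro | HumanEval | Score |
|---|---|---|---|---|---|---|
| o3 (OpenAI) | $0.40 (same price) | $1.60 (20% cheaper) | 200K | 87% (+11.0%) | 94.5% (+7.5%) | 79 |
| GLM-4.7 (Zhipu AI) | $0.60 (50% more) | $2.20 (10% more) | 200K | 84.3% (+8.3%) | | 78 |
| Llama 4 Maverick (Meta) | $0.31 (23% cheaper) | $0.85 (57% cheaper) | 1M | 80.5% (+4.5%) | 90.2% (+3.2%) | 78 |
| Claude Sonnet 4.6 (Anthropic) | $3.00 (650% more) | $15.00 (650% more) | 200K | 86% (+10.0%) | 94% (+7.0%) | 74 |
| Claude Sonnet 4.5 (Anthropic) | $3.00 (650% more) | $15.00 (650% more) | 200K | 84.5% (+8.5%) | 93% (+6.0%) | 74 |
| Llama 4 Scout (Meta) | $0.18 (55% cheaper) | $0.63 (69% cheaper) | 10M | 74.2% (-1.8%) | 86% (-1.0%) | 71 |
| Gemini 2.5 Pro (Google) | $1.25 (213% more) | $10.00 (400% more) | 1M | 87.5% (+11.5%) | 93.5% (+6.5%) | 70 |
| o4-mini (OpenAI) | $1.10 (175% more) | $4.40 (120% more) | 200K | 85% (+9.0%) | 93.5% (+6.5%) | 69 |
| Gemini 3 Flash (Google) | $0.50 (25% more) | $3.00 (50% more) | 1M | 78% (+2.0%) | 90% (+3.0%) | 69 |
| Gemini 2.5 Flash (Google) | $0.15 (63% cheaper) | $0.60 (70% cheaper) | 1M | 76% (same) | 89.5% (+2.5%) | 69 |
| Mistral Large 3 (Mistral) | $2.00 (400% more) | $5.00 (150% more) | 128K | 83% (+7.0%) | 91% (+4.0%) | 69 |
| Claude Haiku 4.5 (Anthropic) | $0.80 (100% more) | $4.00 (100% more) | 200K | 69.4% (-6.6%) | 88.1% (+1.1%) | 68 |
| GPT-4o Mini (OpenAI) | $0.15 (63% cheaper) | $0.60 (70% cheaper) | 128K | 68% (-8.0%) | 87.2% (+0.2%) | 68 |
| GPT-4o (OpenAI) | $2.50 (525% more) | $10.00 (400% more) | 128K | 80.5% (+4.5%) | 91% (+4.0%) | 67 |
| MiniMax M2.5 (MiniMax) | $0.30 (25% cheaper) | $1.20 (40% cheaper) | 200K | 82% (+6.0%) | 90% (+3.0%) | 67 |
| GLM-5 (Zhipu AI) | $1.00 (150% more) | $3.20 (60% more) | 200K | 70.4% (-5.6%) | 91% (+4.0%) | 62 |
| GPT-5.3 Codex (OpenAI) | $2.00 (400% more) | $16.00 (700% more) | 200K | 90% (+14.0%) | 96.5% (+9.5%) | 59 |
| GPT-5.2 Codex (OpenAI) | $1.75 (338% more) | $14.00 (600% more) | 200K | 89% (+13.0%) | 95.5% (+8.5%) | 59 |
| DeepSeek V3 (DeepSeek) | $0.14 (65% cheaper) | $0.28 (86% cheaper) | 164K | 78% (+2.0%) | 89% (+2.0%) | 57 |
| DeepSeek R1 (DeepSeek) | $0.55 (38% more) | $2.19 (9% more) | 128K | 84% (+8.0%) | 92% (+5.0%) | 57 |
| GPT-5 (OpenAI) | $1.25 (213% more) | $10.00 (400% more) | 128K | 88.5% (+12.5%) | 95% (+8.0%) | 55 |
| Gemini 3.1 Pro (Google) | $2.00 (400% more) | $12.00 (500% more) | 1M | 91% (+15.0%) | 95% (+8.0%) | 55 |
| Gemini 3 Pro (Google) | $2.00 (400% more) | $12.00 (500% more) | 1M | 89.8% (+13.8%) | 94% (+7.0%) | 55 |
| Grok 4 (xAI) | $3.00 (650% more) | $15.00 (650% more) | 128K | 86% (+10.0%) | 93% (+6.0%) | 55 |
| Claude Opus 4.6 (Anthropic) | $5.00 (1150% more) | $25.00 (1150% more) | 200K | 89.5% (+13.5%) | 95% (+8.0%) | 49 |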
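
One way to read the table is to look for models that cost no more than Mistral Medium 3 on either price while scoring higher on both benchmarks. The sketch below applies that filter to a handful of rows copied from the table. The baseline benchmark scores (76% MMLU-Pro, 87% HumanEval) are inferred from the deltas rather than stated on the page, and the `Model` class and `dominates` function are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    mmlu_pro: float      # percent
    human_eval: float    # percent


# Baseline benchmarks are inferred from the table's deltas
# (e.g. 87% with +11.0% implies 76%); they are an assumption.
BASELINE = Model("Mistral Medium 3", 0.40, 2.00, 76.0, 87.0)

# A subset of rows from the comparison table.
CANDIDATES = [
    Model("o3", 0.40, 1.60, 87.0, 94.5),
    Model("Llama 4 Maverick", 0.31, 0.85, 80.5, 90.2),
    Model("Claude Sonnet 4.6", 3.00, 15.00, 86.0, 94.0),
    Model("Llama 4 Scout", 0.18, 0.63, 74.2, 86.0),
    Model("Gemini 2.5 Flash", 0.15, 0.60, 76.0, 89.5),
    Model("MiniMax M2.5", 0.30, 1.20, 82.0, 90.0),
    Model("DeepSeek V3", 0.14, 0.28, 78.0, 89.0),
]


def dominates(alt: Model, base: Model) -> bool:
    """True if alt is no pricier on either price and better on both benchmarks."""
    return (alt.input_price <= base.input_price
            and alt.output_price <= base.output_price
            and alt.mmlu_pro > base.mmlu_pro
            and alt.human_eval > base.human_eval)


winners = [m.name for m in CANDIDATES if dominates(m, BASELINE)]
print(winners)
# prints ['o3', 'Llama 4 Maverick', 'MiniMax M2.5', 'DeepSeek V3']
```

This filter ignores context length, max output, and the page's Score column, so treat it as a first pass rather than a ranking.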
