Best AI Model for Document Summarization

Summarizing long documents, reports, meeting transcripts, and articles. This task needs a large context window and strong compression ability.

Our Verdict

Gemini 2.5 Flash at $0.15 input / $0.60 output per 1M tokens with a 1M context is the best deal: summarize entire books for pennies. For higher-quality summaries, Gemini 3.1 Pro's 1M context at $2 input per 1M tokens gives noticeably better compression. Llama 4 Scout's 10M context is unique if you need to process truly massive documents, and at $0.18 input per 1M tokens it's dirt cheap. Input cost matters most here, because you're sending lots of text and generating little.
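To make the "pennies" claim concrete, here is a minimal cost sketch using the per-1M-token prices listed on this page. The workload (a 500K-token book condensed to a 2K-token summary) and the helper name `summary_cost` are illustrative assumptions, not measurements.

```python
# Prices in USD per 1M tokens (input, output), as listed on this page.
PRICES = {
    "Gemini 2.5 Flash": (0.15, 0.60),
    "Gemini 3.1 Pro": (2.00, 12.00),
    "Llama 4 Scout": (0.18, 0.63),
}


def summary_cost(model, input_tokens, output_tokens):
    """Return (input_cost, output_cost) in USD for one summarization call."""
    in_price, out_price = PRICES[model]
    return (
        input_tokens / 1_000_000 * in_price,
        output_tokens / 1_000_000 * out_price,
    )


if __name__ == "__main__":
    # Hypothetical workload: 500K tokens in, 2K tokens out.
    for model in PRICES:
        in_cost, out_cost = summary_cost(model, 500_000, 2_000)
        total = in_cost + out_cost
        print(f"{model}: ${total:.4f} total, {in_cost / total:.0%} from input")
```

Under these assumptions the input side accounts for roughly 98% of the bill on all three models, which is why the input price is the number to compare first.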

Top Picks

Gemini 2.5 Flash

1M context at $0.15 input: the cheapest way to summarize long docs

Best for: Budget summarization at scale

Input: $0.15/1M
Output: $0.60/1M
Context: 1M
Max Output: 66K
MMLU-Pro: 76%, HumanEval: 89.5%

Gemini 3.1 Pro

1M context with best-in-class comprehension for accurate summaries

Best for: High-quality summaries

Input: $2/1M
Output: $12/1M
Context: 1M
Max Output: 64K
MMLU-Pro: 91%, HumanEval: 95%, GPQA: 94.3%

Llama 4 Scout

10M context: the only model here that can process truly massive corpora

Best for: Extremely long documents

Input: $0.18/1M
Output: $0.63/1M
Context: 10M
Max Output: 32K
MMLU-Pro: 74.2%, HumanEval: 86%

What Matters for Summarization

Key Factors

  • Context window
  • Input cost
  • Compression quality

Tips

  • Input cost dominates — you're sending lots of text but generating little
  • Models with 1M+ context (Gemini at 1M, Llama 4 Scout at 10M) can handle entire books
  • Flash/budget models are usually sufficient for summarization
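As a rough illustration of how the context-window and input-cost factors interact, the sketch below picks the cheapest of the three top picks that can hold a given document. The 4-characters-per-token estimate, the 2K-token summary allowance, treating "1M" and "10M" as round numbers, and the function names are all assumptions for illustration; real tokenizers and provider limits will differ.

```python
CHARS_PER_TOKEN = 4  # assumption: rough average for English text

# (name, context window in tokens, input price in USD per 1M tokens),
# using the rounded figures listed on this page.
MODELS = [
    ("Gemini 2.5 Flash", 1_000_000, 0.15),
    ("Gemini 3.1 Pro", 1_000_000, 2.00),
    ("Llama 4 Scout", 10_000_000, 0.18),
]


def estimate_tokens(text):
    """Crude token estimate from character count (not a real tokenizer)."""
    return len(text) // CHARS_PER_TOKEN


def cheapest_model_that_fits(input_tokens, summary_tokens=2_000):
    """Return the lowest-input-price model whose context holds the document
    plus the summary, or None if nothing fits."""
    needed = input_tokens + summary_tokens
    fitting = [m for m in MODELS if m[1] >= needed]
    if not fitting:
        return None
    return min(fitting, key=lambda m: m[2])[0]
```

For example, an 800K-token document resolves to Gemini 2.5 Flash, a 3M-token corpus resolves to Llama 4 Scout, and anything beyond 10M tokens returns `None`, meaning it would need to be split before summarizing.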

Full Ranking (All Compatible Models)

Prices are in USD per 1M tokens.

Rank  Model               Provider   Input   Output   Avg Bench  Score
#1    Llama 4 Scout       Meta       $0.18   $0.63    80.1%      109
#2    Gemini 2.5 Flash    Google     $0.15   $0.60    82.8%      93
#3    Gemini 3.1 Pro      Google     $2.00   $12.00   93.4%      68
#4    Gemini 3 Flash      Google     $0.50   $3.00    84.0%      62
#5    GLM-4.7             Zhipu AI   $0.60   $2.20    85.0%      62
#6    GPT-4o Mini         OpenAI     $0.15   $0.60    77.6%      56
#7    Gemini 2.5 Pro      Google     $1.25   $10.00   85.7%      55
#8    MiniMax M2.5        MiniMax    $0.30   $1.20    86.0%      54
#9    o3                  OpenAI     $0.40   $1.60    86.9%      54
#10   Mistral Medium 3    Mistral    $0.40   $2.00    81.5%      53
#11   Llama 4 Maverick    Meta       $0.31   $0.85    85.3%      53
#12   DeepSeek R1         DeepSeek   $0.55   $2.19    82.5%      52
#13   DeepSeek V3         DeepSeek   $0.14   $0.28    83.5%      50
#14   o4-mini             OpenAI     $1.10   $4.40    84.8%      48
#15   Claude Haiku 4.5    Anthropic  $0.80   $4.00    78.8%      47
#16   GLM-5               Zhipu AI   $1.00   $3.20    77.8%      47
#17   Gemini 3 Pro        Google     $2.00   $12.00   86.9%      43
#18   GPT-4o              OpenAI     $2.50   $10.00   78.6%      42
#19   Claude Sonnet 4.6   Anthropic  $3.00   $15.00   83.3%      38
#20   GPT-5.2 Codex       OpenAI     $1.75   $14.00   86.8%      38
#21   Claude Sonnet 4.5   Anthropic  $3.00   $15.00   81.9%      38
#22   Mistral Large 3     Mistral    $2.00   $5.00    87.0%      38
#23   GPT-5.3 Codex       OpenAI     $2.00   $16.00   88.2%      37
#24   GPT-5               OpenAI     $1.25   $10.00   85.7%      34
#25   Grok 4              xAI        $3.00   $15.00   83.7%      28
#26   Claude Opus 4.6     Anthropic  $5.00   $25.00   86.7%      24
