Zum Inhalt

Models with Prompt caching

Models supporting cached prompt prefixes for cost reduction.

84 models

Google Gemini

Nano Banana

100.0
Prompt caching33K ctx

Anthropic

Claude Opus 4

100.0
Prompt caching200K ctxTier C

Google Gemini

Gemini Flash-Lite Latest

100.0
Prompt caching1.048576M ctxTier C

Google Gemini

Nano Banana 2

100.0
Prompt caching66K ctx

OpenAI

gpt-4.1-mini

99.9
Prompt caching1.047576M ctxTier C

Anthropic

Claude Sonnet 4.5

99.9
Prompt caching200K ctxTier B

Anthropic

Claude Opus 4.5

99.9
Prompt caching200K ctxTier B

OpenAI

gpt-4.1-nano

99.9
Prompt caching1.047576M ctxTier C

Google Gemini

Gemini 3.1 Flash Lite

99.9
Prompt caching1.048576M ctx

OpenAI

gpt-4.1

99.8
Prompt caching1.047576M ctxTier B

Anthropic

Claude Opus 4.7

99.8
Prompt caching1M ctxTier B

OpenAI

gpt-4-turbo-2024-04-09

99.8
Prompt cachingTier C

OpenAI

gpt-4.1-2025-04-14

99.8
Prompt caching

OpenAI

gpt-4o-mini-2024-07-18

99.8
Prompt cachingTier C

OpenAI

gpt-4-turbo

99.8
Prompt caching128K ctxTier C

OpenAI

gpt-4o-2024-11-20

99.8
Prompt cachingTier C

OpenAI

gpt-4o-search-preview-2025-03-11

99.7
Prompt caching

OpenAI

gpt-4o-2024-08-06

99.7
Prompt cachingTier C

OpenAI

gpt-4o

99.7
Prompt caching128K ctxTier C

OpenAI

gpt-4o-mini

99.7
Prompt caching128K ctxTier C

OpenAI

gpt-4o-mini-search-preview

99.7
Prompt cachingTier C

OpenAI

gpt-4o-2024-05-13

99.7
Prompt cachingTier C

OpenAI

gpt-4.1-mini-2025-04-14

99.6
Prompt caching

Anthropic

Claude Haiku 4.5

99.6
Prompt caching200K ctxTier A

OpenAI

gpt-4.1-nano-2025-04-14

99.6
Prompt caching

Anthropic

Claude Sonnet 4

99.6
Prompt caching200K ctxTier C

Anthropic

Claude Opus 4.1

99.6
Prompt caching200K ctxTier C

Anthropic

Claude Opus 4.8

99.4
Prompt caching1M ctxTier A

OpenAI

gpt-5-search-api

99.4
Prompt cachingTier C

Anthropic

Claude Sonnet 4.6

99.3
Prompt caching1M ctxTier A

OpenAI

gpt-3.5-turbo-1106

99.2
Prompt caching

OpenAI

gpt-5-chat-latest

99.2
Prompt cachingTier C

Anthropic

Claude Opus 4.6

99.1
Prompt caching200K ctxTier B

OpenAI

gpt-5-search-api-2025-10-14

99.0
Prompt caching

OpenAI

gpt-4o-search-preview

99.0
Prompt cachingTier C

Google Gemini

Gemini 2.5 Flash-Lite

99.0
Prompt caching1.048576M ctxTier B

OpenAI

gpt-3.5-turbo-0125

98.8
Prompt caching

OpenAI

gpt-4-0613

98.6
Prompt caching

OpenAI

gpt-4

98.4
Prompt cachingTier C

OpenAI

gpt-4o-mini-search-preview-2025-03-11

96.6
Prompt caching

OpenAI

gpt-3.5-turbo-16k

94.8
Prompt caching

Google Gemini

Gemini 3 Flash Preview

94.3
Prompt caching1.048576M ctxTier C

OpenAI

gpt-3.5-turbo

91.8
Prompt cachingTier C

Google Gemini

Gemini Flash Latest

41.7
Prompt caching1.048576M ctxTier B

Google Gemini

Gemini 3.1 Pro Preview Custom Tools

38.2
Prompt caching1.048576M ctxTier C

Google Gemini

Gemini 3.1 Pro Preview

31.4
Prompt caching1.048576M ctxTier C

Google Gemini

Gemini 2.5 Flash

27.2
Prompt caching1.048576M ctxTier A

Google Gemini

Gemini 3.5 Flash

24.2
Prompt caching1.048576M ctxTier A

Google Gemini

Gemini Pro Latest

21.0
Prompt caching1.048576M ctxTier C

Google Gemini

Nano Banana Pro

12.5
Prompt caching131K ctx

Google Gemini

Gemini 2.5 Pro

8.3
Prompt caching1.048576M ctxTier A

Google Gemini

Deep Research Pro Preview (Dec-12-2025)

no data
Prompt caching131K ctx

OpenRouter

DeepSeek v3.2

no data
Prompt caching131K ctxTier A

Google Gemini

Gemini 2.5 Pro Preview TTS

no data
Prompt caching8K ctx

OpenAI

gpt-5

no data
Prompt cachingTier C

OpenAI

gpt-5-2025-08-07

no data
Prompt caching

OpenAI

gpt-5-mini

no data
Prompt cachingTier C

OpenAI

gpt-5-mini-2025-08-07

no data
Prompt caching

OpenAI

gpt-5-nano

no data
Prompt cachingTier C

OpenAI

gpt-5-nano-2025-08-07

no data
Prompt caching

OpenAI

gpt-5.1

no data
Prompt cachingTier B

OpenAI

gpt-5.1-2025-11-13

no data
Prompt caching

OpenAI

gpt-5.1-chat-latest

no data
Prompt cachingTier C

OpenAI

gpt-5.2

no data
Prompt cachingTier B

OpenAI

gpt-5.2-2025-12-11

no data
Prompt caching

OpenAI

gpt-5.2-chat-latest

no data
Prompt cachingTier C

OpenAI

gpt-5.3-chat-latest

no data
Prompt cachingTier C

OpenAI

gpt-5.4

no data
Prompt cachingTier A

OpenAI

gpt-5.4-2026-03-05

no data
Prompt caching

OpenAI

gpt-5.4-mini

no data
Prompt cachingTier A

OpenAI

gpt-5.4-mini-2026-03-17

no data
Prompt caching

OpenAI

gpt-5.4-nano

no data
Prompt cachingTier C

OpenAI

gpt-5.4-nano-2026-03-17

no data
Prompt caching

OpenAI

gpt-5.5

no data
Prompt cachingTier C

OpenAI

gpt-5.5-2026-04-23

no data
Prompt caching

OpenRouter

MiniMax M2.5

no data
Prompt caching256K ctxTier A

OpenAI

o1

no data
Prompt caching200K ctxTier C

OpenAI

o1-2024-12-17

no data
Prompt cachingTier C

OpenAI

o3

no data
Prompt caching200K ctxTier C

OpenAI

o3-2025-04-16

no data
Prompt caching

OpenAI

o3-mini

no data
Prompt caching200K ctxTier C

OpenAI

o3-mini-2025-01-31

no data
Prompt caching

OpenAI

o4-mini

no data
Prompt cachingTier C

OpenAI

o4-mini-2025-04-16

no data
Prompt caching