Last updated: 2026-04-18

What is Prompt caching?

Reusing a previously-sent prompt prefix (system message, tool list, large context) at a fraction of the normal token cost. Anthropic's ephemeral cache has a 5-minute TTL. Pins system prompts to slash routine-job cost by 60–90%.

See also

← Back to the full AI agent glossary.