Last updated: 2026-04-18
What is Prompt caching?
Reusing a previously-sent prompt prefix (system message, tool list, large context) at a fraction of the normal token cost. Anthropic's ephemeral cache has a 5-minute TTL. Pins system prompts to slash routine-job cost by 60–90%.
See also
← Back to the full AI agent glossary.