Prompt Caching Explained: Real Benchmarks, Real Savings Across Claude, GPT, and Gemini
We ran identical legal document analysis tasks against Claude, OpenAI, and Gemini - with and without prompt caching. Here are the exact token counts, latency numbers, and cost math at every scale.