Gemini Context Caching is now available on TypingMind! 🔥
Supported models:
- Gemini 1.5 Pro
- Gemini 1.5 Flash
The minimum input token count for context caching is 32,768.
Prompt Caching lets you make repeated API calls more efficiently by reusing context from recent prompts, which lowers input token costs and speeds up response times.
The Prompt Caching option is now available for Claude, OpenAI, and Google Gemini models.
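Under the hood, caching happens at the provider's API. As a rough illustration (not TypingMind's internal code), here is what Gemini context caching looks like with Google's `google-generativeai` Python SDK; the API key, file name, display name, and model version are placeholders:

```python
import datetime

import google.generativeai as genai
from google.generativeai import caching

genai.configure(api_key="YOUR_API_KEY")  # placeholder key

# Placeholder corpus; a real cache must meet the 32,768-token minimum.
long_document_text = open("reference_doc.txt").read()

# Upload the shared context once and keep it alive for an hour.
cache = caching.CachedContent.create(
    model="models/gemini-1.5-flash-001",
    display_name="reference-doc-cache",  # hypothetical name
    system_instruction="Answer questions using the attached document.",
    contents=[long_document_text],
    ttl=datetime.timedelta(hours=1),
)

# Later calls attach to the cache, so the document's tokens are billed
# at the reduced cached-input rate instead of being resent every time.
model = genai.GenerativeModel.from_cached_content(cached_content=cache)
response = model.generate_content("Summarize the key findings.")
print(response.text)
```

TypingMind manages the cache lifecycle for you; the sketch only shows why repeated calls over the same context get cheaper and faster.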
How it works
- Go to Model Settings
- Expand the Advanced Model Parameters section
- Scroll down and enable the “Prompt Caching” option (a sketch of what this toggles at the API level follows below)
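For Claude models, the equivalent mechanism is Anthropic's prompt caching, where individual content blocks are marked as cacheable. A minimal sketch with the Anthropic Python SDK is shown below; the file name, model version, and prompt are placeholders, and TypingMind applies the flag for you when the option is on:

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# Placeholder large context that stays the same across calls.
long_context = open("reference_doc.txt").read()

response = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    system=[
        {
            "type": "text",
            "text": long_context,
            # Marks this block for caching; repeat calls with an identical
            # prefix read from the cache instead of reprocessing the tokens.
            "cache_control": {"type": "ephemeral"},
        }
    ],
    messages=[{"role": "user", "content": "What does section 3 cover?"}],
)
print(response.content[0].text)
```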
Learn more ➣
Stay updated
Follow us on Twitter to stay informed about the latest updates, tips, and tutorials: