All Topics

Technology

Business

Entertainment

News

Programming

Security

Science

Design

Environment

Finance

Crypto

Politics

Sports

Education

Gaming

Art

Music

Health

Books

Food

Travel

Personal

Prompt Caching 101

1y ago

Source

OpenAIPrompt Caching 101openai.com

Snippet from the RSS feed

Cookbook to reduce latency and cost using OpenAI prompt caching.

You might also wanna read

Can JavaScript Become a Planned Runtime? — A study of AOT compilation, computation graphs, memory planning, and concurrency scheduling

helabenkhalfallah.medium.com·1mo ago

Building current, a browser-based file sharing tool, with Rust and WASM

current.rdvz.app·1mo ago

Butter Introduces Automatic Template Induction for LLM Response Caching

Butter, an HTTP proxy cache for LLM responses, has introduced automatic template induction for its response caching system. This new feature

blog.butter.dev·5mo ago

CacheKit: High-Performance Cache Policies and Data Structures for Rust Systems

CacheKit is a Rust library providing high-performance cache replacement policies and supporting data structures for systems programming. It

github.com·5mo ago

xAI Launches Grok 4 Fast: Cost-Efficient Reasoning Model with 2M Token Context

xAI has launched Grok 4 Fast, a new cost-efficient reasoning model that builds on learnings from Grok 4. The model delivers frontier-level p

OpenAI rolls out improved memory system for ChatGPT to better retain user preferences across conversations

OpenAI is rolling out a new, more scalable memory system for ChatGPT that addresses challenges with staleness, correctness, and scalability

openai.com·29d ago

Comments

Sign in to join the conversation.

No comments yet. Be the first.