
feat: add prompt caching support for OpenRouter provider#76

Open
danny-avila wants to merge 5 commits into dev from claude/objective-ramanujan

Conversation

@danny-avila
Owner

OpenRouter uses the same cache_control format as Anthropic's native API, so reuse addCacheControl() for cache breakpoint insertion. Also map cache_write_tokens to cache_creation in usage metrics.

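Because OpenRouter accepts Anthropic's `cache_control: { type: 'ephemeral' }` block format, the same breakpoint-insertion helper can serve both providers. A minimal sketch of the two pieces the description mentions; the content-block type and the exact shape of `addCacheControl` and the usage object are assumptions for illustration, not the project's actual implementation:

```typescript
// Hypothetical content-block shape shared by Anthropic and OpenRouter requests.
type ContentBlock = {
  type: 'text';
  text: string;
  cache_control?: { type: 'ephemeral' };
};

// Sketch of cache breakpoint insertion: mark the last content block so the
// provider caches everything up to and including it.
function addCacheControl(blocks: ContentBlock[]): ContentBlock[] {
  if (blocks.length === 0) return blocks;
  const last = blocks[blocks.length - 1];
  return [
    ...blocks.slice(0, -1),
    { ...last, cache_control: { type: 'ephemeral' } },
  ];
}

// Sketch of mapping OpenRouter usage fields onto Anthropic-style metric names
// (cache_write_tokens -> cache_creation, per the PR description).
function mapUsage(u: { cache_write_tokens?: number; cache_read_tokens?: number }) {
  return {
    cache_creation: u.cache_write_tokens ?? 0,
    cache_read: u.cache_read_tokens ?? 0,
  };
}
```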
@danny-avila
Owner Author

@codex review

@chatgpt-codex-connector

Codex Review: Didn't find any major issues. What shall we delve into next?

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

@dvejsada

@danny-avila Do you think this can be merged anytime soon? Currently Gemini models are slow as hell on OpenRouter, and the cost of Anthropic models without caching is huge (as our use case requires a lot of tool calls). Thanks!

@danny-avila
Owner Author

@codex review

danny-avila changed the base branch from main to dev on May 8, 2026 20:35

@chatgpt-codex-connector (Bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 22177b4eef


Comment thread on src/agents/AgentContext.ts (Outdated)
Comment on lines +631 to +634
promptCacheProvider === Providers.OPENROUTER &&
dynamicInstructions !== ''
) {
body = [new HumanMessage(dynamicInstructions), ...body];


P1: Count OpenRouter dynamic tail in instruction budget

When promptCache is enabled for OpenRouter, dynamicInstructions are moved into a prepended HumanMessage, but systemMessageTokens is still computed only from systemMessage. That makes instructionTokens underestimate prompt overhead, so pruning can admit too much history and the final request can exceed maxContextTokens in runs with large dynamic instructions (for example summary-in-system or long additional instructions). This is a regression specific to the new OpenRouter prompt-cache path because the dynamic segment is no longer included in the counted system message.

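The fix this comment asks for amounts to counting the relocated dynamic segment toward the instruction budget before pruning history. A sketch under assumed names (`countTokens`, `systemMessage`, `dynamicInstructions` are placeholders, not the project's actual identifiers):

```typescript
// When the dynamic tail is moved out of the system message and into a
// prepended user message (the OpenRouter prompt-cache path), it still
// consumes context, so it must still count toward instructionTokens.
function instructionTokens(
  countTokens: (s: string) => number, // placeholder tokenizer
  systemMessage: string,
  dynamicInstructions: string,
  movedToUserMessage: boolean,
): number {
  const systemTokens = countTokens(systemMessage);
  return movedToUserMessage
    ? systemTokens + countTokens(dynamicInstructions)
    : systemTokens;
}
```

With this, the pruner's remaining budget shrinks by the dynamic segment's size, so the final request stays under `maxContextTokens` even with large dynamic instructions.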

Comment thread on src/agents/AgentContext.ts (Outdated)
Comment on lines +631 to +634
promptCacheProvider === Providers.OPENROUTER &&
dynamicInstructions !== ''
) {
body = [new HumanMessage(dynamicInstructions), ...body];


P2: Preserve a stable first user message for OpenRouter cache

Prepending dynamicInstructions as the first non-system message makes the conversation opener change whenever those instructions vary (e.g., rotating summary text), which defeats OpenRouter’s sticky routing heuristic and drops prompt-cache hit rates for that session. The new behavior should keep the first non-system message stable and place volatile instructions later; otherwise prompt caching can silently degrade to repeated cache misses in dynamic-instruction workflows.

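The ordering this comment recommends can be sketched as follows: keep the first non-system message stable and insert the volatile instructions after it, so OpenRouter's routing and cache heuristics see an unchanged opener. The message shape and `placeDynamicInstructions` name are assumptions for illustration:

```typescript
type Msg = { role: 'system' | 'user'; content: string };

// Instead of prepending dynamicInstructions (which changes the conversation
// opener every time they vary), insert them after the first user message.
function placeDynamicInstructions(body: Msg[], dynamicInstructions: string): Msg[] {
  if (dynamicInstructions === '') return body;
  const note: Msg = { role: 'user', content: dynamicInstructions };
  const firstUser = body.findIndex((m) => m.role === 'user');
  if (firstUser === -1) return [...body, note];
  return [...body.slice(0, firstUser + 1), note, ...body.slice(firstUser + 1)];
}
```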
