fix(server): strip volatile billing header on all cache paths (OpenAI + codex) by dusterbloom · Pull Request #360 · Luce-Org/lucebox-hub

dusterbloom · 2026-06-10T07:36:45Z

Re-carved from #274 (commit 4ec8d17). A volatile per-turn header that clients prepend to the system prompt was poisoning the prefix cache (miss every turn) on the OpenAI path; the Anthropic path was already protected inline.

Extracts a pure, IO-free normalize_system_for_cache() (server/src/server/prompt_normalize.{h,cpp}) and routes all three cache paths through it before the prefix key is computed: Anthropic /v1/messages (refactored from the inline strip — single normalization path, DRY), OpenAI /v1/chat/completions messages[0] (was unprotected — the core fix), and codex /v1/responses instructions. Known-pattern strip only (x-anthropic-billing-header + the claude-code volatile block), position/whitespace-insensitive; legit system content is never stripped.

Unit-tested (6 cases: strips on both shapes, idempotent across a changing header, preserves legit content, leading-whitespace, prefix-key stable). The 1620 prior assertions are unaffected. Branch A/B measured 0%→97.1% warm hit-rate on the OpenAI path; end-to-end warm-hit re-validation on the carved code is a batched GPU gate.

5 files, +228/-23.

… + codex) - Pure normalizer normalize_system_for_cache() in prompt_normalize.{h,cpp}: no IO, no globals - OpenAI /v1/chat/completions + codex /v1/responses now call the pure fn before tokenize/hash - DRY: anthropic path's inline strip collapsed to thin caller (single normalization path) - 6 pure-function tests: strips billing header (Anthropic array + OpenAI msg0), idempotent across turn change, preserves legit content, handles leading-whitespace header, cache-key stable

cubic-dev-ai

3 issues found across 5 files

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="server/src/server/http_server.cpp">

<violation number="1" location="server/src/server/http_server.cpp:1346">
P2: OpenAI system header normalization is skipped when `messages[0].content` is an array, leaving a cache-miss path unnormalized.</violation>
</file>

<file name="server/src/server/prompt_normalize.cpp">

<violation number="1" location="server/src/server/prompt_normalize.cpp:48">
P1: `messages[0]["content"]` is accessed without verifying the key exists, which can throw and fail request handling for malformed OpenAI payloads.

(Based on your team's feedback about guarding JSON string reads and key/type checks.) [FEEDBACK_USED].</violation>
</file>

_{Tip: cubic used a learning from your PR history. Let your coding agent read cubic learnings directly with the cubic MCP.

Re-trigger cubic}

cubic-dev-ai · 2026-06-10T07:43:11Z

+        if (first.is_object() && first.contains("role")) {
+            // OpenAI messages array: strip billing-header lines from messages[0].
+            if (first.value("role", "") == "system") {
+                const auto & content = first["content"];


P1: messages[0]["content"] is accessed without verifying the key exists, which can throw and fail request handling for malformed OpenAI payloads.

(Based on your team's feedback about guarding JSON string reads and key/type checks.) .

View Feedback

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At server/src/server/prompt_normalize.cpp, line 48: <comment>`messages[0]["content"]` is accessed without verifying the key exists, which can throw and fail request handling for malformed OpenAI payloads. (Based on your team's feedback about guarding JSON string reads and key/type checks.) .</comment> <file context> @@ -0,0 +1,82 @@ + if (first.is_object() && first.contains("role")) { + // OpenAI messages array: strip billing-header lines from messages[0]. + if (first.value("role", "") == "system") { + const auto & content = first["content"]; + if (content.is_string()) { + return strip_billing_header_lines(content.get<std::string>()); </file context>

cubic-dev-ai · 2026-06-10T07:43:11Z

+            if (req.messages.is_array() && !req.messages.empty()) {
+                auto & m0 = req.messages[0];
+                if (m0.is_object() && m0.value("role", "") == "system" &&
+                    m0.contains("content") && m0["content"].is_string()) {


P2: OpenAI system header normalization is skipped when messages[0].content is an array, leaving a cache-miss path unnormalized.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At server/src/server/http_server.cpp, line 1346: <comment>OpenAI system header normalization is skipped when `messages[0].content` is an array, leaving a cache-miss path unnormalized.</comment> <file context> @@ -1355,6 +1339,14 @@ bool HttpServer::route_request(int fd, const HttpRequest & hr) { + if (req.messages.is_array() && !req.messages.empty()) { + auto & m0 = req.messages[0]; + if (m0.is_object() && m0.value("role", "") == "system" && + m0.contains("content") && m0["content"].is_string()) { + m0["content"] = dflash::common::normalize_system_for_cache(req.messages); + } </file context>

Suggested change

m0.contains("content") && m0["content"].is_string()) {

m0.contains("content") && (m0["content"].is_string() || m0["content"].is_array())) {

cubic-dev-ai Bot reviewed Jun 10, 2026

View reviewed changes

dusterbloom force-pushed the split/05-header-strip branch from cbac5f1 to 8f65f5c Compare June 10, 2026 07:45

davide221 merged commit 946eb38 into Luce-Org:main Jun 10, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(server): strip volatile billing header on all cache paths (OpenAI + codex)#360

fix(server): strip volatile billing header on all cache paths (OpenAI + codex)#360
davide221 merged 1 commit into
Luce-Org:mainfrom
dusterbloom:split/05-header-strip

dusterbloom commented Jun 10, 2026

Uh oh!

cubic-dev-ai Bot left a comment •

edited

Loading

Uh oh!

cubic-dev-ai Bot Jun 10, 2026

Uh oh!

cubic-dev-ai Bot Jun 10, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	m0.contains("content") && m0["content"].is_string()) {
	m0.contains("content") && (m0["content"].is_string() \|\| m0["content"].is_array())) {

Conversation

dusterbloom commented Jun 10, 2026

Uh oh!

cubic-dev-ai Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai Bot Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai Bot Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cubic-dev-ai Bot left a comment •

edited

Loading