@nikomatsakis
Summary

This PR refactors the Session API for clarity and fixes a race condition in proxy_remaining_messages.

Changes

API Renames

  • with_client → run_until on JrConnectionBuilder - Better reflects the blocking behavior (precedent: tokio::task::LocalSet::run_until)
  • spawn_session → start_session on SessionBuilder - The old name was misleading since the method blocks the current task

New Spawned Session Methods

  • on_session_start(cx, async |session| {...}) - Truly spawned session that returns immediately
  • on_proxy_session_start(cx, async |id| {...}) - Spawned proxy session variant

Builder Pattern for Blocking

  • Added block_task() method with sentinel types (Blocking/NonBlocking) to make blocking behavior explicit at the type level
  • start_session() and start_session_proxy() now require calling block_task() first
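The sentinel-type approach can be sketched in miniature. All names below are illustrative stand-ins for the pattern, not the actual sacp types:

```rust
use std::marker::PhantomData;

// Hypothetical miniature of the Blocking/NonBlocking typestate pattern;
// the real SessionBuilder carries far more state than this sketch.
struct Blocking;
struct NonBlocking;

struct SessionBuilder<B> {
    name: String,
    _state: PhantomData<B>,
}

impl SessionBuilder<NonBlocking> {
    fn new(name: &str) -> Self {
        SessionBuilder { name: name.to_string(), _state: PhantomData }
    }

    // Opt in to blocking: flips the type-level state so that the
    // blocking constructors below become visible.
    fn block_task(self) -> SessionBuilder<Blocking> {
        SessionBuilder { name: self.name, _state: PhantomData }
    }
}

impl SessionBuilder<Blocking> {
    // Only reachable after block_task(); omitting it is a compile error,
    // which is how the blocking behavior becomes explicit in the types.
    fn start_session(self) -> String {
        format!("session:{}", self.name)
    }
}
```

With this shape, writing `SessionBuilder::new("x").start_session()` without `block_task()` simply does not compile.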

Race Condition Fix

Fixed proxy_remaining_messages to prevent message loss/reordering during the transition from active session to proxy mode:

  1. Drop session handler registration (stops new messages to channel)
  2. Drop update_tx (closes channel for drain detection)
  3. Drain and forward any queued messages to client
  4. Install proxy handler for future messages
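Steps 2-3 rely on a standard channel property: once every sender is dropped, draining the receiver terminates instead of blocking. A minimal sketch with std::sync::mpsc (the real code uses the session's update channel, so names here are illustrative):

```rust
use std::sync::mpsc;

// Model of steps 2-3: close the channel by dropping the sender, then
// drain whatever was already queued, preserving order.
fn drain_after_close(
    update_tx: mpsc::Sender<&'static str>,
    update_rx: mpsc::Receiver<&'static str>,
) -> Vec<&'static str> {
    // Step 2: drop the sender so the receiver can detect "fully drained"
    // rather than waiting forever for more messages.
    drop(update_tx);
    // Step 3: iteration yields every queued message in order, then ends
    // because the channel is disconnected.
    update_rx.into_iter().collect()
}
```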

This required separating session_handler_registration from mcp_handler_registrations in ActiveSession.

Final API

Method                                         Blocks task?   Returns
.block_task().start_session()                  Yes            ActiveSession<'static>
.block_task().start_session_proxy(cx)          Yes            SessionId
.on_session_start(cx, async |session| {...})   No             ()
.on_proxy_session_start(cx, async |id| {...})  No             ()

Commits

The name run_until better reflects the semantics: run background
responders until the provided closure completes. This aligns with
tokio::task::LocalSet::run_until and similar APIs.

Updated all call sites across sacp, sacp-tokio, sacp-conductor,
elizacp, yopo, and associated tests/examples.

The name start_session better reflects that this method blocks
until the session handshake completes before returning. The
'spawn' terminology was misleading since the responders are
spawned but the method itself awaits.

Also renamed spawn_session_proxy to start_session_proxy for
consistency.

Add SessionBuilder::on_session_start() which spawns a task that runs
the provided closure once the session starts. Unlike start_session(),
this returns immediately without blocking the current task.

The closure receives an ActiveSession<'static, _> and runs in a
background task, making it suitable for fire-and-forget session
handling patterns.
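The fire-and-forget shape can be sketched with plain threads. The signature below is hypothetical; the real method spawns onto the runtime and hands the closure an ActiveSession rather than a string:

```rust
use std::thread;

// Sketch only: stands in for SessionBuilder::on_session_start. The
// closure runs on a spawned thread and this call returns immediately,
// without awaiting the session's work.
fn on_session_start<F>(handler: F)
where
    F: FnOnce(&'static str) + Send + 'static,
{
    thread::spawn(move || handler("active-session"));
}
```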

SessionBuilder now has a BlockState type parameter (Blocking/NonBlocking).
Methods that block the current task (run_until, start_session,
start_session_proxy) require calling block_task() first.

The on_session_start() method remains available without block_task()
since it spawns a task and returns immediately.

Also renamed run_session to run_until for consistency with
JrConnection::run_until.

…t_session_proxy

- start_session_proxy now returns SessionId instead of ()
- Add on_proxy_session_start() which spawns a task and runs a closure
  with the SessionId once the proxy is established
- The spawned variant doesn't require block_task() since it returns
  immediately

Separate session handler registration from MCP handler registrations
so they can be managed independently during the proxy transition.

The new implementation follows a careful sequence to prevent message
loss or reordering:

1. Drop session handler registration (stops new messages to channel)
2. Drop update_tx (closes channel for drain detection)
3. Drain and forward any queued messages to client
4. Install proxy handler for future messages

This ensures all messages are delivered in order during the transition
from active session mode to proxy mode.

The example now correctly handles the SessionId return value and
returns Ok(()) from the handler to satisfy IntoHandled bounds.

…arding

In proxy mode, forward messages to the conductor's successor using the
Agent endpoint instead of manually wrapping in SuccessorMessage and
sending to Client. This simplifies the code and prepares for fixing
the message ordering bug where responses bypass the conductor's
message loop.

…e loop

The conductor must maintain message ordering when forwarding between
components. Previously, send_proxied_message_to would respond directly
on the request context, bypassing the conductor's message loop and
causing a race condition where responses could arrive before
notifications.

Changed send_proxied_message_via to send_proxied_message_to_via with
an explicit endpoint parameter, and use it for all message forwarding
to ensure responses go through conductor_tx like notifications do.

Since the conductor can act as a proxy, it's important to be explicit
about which endpoint (Client or Agent) is being used rather than
relying on HasDefaultEndpoint. This makes the message flow direction
clear in the code.

- Change if_request/if_notification to if_request_from/if_notification_from with Client
- Change send_notification to send_notification_to with Client
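Both ideas, explicit endpoints and ordering through a single outbound queue, can be illustrated with a toy conductor. Every type below is invented for the sketch; only the method name mirrors the PR text:

```rust
// Toy model: send_proxied_message_to_via takes an explicit endpoint
// rather than relying on a default, and every message kind funnels
// through one FIFO, so a response enqueued after a notification can
// never overtake it.
#[derive(Debug, PartialEq)]
enum Endpoint { Client, Agent }

#[derive(Debug, PartialEq)]
enum Outgoing {
    Notification(&'static str),
    Response(&'static str),
}

struct Conductor {
    // Stand-in for conductor_tx: a single ordered outbound queue.
    queue: Vec<(Endpoint, Outgoing)>,
}

impl Conductor {
    fn send_proxied_message_to_via(&mut self, endpoint: Endpoint, msg: Outgoing) {
        self.queue.push((endpoint, msg));
    }
}
```

Because responses and notifications share the queue, delivery order matches enqueue order, which is the invariant the race fix restores.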
@nikomatsakis nikomatsakis merged commit 71c384f into symposium-dev:main Dec 23, 2025
1 check passed
