Skip to content

Conversation

@slayoffer
Copy link
Contributor

@slayoffer slayoffer commented Jan 4, 2026

Summary

Adds async_processing parameter to the MCP retain tool, allowing non-blocking memory storage.

Changes:

  • Added async_processing: bool = True parameter to retain tool
    • When True (default): queues memory for background processing, returns immediately
    • When False: waits for memory processing to complete before returning

This complements the existing multi-bank support added in v0.2.1.

Why

The retain tool was blocking callers during memory processing. This can be slow for complex content. The new parameter gives callers control:

  • Use async_processing=True for fire-and-forget storage (default, faster)
  • Use async_processing=False when you need confirmation before proceeding

Test plan

  • Start API with MCP enabled
  • Test retain with async_processing=True (should return immediately with operation_id)
  • Test retain with async_processing=False (should wait for completion)

🤖 Generated with Claude Code

@slayoffer slayoffer changed the title fix(mcp): correct http_app path parameter for MCP endpoint routing fix(mcp): correct http_app path and add async_processing parameter Jan 4, 2026
@nicoloboschi
Copy link
Collaborator

hey @slayoffer thanks for your contribution,
can you rebase and resolve conflicts? I did a similar fix recently

aevseev-xsolla and others added 4 commits January 6, 2026 06:57
Add async_processing parameter (default: True) to the MCP retain tool
to allow non-blocking memory storage. When True, memories are queued
for background processing and the tool returns immediately. When False,
the tool waits for completion before returning.

This matches the async behavior available in the HTTP API.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
Add two missing MCP tools to achieve feature parity with HTTP API:

- list_memories: browse memories with pagination and full-text search
  (equivalent to GET /memories/list)
- reflect: LLM-based reasoning over memories with disposition awareness
  (equivalent to POST /reflect)

Both tools follow the existing pattern with JSON string responses
and proper error handling.
- Add memory types explanation (world, experience, opinion, observation)
- Document retain/ and search/ submodule structure
- Add commands for single test run, ruff format, ty type checking
- Note MCP server implementation in API layer
- Add optional environment variables section
- Clarify conventions (no Python files at root, npm workspaces)
These are user-specific development tool configs that should not be committed.
@slayoffer slayoffer force-pushed the fix/mcp-http-app-path branch from bac37ca to 008a487 Compare January 5, 2026 21:00
@slayoffer slayoffer changed the title fix(mcp): correct http_app path and add async_processing parameter feat(mcp): add async_processing and list_memories tool Jan 5, 2026
return json.dumps({"error": str(e)})

@mcp.tool()
async def list_memories(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@slayoffer I think this tool is misleading for the agent, it should use recall, the list method is mostly for debugging and exploration, not for real usage

The list_memories endpoint is for debugging/exploration, not agent use.
Agents should use recall for semantic search instead.

Feedback from maintainer: "this tool is misleading for the agent,
it should use recall, the list method is mostly for debugging and
exploration, not for real usage"

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@slayoffer slayoffer changed the title feat(mcp): add async_processing and list_memories tool feat(mcp): add async_processing parameter to retain tool Jan 7, 2026
slayoffer and others added 2 commits January 8, 2026 06:59
These admin/orchestration tools are not needed for typical agent usage.
Agents work with a single configured bank via X-Bank-Id header.

MCP now exposes only core memory operations:
- retain: store memories
- recall: semantic search
- reflect: LLM reasoning over memories

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@nicoloboschi nicoloboschi merged commit 37fc7fb into vectorize-io:main Jan 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants