
Add /internal/chat endpoint for multi-model editor integration#25

Open
JMRussas wants to merge 8 commits into main from feature/internal-chat-endpoint

Conversation


@JMRussas JMRussas commented Mar 7, 2026

Summary

  • Adds standalone /api/internal/chat endpoint for editor integration (NoZ, VS Code extension)
  • Routes to CLI providers (Claude, Gemini, Codex) via async subprocess, Ollama via HTTP API
  • Supports conversation history (last 20 messages), per-provider model selection, and the native Ollama /api/chat messages array
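
The routing logic described above can be sketched as a small pure function. The field names (`prompt`, `provider`, `messages`) come from this summary; the function, constants, and the default provider are illustrative stand-ins, not the PR's actual implementation.

```python
# Hypothetical sketch of the /api/internal/chat routing described in the
# summary. Field names match the PR; everything else is illustrative.

CLI_PROVIDERS = {"claude", "gemini", "codex"}  # routed via async subprocess
HISTORY_LIMIT = 20  # only the last 20 messages are forwarded

def route_chat_request(payload: dict) -> dict:
    # Default provider is an assumption for this sketch.
    provider = payload.get("provider", "gemini")
    messages = (payload.get("messages") or [])[-HISTORY_LIMIT:]
    if provider == "ollama":
        # Ollama speaks the /api/chat messages array natively over HTTP.
        return {"transport": "http", "messages": messages}
    if provider in CLI_PROVIDERS:
        # CLI providers receive a single flattened prompt.
        prompt = "\n".join(m.get("content", "") for m in messages) or payload.get("prompt", "")
        return {"transport": "subprocess", "prompt": prompt}
    raise ValueError(f"unknown provider: {provider}")
```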

Test plan

  • curl -X POST http://localhost:5200/api/internal/chat -H 'Content-Type: application/json' -d '{"prompt": "hello", "provider": "gemini"}'
  • Test with conversation history: send messages array
  • Test Ollama routing: provider: "ollama" uses HTTP API directly
  • Test model selection: Gemini -m, Codex --model flags
  • Verify no auth required (no Bearer token needed)

Generated by Claude Code · Claude Opus 4.6

JMRussas and others added 8 commits March 7, 2026 13:07
Standalone chat proxy that routes to CLI providers (Claude, Gemini, Codex)
via subprocess and Ollama via HTTP API. Supports conversation history
(last 20 messages), model selection, and native Ollama /api/chat messages.

No auth — intended for trusted network access (editor integrations).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
shutil.which() resolves bare command names to full .cmd paths on Windows.
Avoids shell=True (command injection risk) while finding npm global CLIs.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
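
The resolution step from the commit above can be sketched in a few lines; `resolve_cli` is a hypothetical helper name, not the PR's code.

```python
# Sketch of the commit's approach: shutil.which() searches PATH (and honors
# PATHEXT on Windows), so a bare name like "gemini" resolves to its full
# "gemini.cmd" path and the subprocess can launch without shell=True.
import shutil

def resolve_cli(command: str) -> str:
    path = shutil.which(command)
    if path is None:
        raise FileNotFoundError(f"{command!r} not found on PATH")
    return path
```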
New llm_router.py: call_llm() routes through CLI subprocess (Claude,
Gemini, Codex) or Ollama HTTP with automatic fallback chain. Zero cost
on subscription billing.

Planner no longer requires ANTHROPIC_API_KEY. Budget reservation removed
(cost is always $0 on subscription). Provider fallback: gemini → claude → codex.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
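
The fallback chain from this commit (gemini → claude → codex) can be sketched as below; `call_llm` here is a simplified stand-in for the PR's `llm_router.py`, with providers passed in as plain callables for illustration.

```python
# Illustrative sketch of the provider fallback chain described above.
# The real llm_router.py routes through CLI subprocesses / Ollama HTTP;
# here each provider is just a callable, so the chain logic is visible.

FALLBACK_CHAIN = ["gemini", "claude", "codex"]

def call_llm(prompt: str, providers: dict) -> str:
    """Try each provider in order; return the first successful reply."""
    errors = []
    for name in FALLBACK_CHAIN:
        try:
            return providers[name](prompt)
        except Exception as exc:  # a failed provider falls through to the next
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))
```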
Bypasses auth to allow MCP and internal callers to trigger planning
via CLI providers. Includes traceback in error response for debugging.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Windows has a ~32K command line length limit. Large planning prompts
(system prompt + requirements) exceed this when passed as -p arguments.
Now pipes prompts via stdin for all CLI providers.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
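
The stdin-piping fix described above can be sketched with asyncio's subprocess API; `run_cli` is a hypothetical helper, not the PR's code.

```python
# Sketch of the fix: Windows caps the command line at ~32K characters, so
# large prompts are written to the child's stdin instead of being passed
# as a -p argument. communicate() writes the whole prompt and closes stdin.
import asyncio

async def run_cli(argv: list[str], prompt: str) -> str:
    proc = await asyncio.create_subprocess_exec(
        *argv,
        stdin=asyncio.subprocess.PIPE,
        stdout=asyncio.subprocess.PIPE,
        stderr=asyncio.subprocess.PIPE,
    )
    stdout, stderr = await proc.communicate(prompt.encode())
    if proc.returncode != 0:
        raise RuntimeError(stderr.decode(errors="replace"))
    return stdout.decode(errors="replace")
```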
Planner returns plan_id, not id. Use .get() with fallback for resilience.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Same Windows command line length fix as llm_router.py — pipe prompts
via stdin instead of passing as -p arguments to CLI providers.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
