Port mentra ai local by aisraelov · Pull Request #3238 · Mentra-Community/MentraOS

aisraelov · 2026-06-24T22:47:49Z

Summary by cubic

Ports Mentra AI (local) to a backend-driven design: the agent, visual classifier, and web search now run server-side behind @mentra/auth, while the miniapp keeps wake-word, vision, and chat with a refreshed UI. This removes AI keys from the bundle, adds model selection, optional giga‑agent delegation, app-control tools, and improves STT accuracy.

New Features
- Backend (Hono) with @mentra/auth middleware and routes: POST /api/agent, POST /api/classify, POST /api/search, GET /healthz. OpenRouter + Jina keys live server-side.
- Agent tool-loop moved server-side via OpenRouter chat-completions; validated model registry with vision support; ask_agent delegation to the control plane (optional), with polling and action buttons (e.g. OAuth links).
- App control tools: start_app, stop_app, invoke_app_action(...) recorded as device actions; background executes via session.miniapps/session.actions. Installed apps + actions are sent in context.
- Local miniapp background rebuilt on @mentra/miniapp: managers for audio/TTS, display HUD, photos, location+weather, notifications, transcription with wake word, and query orchestration. Client talks to the backend via auth’d fetch.
- UI webview: full chat interface, Settings page with model picker and theme, debug overlay; local channel RPC replaces REST/SSE; build scripts inline only MENTRA_PUBLIC_* backend URL.
- Speech: Soniox provider now biases toward Mentra terms (“Mentra”, “MentraOS”, “Mira”) to reduce mistranscriptions.
Migration
- Set env vars: OPENROUTER_API_KEY, JINA_API_KEY; optional delegation AGENTS_BASE_URL, MENTRA_AGENT_API_KEY. Miniapp needs MENTRA_PUBLIC_MENTRA_AI_BACKEND_URL.
- Dev: bun run dev (Doppler) or bun run dev:local; backend defaults to port 3131. For USB, run adb reverse tcp:3131 tcp:3131.
- Packing: bun run miniapp:pack zips dist/ to build/com.mentra.ai.local-<version>.zip; build.ts stages miniapp.json and icon.png.
- Manifest: sdkVersion 0.3.0, minHostVersion 1.42.0. No AI keys in the bundle; ensure Doppler/infra holds backend secrets.

^{Written for commit 52cdfa9. Summary will update on new commits.}

…Cloud V2

Give the fast Mastra agent an `ask_agent` tool that hands multi-step / personal-account / long-running work to the user's persistent giga-agent (the testaiassistant control plane), keeping the fast path for trivial queries. Delegation is a no-op unless AGENTS_BASE_URL + MENTRA_AGENT_API_KEY are set, so unconfigured environments behave exactly as before. - AgentBridge: control-plane REST client (message/wake/getTask), keyed on the SDK mentraUserId, forwarding POV photo + device context. - DelegationManager: registry for tasks that exceed the 2s grace window; settles once via webhook (primary) or poll backstop. - ask_agent tool + prompt routing: fast <=2s -> inline reply; slow -> the agent speaks a brief ack and the result is delivered later. - QueryProcessor.deliverFollowUp: re-enters the normal HUD/speaker output path, mirrors to the webview chat thread, and records the FINAL answer in history so the fast agent stays in sync. - Webhook receiver at /api/agent/webhook (public sub-app; verifies the shared x-api-key, no SDK token). - Predictive wake at "Hey Mentra" to warm the container during STT. - Webview chat renders agent action buttons (e.g. OAuth "Connect Gmail").

Adds a text input to the Mentra AI webview so the assistant can be tested by typing, with no glasses required, and gives typed messages fuller answers than the glasses voice/HUD path. - New `chat` channel: prompt drops device/vision/TTS/HUD sections and the hard word limit; markdown formatting encouraged. Glasses path unchanged. - QueryProcessor.processTextQuery: runs the agent (+ delegation) for a typed message, broadcasts the turn to the chat SSE stream, returns the answer. - POST /api/chat/send and the chat SSE stream move to the auth-only sub-app (get-or-create User) so chat works whether or not glasses are connected. - Frontend: input pill + circular send button (matches Parth's Figma), wired to /api/chat/send; responses render via the existing SSE stream. Renders agent action buttons (e.g. OAuth connect) on incoming messages.

Brings the Mentra AI webview in line with the finished V2 Figma and fixes a layout collision introduced by the chat input bar. - Header: adds the sparkle logo + "Mentra AI" / "Settings" title. - Settings: full V2 rebuild — Assistant (Model picker accordion, Wake word, Voice & language), Appearance (dark mode), About — with the card/section layout from the Figma. - Model picker: persists the choice to UserSettings.model and the agent honors it (MODEL_OPTIONS + resolveModel in config; QueryProcessor passes the resolved model to the agent, falling back to the default safely). - Chat input: replaces the old fixed bottom logo banner (which overlapped the input bar) with a "Say Hey Mentra or type below" hint chip; adds the decorative mic glyph inside the pill. - Home: subtitle now "Then ask a question — or type below.". - Fix: theme toggle used a stale-closure setState that couldn't revert on a failed save; switched to the functional updater.

Implements the giga-agent escalation in the local miniapp architecture (backend + background/ui), the correct home for it — the cloud-SDK miniapps/mentra-ai is only a reference copy. Backend (key stays server-side): - services/agent/delegation.ts: control-plane client (message/getTask/wake), gated on AGENTS_BASE_URL + MENTRA_AGENT_API_KEY. - tools.ts: `ask_agent` tool, advertised only when delegation is configured; routes through a per-request holder so the service can surface actions + a pending taskId out-of-band from the model text. - agent.service.ts: threads the authed mentraUserId + holder, returns {pendingTaskId?, actions?}; prompt.ts gains the delegation routing rules. - agent.api.ts: passes auth.mentraUserId; adds GET /task/:id (poll) and POST /wake (predictive warm) — both proxy the control plane so the key never leaves the server. Background + UI: - MentraAgent: returns pendingTaskId/actions; pollDelegation() polls the backend (which proxies the control plane — no webhook needed). - QueryProcessor: on a pending delegation, speaks the ack now, then awaitFollowUp() delivers the real answer as a second turn (chat:event + glasses + history). Fast-delegation actions ride the AI message. - system:open-url RPC channel → session.system.openUrl for OAuth connect buttons; ChatInterface renders message.actions as buttons. - Predictive wake fired at wake-word time. Model picker, chat box (chat:ask), and chat:event already existed here.

miniapps/mentra-ai is only a reference copy of the old cloud-SDK Mentra AI. My earlier giga-agent / chat-box / V2-design commits were placed there by mistake; the real work lives in miniapps/mentra-ai-local (@mentra/miniapp). This restores the reference folder to its pre-port state (becbd8d), removing the misplaced changes. mentra-ai-local is unaffected.

github-actions · 2026-06-24T22:48:00Z

📋 PR Review Helper

📱 Mobile App Build

✅ Ready to test! (commit 52cdfa9)

📥 Download APK

🕶️ ASG Client Build

⏳ Waiting for build...

🔀 Test Locally

gh pr checkout 3238

github-actions · 2026-06-24T22:49:35Z

PR Agent Orchestrator State

{
  "cycle": 3,
  "fixRound": 0,
  "totalReviewerRuns": 3,
  "consecutiveNoNewReviews": 3,
  "openFindings": [],
  "resolvedFindings": [],
  "nitFindings": [],
  "phase": "discovery",
  "status": "in_progress",
  "lastPair": [],
  "stagnationFixRounds": 0,
  "lastOpenCount": 0,
  "fingerprintReopenCounts": {},
  "mutedFingerprints": []
}

Mentra AI (com.mentra.ai.local) becomes a system app and can drive other miniapps from the agent. - Host: add com.mentra.ai.local to SYSTEM_APPS so the SDK's system-only miniapps/actions APIs are permitted. - Discovery via context: the background gathers session.miniapps.list() each query and passes the installed apps + their declared actions into the agent context (next to location/notifications) — no tool round-trip to discover. - Three deferred tools: start_app, stop_app, invoke_app_action(pkg, actionId, params). The backend records a DeviceAction (like the open_url actions) and returns an optimistic confirmation to the model; the background executes them after the response via session.miniapps.start/stop + session.actions.invoke. - Prompt gains an "App Control" section listing the available apps/actions. - All gated on apps being present (the tools simply aren't offered otherwise).

Fix the split-layout pack (the generic `mentra-miniapp pack` expects a flat miniapp with a build script; ours is backend/ + miniapp/). Adopt merge's pattern: build.ts now stages miniapp.json + icon.png into dist/, and the miniapp:pack script zips dist/ into build/com.mentra.ai.local-<version>.zip. Adds an icon.png (the Mentra AI mark) and ships the packed com.mentra.ai.local-1.0.0.zip so the host preloads it like the other system miniapps.

AryanFP and others added 13 commits June 15, 2026 13:50

Mentra AI is done... Need more testing tho it is blocked waiting for …

a4e52d0

…Cloud V2

Merge branch 'dev' into port-mentra-ai-local

d106e9b

added soniox context

733c129

implementing ui

e8ced38

fixing the ui now...

676a562

be right back fixing matt's issue

52a92dd

setting page done

60d838d

still WIP adding Parth's design

becbd8d

aisraelov added 2 commits June 24, 2026 16:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Port mentra ai local#3238

Port mentra ai local#3238
aisraelov wants to merge 15 commits into
devfrom
port-mentra-ai-local

aisraelov commented Jun 24, 2026 •

edited by cubic-dev-ai Bot

Loading

Uh oh!

github-actions Bot commented Jun 24, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

aisraelov commented Jun 24, 2026 • edited by cubic-dev-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by cubic

Uh oh!

github-actions Bot commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📋 PR Review Helper

📱 Mobile App Build

🕶️ ASG Client Build

🔀 Test Locally

Uh oh!

github-actions Bot commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Agent Orchestrator State

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aisraelov commented Jun 24, 2026 •

edited by cubic-dev-ai Bot

Loading

github-actions Bot commented Jun 24, 2026 •

edited

Loading

github-actions Bot commented Jun 24, 2026 •

edited

Loading