feat: self-hosted providers — connection test + model discovery (#3260) by theherrovn-sys · Pull Request #3409 · nesquena/hermes-webui

theherrovn-sys · 2026-06-02T15:06:49Z

Summary

Adds live connection testing and model discovery for Ollama, LM Studio, and custom OpenAI-compatible endpoints in Settings -> Providers. The work layers a provider adapter architecture on top of PR #3406 (the broader, maintainer-preferred base).

Architecture

1. Provider adapter module (`api/provider_adapters.py`)

test_connection(provider_id, base_url) -> probes connectivity, returns {ok, endpoint, error, tested_at}
list_models(provider_id, base_url, refresh=False) -> discovers models with 5-min TTL cache
Ollama adapter: GET {root}/api/tags (strips /v1 suffix to reach native API)
LM Studio adapter: GET {base_url}/v1/models
Custom OpenAI-compatible adapter: GET {base_url}/models
Thread-safe in-memory LRU cache (32 entries max)
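
The caching behaviour listed above (5-minute TTL, 32 entries, thread-safe, `refresh` bypass) could look roughly like the sketch below. This is not the code from `api/provider_adapters.py`; the class name `ModelCache`, the helper `list_models_cached`, and the injected `fetch` and `clock` callables are hypothetical names chosen for illustration.

```python
import threading
import time
from collections import OrderedDict

TTL_SECONDS = 300  # 5 minutes, as stated in the description
MAX_ENTRIES = 32   # LRU capacity, as stated in the description


class ModelCache:
    """Hypothetical thread-safe LRU cache whose entries expire after a TTL."""

    def __init__(self, ttl=TTL_SECONDS, max_entries=MAX_ENTRIES, clock=time.monotonic):
        self._ttl = ttl
        self._max = max_entries
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._lock = threading.Lock()
        self._data = OrderedDict()  # key -> (stored_at, models)

    def get(self, key):
        with self._lock:
            entry = self._data.get(key)
            if entry is None:
                return None
            stored_at, models = entry
            if self._clock() - stored_at >= self._ttl:
                del self._data[key]  # expired: drop it and report a miss
                return None
            self._data.move_to_end(key)  # mark as most recently used
            return models

    def put(self, key, models):
        with self._lock:
            self._data[key] = (self._clock(), list(models))
            self._data.move_to_end(key)
            while len(self._data) > self._max:
                self._data.popitem(last=False)  # evict least recently used


def list_models_cached(cache, provider_id, base_url, fetch, refresh=False):
    """Return cached models unless refresh is requested or the entry is stale."""
    key = (provider_id, base_url)
    if not refresh:
        cached = cache.get(key)
        if cached is not None:
            return cached
    models = fetch(provider_id, base_url)  # assumed to perform the HTTP request
    cache.put(key, models)
    return models
```

Keying on `(provider_id, base_url)` is an assumption; it keeps two servers of the same provider type from sharing a cache entry.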

2. New API endpoints (`api/routes.py`)

GET /api/providers/test?provider=ollama&base_url=http://localhost:11434/v1 -> connection test
GET /api/providers/models?provider=ollama&base_url=...&refresh=true -> model discovery
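
Because `base_url` is itself a URL, a client would normally percent-encode it when calling these routes. The sketch below builds the two request paths with the standard library; the helper name `build_providers_url` is hypothetical, and only the route paths and parameter names are taken from the description above.

```python
from urllib.parse import urlencode


def build_providers_url(action, provider, base_url, refresh=False):
    """Build a request path for the test or models route (illustrative only)."""
    if action not in ("test", "models"):
        raise ValueError("action must be 'test' or 'models'")
    params = {"provider": provider, "base_url": base_url}
    if refresh and action == "models":
        params["refresh"] = "true"  # ask the server to bypass its cache
    # urlencode percent-encodes ':' and '/' inside base_url
    return "/api/providers/" + action + "?" + urlencode(params)


test_url = build_providers_url("test", "ollama", "http://localhost:11434/v1")
models_url = build_providers_url(
    "models", "ollama", "http://localhost:11434/v1", refresh=True
)
```

Whether the server also accepts an unencoded `base_url`, as written in the examples above, depends on its query parser; encoding is the safer default.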

3. Enhanced Settings UI (`static/panels.js`)

"Test Connection" button on each self-hosted provider card
Per-card status emoji: ✅ connected, ❌ failed, ⏳ testing
Card border color-coded: green (connected), red (failed), accent (testing)
Failed cards gray out name/meta for clear visual signal
Model dropdown auto-populated from live provider API after successful test
Model refresh button for on-demand re-discovery
Graceful fallback: offline servers show clear error, never crash

4. CSS (`static/style.css`)

.provider-card-connected / .provider-card-failed / .provider-card-testing states
.provider-card-conn-status - inline emoji indicator
.provider-card-refresh-btn - model refresh button styling

Hardware Verified

Tested on real NVIDIA DGX Atom (GB10 Grace Blackwell):

| Component | Value |
| --- | --- |
| GPU | NVIDIA GB10 |
| Compute Capability | 12.1 (SM121) |
| Architecture | aarch64 (ARM64) |
| Ollama Version | latest |
| Models | 15 (confirmed via GET /api/tags) |

Changes from PR #3406

This branch is directly rebased on PR #3406 (4 commits including the self-hosted providers base). Our new commits add:

| File | Lines | Purpose |
| --- | --- | --- |
| `api/provider_adapters.py` | +358 | Provider adapter layer (NEW) |
| `api/routes.py` | +24 | test + models GET endpoints |
| `static/panels.js` | +74/-15 | Test connection + model discovery UI |
| `static/style.css` | +16 | Status indicator + refresh button CSS |

Related

Supersedes PR #3404, "fix: add self-hosted provider setup (Ollama/LM Studio) in Settings → Providers (#3260)" (closed; narrower approach)
Rebased on PR #3406, "feat(providers): self-hosted setup in Settings after onboarding (#3260)" (broader, maintainer-preferred base)
Fixes #3260, "Feature: self-hosted provider setup (Ollama / LM Studio) in Settings → Providers after onboarding"
Addresses vLLM issue vllm-project/vllm#36821, "[Bug]: No sm_121 (Blackwell) support on aarch64 — NVIDIA DGX Spark / Acer GN100"

…uena#3260)

Add Settings → Providers cards for Ollama, LM Studio, and custom OpenAI-compatible endpoints with base URL probe, model selection, and optional API key. Persist via POST /api/providers/self-hosted without rerunning onboarding; keep local Ollama independent of OLLAMA_API_KEY used for Ollama Cloud (nesquena#1410).

…ed cards (nesquena#3260)

Implements GPT 5.5 provider adapter architecture on top of PR nesquena#3406:

- New api/provider_adapters.py with test_connection() and list_models() for Ollama (GET /api/tags), LM Studio (GET /v1/models), and custom OpenAI-compatible endpoints (GET /models). Includes 5-min TTL cache.
- New API routes: GET /api/providers/test?provider=&base_url= and GET /api/providers/models?provider=&base_url=&refresh=true.
- Enhanced Settings → Providers self-hosted cards with:
  * "Test Connection" button → validates connectivity, shows status banner
  * Per-card status indicator (✅ connected / ❌ failed / ⏳ testing)
  * Gray-out + red border when server is unreachable
  * Model discovery dropdown populated from live provider API
  * Model refresh ↻ button for re-discovering models on demand
  * Graceful fallback: offline servers show clear error, never crash

Verified on real NVIDIA DGX Atom (GB10, SM121, aarch64) with Ollama serving 15 models on port 11434.

Added a _normalize_base_url() helper that strips a trailing /v1 suffix before constructing endpoint URLs, preventing doubled paths like /v1/v1/models when the user supplies a base URL ending in /v1.

Applied to: _lmstudio_test_connection, _lmstudio_list_models, _custom_test_connection, _custom_list_models.

Verified on DGX Atom with Ollama (15 models, /api/tags).
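
A minimal sketch of the normalization described in this commit, assuming the helper only trims whitespace, trailing slashes, and one trailing `/v1`. The function bodies are illustrative and not copied from the PR; `models_endpoint` and the provider id `"lmstudio"` are hypothetical. The custom adapter is left out because the description does not state which path it uses after normalization.

```python
def normalize_base_url(base_url):
    """Strip whitespace, trailing slashes, and a single trailing /v1 segment."""
    url = base_url.strip().rstrip("/")
    if url.endswith("/v1"):
        url = url[: -len("/v1")]
    return url


def models_endpoint(provider_id, base_url):
    """Return the model-listing URL for a provider (illustrative mapping)."""
    root = normalize_base_url(base_url)
    if provider_id == "ollama":
        return root + "/api/tags"   # Ollama's native API sits at the server root
    if provider_id == "lmstudio":   # assumed provider id
        return root + "/v1/models"  # /v1 is re-added exactly once
    raise ValueError("unsupported provider in this sketch: " + provider_id)


lmstudio_url = models_endpoint("lmstudio", "http://localhost:1234/v1/")
ollama_url = models_endpoint("ollama", "http://localhost:11434/v1")
```

Normalizing first and appending the versioned path afterwards means a base URL with or without `/v1` resolves to the same endpoint.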

pamnard and others added 5 commits June 2, 2026 16:45

test: fix nesquena#1420 LM Studio expectations and ruff on nesquena#3260

f01371c

Tests: LM Studio now uses the self-hosted Settings card (is_self_hosted) instead of API-key-only configurable=True. Drop unused imports flagged by the CI ruff gate.

i18n: add self-hosted provider keys to all locales (nesquena#3260)

9ea3a35

Fix ja/zh/tr/it/pt/fr/zh-Hant locale parity tests after providers panel keys were only added to English fallback blocks.

fix(ui): keep providers panel filter string for regression tests

3094377

Split self-hosted cards onto a separate filter line so panels.js still contains filter(p=>p.configurable||p.is_oauth||p.is_custom) (nesquena#1202, nesquena#3260).

theherrovn-sys changed the title ~~[NVIDIA] Self-hosted providers: connection test + model discovery (#3260, GPT 5.5 architecture)~~ [NVIDIA] Self-hosted providers: connection test + model discovery (#3260, Provider adapter architecture) Jun 2, 2026

theherrovn-sys changed the title ~~[NVIDIA] Self-hosted providers: connection test + model discovery (#3260, Provider adapter architecture)~~ feat: self-hosted providers — connection test + model discovery (#3260) Jun 2, 2026

theherrovn-sys closed this Jun 2, 2026

theherrovn-sys wants to merge 6 commits into nesquena:master from theherrovn-sys:fix/3260-self-hosted-providers-v2
