OpenAI-compatible LLM proxy with provider fallback chain.
Synapse sits between your application and one or more LLM providers, routing requests through a prioritized chain. If the first provider fails, it automatically falls back to the next. It exposes a standard OpenAI-compatible API so clients need no modification.

```bash
curl -fsSL https://github.com/shetty4l/synapse/releases/latest/download/install.sh | bash
```

Requires Bun, curl, tar, and jq. Installs to `~/srv/synapse/` and symlinks the CLI to `~/.local/bin/synapse`.
- Provider fallback chain -- priority-based routing with automatic failover on upstream errors
- Health tracking -- consecutive failure counting with configurable thresholds and exponential cooldown; auto-recovery when cooldown expires
- Streaming -- SSE passthrough with idle timeout detection for stalled streams
- Request logging -- buffered JSONL writes with size-based rotation
- Configuration -- JSON config file with `${ENV_VAR}` interpolation
- Model aggregation -- `/v1/models` returns a deduplicated list from all healthy providers

```bash
# if installed via the install script
synapse start

# or for development
bun install && bun run start
```

Synapse listens on port 7750 by default and forwards requests to a local Ollama instance.

```bash
# verify it's running
curl http://localhost:7750/health

# send a chat completion
curl http://localhost:7750/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3", "messages": [{"role": "user", "content": "hello"}]}'
```

Synapse loads its configuration from `~/.config/synapse/config.json`. If no file exists, it uses defaults (a single Ollama provider at `localhost:11434`).

```json
{
"port": 7750,
"providers": [
{
"name": "ollama",
"baseUrl": "http://localhost:11434/v1",
"models": ["*"],
"maxFailures": 3,
"cooldownSeconds": 60
},
{
"name": "openai",
"baseUrl": "https://api.openai.com/v1",
"apiKey": "${OPENAI_API_KEY}",
"models": ["gpt-4", "gpt-4o", "gpt-3.5-turbo"],
"maxFailures": 3,
"cooldownSeconds": 120
},
{
"name": "groq",
"baseUrl": "https://api.groq.com/openai/v1",
"apiKey": "${GROQ_API_KEY}",
"models": ["llama-3.3-70b-versatile", "mixtral-8x7b-32768"],
"maxFailures": 5,
"cooldownSeconds": 30
}
]
}
```

Providers are tried in order -- the first provider in the list has the highest priority.

| Field | Required | Default | Description |
|---|---|---|---|
| `name` | yes | -- | Unique display name (used in logs and the health endpoint) |
| `baseUrl` | yes | -- | Base URL including the version prefix, e.g. `http://localhost:11434/v1` |
| `apiKey` | no | -- | Bearer token. Supports `${ENV_VAR}` interpolation |
| `models` | yes | -- | Models this provider serves. Use `["*"]` to match any model |
| `maxFailures` | no | 3 | Consecutive failures before marking the provider unhealthy |
| `cooldownSeconds` | no | 60 | Seconds to wait before retrying an unhealthy provider |
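
Expressed as a TypeScript shape (illustrative only, not the project's exported types), a config file with these fields looks like:

```typescript
// Illustrative config shape; field names follow the table above.
interface ProviderConfig {
  name: string;             // unique display name
  baseUrl: string;          // includes the version prefix, e.g. ".../v1"
  apiKey?: string;          // supports ${ENV_VAR} interpolation
  models: string[];         // ["*"] matches any model
  maxFailures?: number;     // default 3
  cooldownSeconds?: number; // default 60
}

interface SynapseConfig {
  port?: number;            // default 7750
  providers: ProviderConfig[];
}
```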

| Variable | Description | Default |
|---|---|---|
| `SYNAPSE_PORT` | Override the listening port (highest precedence) | 7750 |
| `SYNAPSE_CONFIG_PATH` | Override the config file path | `~/.config/synapse/config.json` |

Any string value in the config file supports `${ENV_VAR}` interpolation. If a referenced variable is not set, loading fails with an error.
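
As an illustration, interpolation along these lines (a sketch, not Synapse's actual implementation) resolves each `${VAR}` reference at load time and fails fast when the variable is missing:

```typescript
// Replace every ${VAR} occurrence in a config string with process.env.VAR.
// Throws if the variable is unset, mirroring the "loading fails" rule above.
function interpolateEnv(value: string): string {
  return value.replace(/\$\{([A-Z0-9_]+)\}/gi, (_, name: string) => {
    const resolved = process.env[name];
    if (resolved === undefined) {
      throw new Error(`Environment variable ${name} is not set`);
    }
    return resolved;
  });
}

// e.g. interpolateEnv("${OPENAI_API_KEY}") returns the exported key,
// or throws if OPENAI_API_KEY is not set.
```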
All endpoints return OpenAI-format responses and errors.

`POST /v1/chat/completions` proxies a chat completion request through the provider fallback chain. It supports both streaming (`"stream": true`) and non-streaming requests; a streaming client sketch follows the list below.

- Requires a JSON body with a `model` field
- Max body size: 1 MB
- On 2xx-4xx from a provider: returns the response immediately (client errors are not retried)
- Exception: a 404 from a wildcard (`["*"]`) provider falls through to the next provider (the wildcard provider doesn't have the requested model)
- On 5xx or network error: falls back to the next provider
- If all providers fail: returns 502
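
A minimal streaming client sketch, assuming Synapse is running on the default port 7750; the model name `llama3` is just the one from the earlier example:

```typescript
// Stream a chat completion through the proxy and print the raw SSE chunks.
// Synapse passes the provider's SSE stream through, so the body is a
// sequence of "data: {...}" lines terminated by "data: [DONE]".
const response = await fetch("http://localhost:7750/v1/chat/completions", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    model: "llama3",
    stream: true,
    messages: [{ role: "user", content: "hello" }],
  }),
});

if (!response.ok || !response.body) {
  throw new Error(`Request failed with status ${response.status}`);
}

const reader = response.body.getReader();
const decoder = new TextDecoder();
while (true) {
  const { done, value } = await reader.read();
  if (done) break;
  process.stdout.write(decoder.decode(value, { stream: true }));
}
```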
`GET /v1/models` returns an aggregated, deduplicated model list from all healthy providers. If multiple providers serve the same model, the first provider in config order wins.
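
The deduplication amounts to "first write wins" keyed by model id. A sketch with illustrative types (not the project's internals):

```typescript
// Keep the first occurrence of each model id, in provider (priority) order.
interface ProviderModels {
  provider: string;
  models: string[];
}

function aggregateModels(providers: ProviderModels[]): { id: string; owned_by: string }[] {
  const seen = new Map<string, string>();
  for (const { provider, models } of providers) {
    for (const id of models) {
      if (!seen.has(id)) seen.set(id, provider); // earlier providers win
    }
  }
  return [...seen].map(([id, owned_by]) => ({ id, owned_by }));
}
```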

`GET /health` returns service health and per-provider status: HTTP 200 if all providers are healthy, 503 if any are degraded. The response includes a reachability check (cached for 30 seconds).

```json
{
"status": "healthy",
"version": "0.1.0",
"providers": [
{
"name": "ollama",
"healthy": true,
"reachable": true,
"consecutiveFailures": 0
}
]
}
```

For each chat completion request (see the sketch after this list):
- Iterate providers in config order
- Skip if the provider's model list doesn't match the requested model (unless `["*"]`)
- Skip if the provider is currently unhealthy (cooldown hasn't expired)
- Forward the request to the provider
- On success or 4xx client error: return the response, record success in health tracker
- On 5xx or network/timeout error: record failure, try the next provider
- If all providers exhausted: return 502 with details of what was attempted and skipped
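
A condensed sketch of that loop, with illustrative types and helpers (health bookkeeping and the upstream request are stubbed out; this is not the project's actual code):

```typescript
// Try providers in priority order; fall back on 5xx/network errors,
// return immediately on success or client errors (except wildcard 404s).
interface Provider {
  name: string;
  models: string[]; // ["*"] matches any model
}

async function routeChatCompletion(
  providers: Provider[],
  model: string,
  forward: (p: Provider) => Promise<Response>, // performs the upstream request
  isHealthy: (p: Provider) => boolean,         // false while in cooldown
): Promise<Response> {
  const attempted: string[] = [];
  for (const provider of providers) {
    const wildcard = provider.models.includes("*");
    if (!wildcard && !provider.models.includes(model)) continue; // model mismatch
    if (!isHealthy(provider)) continue;                          // cooling down

    attempted.push(provider.name);
    try {
      const res = await forward(provider);
      if (wildcard && res.status === 404) continue; // wildcard provider lacks the model
      if (res.status < 500) return res;             // success or 4xx: stop here
    } catch {
      // network/timeout error: fall through to the next provider
    }
  }
  return Response.json(
    { error: { message: `All providers failed for model ${model}`, attempted } },
    { status: 502 },
  );
}
```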
- Providers start healthy with zero failures
- Each failure increments a consecutive failure counter
- When failures reach `maxFailures`: the provider is marked unhealthy and the cooldown timer starts
- If it keeps failing while unhealthy: the cooldown is extended
- On success: counter resets to zero
- Auto-recovery: when cooldown expires, the provider is automatically eligible for routing again
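
A rough sketch of this state machine; the doubling backoff below is an assumption about how the "exponential cooldown" is computed, not the documented formula:

```typescript
// Per-provider health state: a consecutive-failure counter plus a cooldown deadline.
class HealthTracker {
  private failures = 0;
  private extensions = 0;
  private unhealthyUntil = 0; // epoch ms; 0 means healthy

  constructor(
    private maxFailures = 3,
    private cooldownSeconds = 60,
  ) {}

  recordSuccess(): void {
    this.failures = 0;
    this.extensions = 0;
    this.unhealthyUntil = 0;
  }

  recordFailure(now = Date.now()): void {
    this.failures += 1;
    if (this.failures >= this.maxFailures) {
      // Assumed exponential backoff: base cooldown doubled on each extension.
      const cooldownMs = this.cooldownSeconds * 1000 * 2 ** this.extensions;
      this.unhealthyUntil = now + cooldownMs;
      this.extensions += 1;
    }
  }

  // Eligible for routing again once the cooldown has expired (auto-recovery).
  isHealthy(now = Date.now()): boolean {
    return now >= this.unhealthyUntil;
  }
}
```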
Request logs are written as JSONL to ~/.config/synapse/logs/requests.log.
- Buffered with 1-second flush interval
- Size-based rotation at 50 MB (keeps current + 1 rotated file)
- Each entry records: timestamp, model, provider, latency, status, streaming flag, attempted/skipped providers, and error (if any)
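
The exact JSONL schema isn't documented here; a plausible shape for one entry, with field names as assumptions derived from the list above:

```typescript
// One JSONL line per request; field names are illustrative.
interface RequestLogEntry {
  timestamp: string;            // ISO 8601
  model: string;                // requested model
  provider: string | null;      // provider that served the request, if any
  latencyMs: number;            // end-to-end latency
  status: number;               // HTTP status returned to the client
  streaming: boolean;           // whether the request used SSE streaming
  attemptedProviders: string[];
  skippedProviders: string[];
  error?: string;               // present only on failure
}
```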
After installing, the synapse command is available:

```
synapse start       Start the server (background daemon)
synapse stop        Stop the daemon
synapse status      Show daemon status
synapse restart     Restart the daemon
synapse serve       Start the server (foreground)
synapse health      Check health of running instance
synapse config      Print resolved configuration
synapse logs [n]    Show last n request log entries (default: 10)
synapse version     Show version
```
All commands support `--json` for machine-readable output.

```bash
# start synapse in the background
synapse start

# check daemon status
synapse status

# check provider health
synapse health

# view resolved config (API keys masked)
synapse config

# tail the last 25 request log entries
synapse logs 25

# stop the daemon
synapse stop
```

For development:

```bash
bun install # install dependencies
bun run start # start the server
bun run test # run tests
bun run typecheck # type check
bun run lint # lint with oxlint
bun run format # format with biome (auto-fix)
bun run format:check # check formatting
bun run validate # run all checks (typecheck + lint + format + test)
```

- Runtime: Bun -- runs TypeScript directly, no build step
- Type checking: TypeScript (strict mode)
- Formatting: Biome
- Linting: oxlint
- Git hooks: Husky -- pre-commit runs `bun run validate`
- CI runs on all PRs and pushes to `main`: typecheck, lint, format check, tests
- Release runs automatically after CI passes on `main`: computes the semver bump from commit markers, creates a git tag and a GitHub release with a source tarball
Version bumps:

```bash
bun run version:bump minor # next release will be a minor bump
bun run version:bump major # next release will be a major bump
# patch bumps happen automatically if no marker is found
```