feat: safe mode — redirect send to draft editor, block edit/delete by TheOutdoorProgrammer · Pull Request #106 · stablyai/agent-slack

TheOutdoorProgrammer · 2026-06-10T14:47:02Z

Summary

Adds safe mode, an enforced human-in-the-loop mode for AI agent environments, enabled via the AGENT_SLACK_SAFE_MODE env var (1/true/yes/on) or a global --safe-mode flag.
While safe mode is active, message send is redirected to the browser draft editor, and message edit and message delete are blocked.
Implements feature 1 of issue #80, "Feature: Safe mode (redirect send→draft) and background/detach mode for draft editor". Feature 2 (background/detach mode for the draft editor) is not part of this PR.

Changes

Skill instructions like "always use draft, never send" are guidance an agent can ignore. Safe mode enforces the policy at the tool level so nothing posts to Slack without human review.

src/cli/safe-mode.ts (new): isSafeModeEnabled() (env var + CLI flag), safeModeBlockedError(), and redirectSendToDraft(). The redirect opens the draft editor with the send text and thread context pre-filled, prints a stderr warning, and marks the JSON output with "safe_mode": true and "redirected_from": "send". Send flags the editor cannot represent (--attach, --blocks, --schedule, --schedule-in, --reply-broadcast) are rejected with an explicit error instead of being silently dropped. The CI direct-send shortcut in the draft path is also blocked, so CI=1 cannot bypass safe mode.
src/cli/message-command.ts: wires the redirect into send and the blocks into edit/delete. Read operations and reactions are unchanged (reactions felt low-risk; easy to gate later if desired).
src/index.ts: registers the global --safe-mode flag.
test/safe-mode.test.ts (new): 22 tests covering env var parsing, flag override, unsupported-flag rejection (individually and combined), the CI guard, and the redirect payload/draft invocation via an injected draft function.
Docs: README "Safe mode" section, skills/agent-slack/references/commands.md notes on send/edit/delete, and a SKILL.md mention so agents know the behavior exists.
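
To make the toggle and the flag rejection concrete, here is a minimal sketch of how they might look. The function names isSafeModeEnabled() and safeModeBlockedError() come from the description above; their signatures, the unsupportedSendFlags() helper, the option key names, the error wording, and the case-insensitive/trimmed env parsing are all assumptions for illustration, not the code in this PR.

```typescript
// Hypothetical sketch: signatures and bodies are assumed, not taken from the PR diff.
const TRUTHY_VALUES = new Set(["1", "true", "yes", "on"]);

type Env = Record<string, string | undefined>;

// Safe mode is on if the global --safe-mode flag was passed, or if
// AGENT_SLACK_SAFE_MODE holds one of the accepted truthy values.
// Trimming and lowercasing the env value is an assumption.
function isSafeModeEnabled(flag: boolean | undefined, env: Env): boolean {
  if (flag === true) return true;
  const raw = env["AGENT_SLACK_SAFE_MODE"];
  if (raw === undefined) return false;
  return TRUTHY_VALUES.has(raw.trim().toLowerCase());
}

// Send options the draft editor cannot represent, paired with the CLI flag
// named in the error. The option key names are hypothetical.
const UNSUPPORTED_IN_SAFE_MODE: ReadonlyArray<readonly [string, string]> = [
  ["attach", "--attach"],
  ["blocks", "--blocks"],
  ["schedule", "--schedule"],
  ["scheduleIn", "--schedule-in"],
  ["replyBroadcast", "--reply-broadcast"],
];

// Returns every unsupported flag that was actually set, so the caller can
// reject them all in one explicit error instead of silently dropping them.
function unsupportedSendFlags(options: Record<string, unknown>): string[] {
  const isSet = (v: unknown): boolean =>
    v !== undefined && v !== null && v !== false && !(Array.isArray(v) && v.length === 0);
  return UNSUPPORTED_IN_SAFE_MODE
    .filter(([key]) => isSet(options[key]))
    .map(([, cliFlag]) => cliFlag);
}

// Error used for commands that are blocked outright (edit, delete).
function safeModeBlockedError(command: string): Error {
  return new Error(`safe mode is active: "${command}" is blocked`);
}
```

Passing the environment in as an argument, rather than reading process.env inside the function, keeps the parsing testable without mutating global state.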

Verified manually: env var and flag both block edit/delete (exit 1), send with unsupported flags errors, and send with plain text opens the draft editor pre-filled with the warning on stderr. Full suite passes (261 tests), typecheck clean, lint warnings unchanged from baseline.
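
The redirect behavior verified above can be sketched with an injected draft function, which is how the tests are described as exercising it. Only the name redirectSendToDraft() and the "safe_mode" and "redirected_from" output fields come from this PR; the request shapes, parameter order, and warning text are hypothetical, and the sketch is synchronous for brevity even though a browser-backed editor would presumably be asynchronous.

```typescript
// Hypothetical sketch: types and signatures are assumed, not taken from the PR diff.
interface SendRequest {
  target: string;      // channel or user the message was addressed to
  text: string;        // message text the agent tried to send
  threadTs?: string;   // thread context, if the send was a reply
}

interface DraftRequest {
  target: string;
  initialText: string;
  threadTs?: string;
}

type DraftFn = (req: DraftRequest) => Record<string, unknown>;
type WarnFn = (message: string) => void;

// Opens the draft editor pre-filled with the send text and thread context,
// warns on the supplied channel (stderr in the CLI), and marks the result so
// callers can tell the send was redirected rather than posted.
function redirectSendToDraft(
  req: SendRequest,
  openDraft: DraftFn,
  warn: WarnFn,
): Record<string, unknown> {
  warn("safe mode: send was redirected to the draft editor; nothing was posted");
  const draft: DraftRequest = { target: req.target, initialText: req.text };
  if (req.threadTs !== undefined) draft.threadTs = req.threadTs;
  const result = openDraft(draft);
  return { ...result, safe_mode: true, redirected_from: "send" };
}
```

Injecting both the draft function and the warning sink lets a test assert on the pre-filled payload and the warning without launching a browser or capturing stderr.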

Adds an enforced human-in-the-loop mode (AGENT_SLACK_SAFE_MODE env var or global --safe-mode flag) for AI agent environments:
- message send is redirected to the browser draft editor with the text and thread context pre-filled; output includes safe_mode/redirected_from markers and a stderr warning. Flags the editor cannot represent (--attach, --blocks, --schedule, --schedule-in, --reply-broadcast) are rejected instead of silently dropped, and the CI direct-send shortcut is blocked so safe mode cannot be bypassed with CI=1.
- message edit and message delete are blocked with a clear error.
- Read operations and reactions are unchanged.

Implements feature 1 of stablyai#80.

AmethystLiang · 2026-06-12T07:15:49Z

gonna take a look in these two days :-)

AmethystLiang self-requested a review June 12, 2026 07:15

TheOutdoorProgrammer wants to merge 1 commit into stablyai:main from TheOutdoorProgrammer:feature/safe-mode
