You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
OmniTool's trajectory logging is valuable because each step can be inspected with screenshot, parsed screen elements, model/action output, and timing. OpenChrome already has stronger runtime recovery, journaling, audit logging, and action recording, but lacks a single opt-in visual evidence bundle tying perception, action, outcome, and recovery together.
This issue adds privacy-aware trajectory/evidence bundles for visual/harness workflows. The goal is not to log everything by default; it is to make post-merge verification and failure analysis reproducible when explicitly enabled.
Proposed Implementation
Add an opt-in evidence bundle writer for browser action/perception steps.
Suggested module:
src/observability/visual-trajectory.ts
integration points in vision_find, Ralph visual strategy, interact, and optionally act
artifact root under ~/.openchrome/trajectories/ or configured path
Bundle shape
Each event should be JSONL and path-based for binary artifacts:
npm run build && npm test -- --runInBand trajectory observability vision pass, plus full npm run build && npm test && npm run lint:tier before PR completion.
Verification (post-merge, via OpenChrome MCP)
Record artifacts under scripts/verify/omniparser-adoption-D-visual-trajectory/ and use an isolated trajectory directory.
This strengthens OpenChrome's harness and verification story without adding agent behavior. It is observability-only, opt-in, and explicitly privacy bounded.
Curated scope, overlap handling, and verification checklist
Scope classification
Canonical lane: visual observability bundles.
Primary deliverable: opt-in visual trajectory evidence bundles for grounding/debug verification.
Open PR: none currently linked; create a new PR only after checking for newer overlapping PRs.
Non-goal: privacy-invasive default screenshot capture, runtime decision changes, or replacing general trajectory bundles.
Tier:
core/ observability (opt-in artifact capture; privacy-safe defaults)PR target:
developSeries: OmniParser adoption D
Priority: P1 — required to debug visual grounding, reduce wandering, and verify regressions after merge
Related / Sequencing
Background
OmniTool's trajectory logging is valuable because each step can be inspected with screenshot, parsed screen elements, model/action output, and timing. OpenChrome already has stronger runtime recovery, journaling, audit logging, and action recording, but lacks a single opt-in visual evidence bundle tying perception, action, outcome, and recovery together.
This issue adds privacy-aware trajectory/evidence bundles for visual/harness workflows. The goal is not to log everything by default; it is to make post-merge verification and failure analysis reproducible when explicitly enabled.
Proposed Implementation
Add an opt-in evidence bundle writer for browser action/perception steps.
Suggested module:
src/observability/visual-trajectory.tsvision_find, Ralph visual strategy,interact, and optionallyact~/.openchrome/trajectories/or configured pathBundle shape
Each event should be JSONL and path-based for binary artifacts:
Controls
OPENCHROME_VISUAL_TRAJECTORY=1OPENCHROME_VISUAL_TRAJECTORY_DIR=...recordTrajectory: trueNon-goals
Acceptance Criteria
vision_findand Ralph visual fallback can emit JSONL entries with path-based screenshot/snapshot artifacts.npm run build && npm test -- --runInBand trajectory observability visionpass, plus fullnpm run build && npm test && npm run lint:tierbefore PR completion.Verification (post-merge, via OpenChrome MCP)
Record artifacts under
scripts/verify/omniparser-adoption-D-visual-trajectory/and use an isolated trajectory directory.Setup
Scenario 1 — enabled capture writes bounded JSONL and files
Pass: trajectory JSONL exists, references perception data, and does not inline images.
Scenario 2 — secrets are absent
Pass: fixture secret is absent from all trajectory artifacts and recorded responses.
Scenario 3 — disabled default writes nothing
Pass: no trajectory artifacts are written unless capture is enabled.
Cleanup
Directionality / Fit Check
This strengthens OpenChrome's harness and verification story without adding agent behavior. It is observability-only, opt-in, and explicitly privacy bounded.
Curated scope, overlap handling, and verification checklist
Scope classification
Overlap and conflict resolution
Implementation checklist
Success criteria
Post-merge OpenChrome live verification checklist