Proposal: separate Windows code and dogfooding

We have a few PRs proposing to add Windows support:
- https://github.com/OpenHands/software-agent-sdk/pull/1014
- https://github.com/OpenHands/software-agent-sdk/pull/1209
- https://github.com/OpenHands/software-agent-sdk/pull/1012

**Proposal:**

I think maybe we could we consider implementing this separately, separate from core agent-sdk code. In fact, there are two things here that I feel we could consider:
- separate code / loose coupling
- dogfooding / dedicated maintainer(s).

**1. Loose coupling:**

- maybe Windows-specific tools
- maybe Windows-specific package
- or better abstractions, at least separate files, initialized per platform
    - if we implement it directly into core.

I'm concerned about code like conditionals with Windows-specific logic in core implementation. Windows is _too different_ IMHO, which means we can expect much more special cases, much more `ifs` in random places. I do feel Windows support is a much more demanding feature than the current PRs, and I'm concerned that:
- the agent-sdk codebase is becoming more error-prone and harder to understand
- we're starting something we are not able to maintain.

IMHO we could perhaps think how to design and implement Windows support as _separated_ as possible from the rest of the logic... The [WindowsTerminal](https://github.com/OpenHands/software-agent-sdk/blob/c40789b3b36ea638ad3ac6263c340c63cabefb8f/openhands-tools/openhands/tools/terminal/terminal/windows_terminal.py) file is a good example: abstraction with platform-specific import. But the `if` for [a tool prompt](https://github.com/OpenHands/software-agent-sdk/pull/1012/files#diff-651a11fcfb97c6e524bcc0155c3140f900ae50c1adba4b70a343e9361c0ede0eR223) could be done differently. It's a prompt piece, we could have, for example, different templates/files. Or even subclass the definition, or maybe separate a component with the prompt, and subclass that.

**2. Dogfooding:**

As I noted in one of these PR, one detail here that seems relevant to me is that usually, it won't be the same people that work regularly on Windows and Linux/Mac. This matters at multiple levels:
- for one, no maintainer is working on Windows. We could test a PR (on a quick Windows), but how deep is that testing? Are we introducing support for something we don't maintain?
- this also means that when any contributor looks at a screen of such code in their editor, they never have at their disposal, in their sight, a screen of code, they have _half_ the screen. This may sound trivial, it's true, but if it's too many times, it affects ease of debugging/readability/maintainability IMHO.

(as it happens, I was re-reading Richard Gabriel's [Patterns of Software](https://www.dreamsongs.com/Files/PatternsOfSoftware.pdf), and this struck me: "_The primary feature for easy maintenance is locality: Locality is that characteristic of source code that enables a programmer to understand that source by looking at only a small portion of it."_ Now this refers to a different issue, but it is the kind of thing I value when I say the _whole screen_ of code I'm looking at should ideally be meaningful.)

I would suggest that we design it separate, and, publicize support when we have an interested maintainer who actually uses Windows. I'm concerned this is a too error-prone feature to be incidental.

Otherwise, I fear we are looking at a bunch of bug reports and anger, like we had in V0, if not more.

**Note:**

Please note also that if we start supporting Windows in the SDK, people will of course expect it in the CLI and Web UI too. In V0, Windows had been implemented only in LocalRuntime. But we weren't able to make understandable the distinction, so people expected it in CLI at least... Which hadn't been implemented, but that came across as "full of bugs", instead of "not supported".

I would love to hear some thoughts. I am aware people would like Windows support, and that's fine, I'm just suggesting that _how_ we do it matters IMO. Maybe we can do better this time around. 😅 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Proposal: separate Windows code and dogfooding #1210

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Proposal: separate Windows code and dogfooding #1210

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions