Alui/multiagent #141

luiarthur · 2025-12-18T21:35:38Z

No description provided.

luiarthur · 2025-12-18T21:41:51Z

@mikegros This draft demonstrates how to include the planning and execution agents in a supervisor agent (that I call Ursa).

Changes:

src/agents/excution_agent.py
- minor change to aid type checking
src/experimental/agents/multiagent.py contains
- a system prompt for the supervisor
- tool wrapped execution and planning agents
- an additional tool that handles splitting plan steps into different execution agent calls
tests/agents/test_multiagent/test_multiagent.py
- a test/demo

A good way to start reviewing the changes would be to pull the branch and then try to run test_multiagent.py by running from the root directory

pytest -s tests/agents/test_multiagent

mikegros · 2025-12-25T19:24:16Z

I think this looks great overall. A couple things:

I like having a main agent names Ursa
I think the type checking in the execute agent may be a little strict. I saw that you needed extra_tools to be a list of BaseTools - I assume that a StructuredTool counts as a BaseTool, but the add_tools capability also can take just plain functions, which is part of why the typing was originally a Callable. I think that was wrong (that we wanted it to be a Callable or some sort of tool object), but I think requiring just a BaseAgent might not be ideal (that said, extra_tools might be a weird thing to have anyway since we have the add tools method.

mikegros · 2025-12-25T19:25:40Z

src/ursa/agents/execution_agent.py

        llm: BaseChatModel,
        agent_memory: Optional[Any | AgentMemory] = None,
        log_state: bool = False,
-        extra_tools: Optional[list[Callable[..., Any]]] = None,


See my comment in the PR

luiarthur · 2026-01-05T23:39:23Z

@mikegros Thanks for reviewing! My main hesitation with this PR is I don't know if the memory is being properly handled. In this implementation, the Ursa agent retains memory; no other agents retain memory. Since the outputs of subagents are returned to the Ursa agent, I would assume there's no need to pass around the history to other agents. Does that make sense?

mikegros · 2026-01-05T23:46:02Z

@mikegros Thanks for reviewing! My main hesitation with this PR is I don't know if the memory is being properly handled. In this implementation, the Ursa agent retains memory; no other agents retain memory. Since the outputs of subagents are returned to the Ursa agent, I would assume there's no need to pass around the history to other agents. Does that make sense?

I will look at that closer, but as a quick response, you could give all the tool agents the same checkpointer database, as long as they all have their own thread_id (which is the way the CLI handles checkpointing). Then they could all keep a message history.

Alex's recent refactor got rid of the _action method and made invoke a method of each agent directly. Removed _action and this should pass the other CI tests. Co-authored-by: lui-arthur

There was one I missed on the planning agent. Co-authored-by: lui-arthur

…ittle more updating soon.

…iagent

- Passing checkpointer to the subagents so that they can keep a history. - Fixed a JSON Decode Error that could happen on the LLM output. - Removed some potential bad character passing to json.loads

mikegros

I didnt realize I had filled out this review but not submitted it. I think this is mostly good to merge. Especially since it is listed as "experimental" so if memory aspects aren't being handled ideally, that can be part of the iteration that happens between now and when we would make it no-longer-experimental.

mikegros · 2026-01-21T17:57:48Z

src/ursa/experimental/agents/multiagent.py

+    return call_agent
+
+
+class Ursa:


Should this inherit BaseAgent for usage metrics or anything else?

We talked about this, so I dont think we need to make any changes.

luiarthur added 21 commits December 4, 2025 16:56

add multiagent

8f8155b

plan_execute_tool

f3856df

yes

066cedc

yes

9b34694

yes

fb2d2e6

improve demo

659eeea

yes

4173c11

yes

aa68c1a

yes

983acba

yes

fe394d5

add todo for input/output control between agents

598309f

yes

79fc6cc

yes

5beaa26

yes

a118695

format

4518579

commit run.py

0500290

better print

6c6a54e

add multiagent test

3cd1993

remove dev

39f5d38

remove deep agent

4c8b8aa

yes

355b180

luiarthur added 2 commits December 18, 2025 14:44

add comments

52a4188

update model

187da3b

mikegros self-assigned this Dec 19, 2025

mikegros self-requested a review December 19, 2025 01:53

luiarthur and others added 4 commits December 22, 2025 17:27

dynamic llm in multiagent test

a8f4594

Update test_multiagent.py

d37333d

Update test_multiagent.py

ad8b21f

Small formatting update.

ffc2ebb

mikegros and others added 2 commits December 24, 2025 21:25

Small formatting update.

f30b7e5

Formatting

49dc312

mikegros reviewed Dec 25, 2025

View reviewed changes

default extra_tools to None

d00f2ba

luiarthur and others added 9 commits January 5, 2026 16:46

change default workspace

d0b4f7e

add space

8c15502

Fix to address failed test

8fbcc44

Alex's recent refactor got rid of the _action method and made invoke a method of each agent directly. Removed _action and this should pass the other CI tests. Co-authored-by: lui-arthur

Missed one _action

c8de7b2

There was one I missed on the planning agent. Co-authored-by: lui-arthur

Merge branch 'main' into alui/multiagent

ca1ddc0

Small update toward bringing up to date with other PRs. I will do a l…

52b6854

…ittle more updating soon.

Merge branch 'main' into alui/multiagent

7a25cbc

Merge branch 'alui/multiagent' of github.com:lanl/ursa into alui/mult…

57d1285

…iagent

Small updates

d1f393e

- Passing checkpointer to the subagents so that they can keep a history. - Fixed a JSON Decode Error that could happen on the LLM output. - Removed some potential bad character passing to json.loads

luiarthur marked this pull request as ready for review January 21, 2026 19:25

mikegros reviewed Jan 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Alui/multiagent #141

Alui/multiagent #141

Uh oh!

luiarthur commented Dec 18, 2025

Uh oh!

luiarthur commented Dec 18, 2025

Uh oh!

mikegros commented Dec 25, 2025

Uh oh!

mikegros Dec 25, 2025

Uh oh!

luiarthur commented Jan 5, 2026

Uh oh!

mikegros commented Jan 5, 2026

Uh oh!

mikegros left a comment

Uh oh!

mikegros Jan 21, 2026

Uh oh!

mikegros Jan 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Alui/multiagent #141

Are you sure you want to change the base?

Alui/multiagent #141

Uh oh!

Conversation

luiarthur commented Dec 18, 2025

Uh oh!

luiarthur commented Dec 18, 2025

Uh oh!

mikegros commented Dec 25, 2025

Uh oh!

mikegros Dec 25, 2025

Choose a reason for hiding this comment

Uh oh!

luiarthur commented Jan 5, 2026

Uh oh!

mikegros commented Jan 5, 2026

Uh oh!

mikegros left a comment

Choose a reason for hiding this comment

Uh oh!

mikegros Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

mikegros Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants