Support Issue #4
User Simulation for Multi-Turn RL
We propose adding a user simulator to an RL framework to create realistic, varied user interactions for better training and testing.
Motivation:
The motivation for modifying the user simulation mechanism is to create a more realistic and flexible environment for training and evaluating AI agents. The previous approach relied on static or overly simplistic user feedback, which limited the diversity and authenticity of simulated interactions. By enhancing the simulation, we aim to better mimic real-world user behavior, improve the robustness of agent training, and enable more accurate benchmarking of agent performance.
Key Points:
Persona-Based Feedback:
The new mechanism samples user personas from a configurable dataset, allowing simulated feedback to reflect a variety of user backgrounds and preferences. This increases the diversity and realism of the feedback provided to the agent.
Dynamic Feedback Generation:
Instead of using only pre-defined feedback, the system now leverages an LLM (e.g., DeepSeek) to generate contextually relevant, concise, and constructive feedback based on the persona and the agent’s response. This makes the feedback more adaptive and nuanced.
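A sketch of this flow, with the LLM call injected as a callable so the fallback path is explicit (the prompt wording, function names, and fallback list below are illustrative assumptions, not the actual prompt):

```python
import random

# Hypothetical fallback messages used when the LLM is unavailable.
FALLBACK_FEEDBACKS = [
    "Could you make the answer more concise?",
    "That mostly works, but please double-check the details.",
]


def build_feedback_prompt(persona: dict, agent_response: str) -> str:
    """Compose the instruction sent to the feedback LLM (assumed format)."""
    return (
        f"You are a user with this persona: {persona}.\n"
        f"The assistant replied:\n{agent_response}\n"
        "Give one concise, constructive piece of feedback, in character."
    )


def generate_feedback(persona: dict, agent_response: str, llm_call=None) -> str:
    """Ask the LLM for persona-conditioned feedback; fall back on failure.

    `llm_call` is any callable mapping a prompt string to a completion
    string (e.g. a thin wrapper around a DeepSeek client).
    """
    if llm_call is not None:
        try:
            return llm_call(build_feedback_prompt(persona, agent_response)).strip()
        except Exception:
            pass  # network or model failure: degrade to a canned message
    return random.choice(FALLBACK_FEEDBACKS)
```

Injecting the client as a callable also makes the feedback path trivially testable without a live model.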
Configurable and Extensible:
The simulation parameters, such as feedback probability, persona dataset, and fallback feedback messages, are now easily configurable via YAML files. This design allows straightforward extension and customization for different research or application needs.
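A configuration file along these lines might look as follows; the key names and values are a hypothetical illustration of the parameters listed above, not the actual schema:

```yaml
user_simulation:
  feedback_probability: 0.5          # chance that a turn receives simulated feedback
  persona_dataset: data/personas.jsonl
  llm:
    provider: deepseek               # any OpenAI-compatible endpoint could be swapped in
    model: deepseek-chat
  fallback_feedbacks:
    - "Could you make the answer more concise?"
    - "Please double-check the details."
```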
Seamless Integration:
The new mechanism is integrated into the agent’s workflow, ensuring that feedback is only provided when required (controlled by a flag), and that the feedback is appended to the next observation in a way that is compatible with the agent’s prompt structure.