Inconsistent order of execution and StepCount

**Describe the bug**

Order of execution between FixedUpdate and OnEpisodeBegin is different, depending on how episode ended/started.
In the first episode after running the game, FixedUpdate with StepCount == 0 is called before OnEpisodeBegin, causing incorrect reward and possible errors due to incomplete initialization.
This would be less of an issue if this was consistent with other episodes, but in an episode after MaxStep was reached, it is different and FixedUpdate called after OnEpisodeBegin.
I have not tested what happens with EndEpisode, but this might also be different.

**To Reproduce**

1. Open CrawlerAgent script
2. Add changes to the script (described below)
3. Set MaxStep to small number, for example `5`
4. Disable all copies of Agent except one
5. Enable "Pause" and then click "Play"
6. Click "Step" button a couple of times, until second episode starts
7. See in logs: the order is not consistent, and OnEpisodeBegin already has reward from FixedUpdate

**Changes to the Crawler environment**

```
    void FixedUpdate()
    {
        Debug.Log($"FixedUpdate: step={StepCount}");
        AddReward(1);
```

```
    public override void OnEpisodeBegin()
    {
        Debug.Log($"OnEpisodeBegin: step={StepCount}, reward={GetCumulativeReward()}");
```

**Console logs / stack traces / screenshots**

I waited a couple of seconds between each "step" click, so that you can see which operations were in one frame.
In first case OnEpisodeBegin called after StepCount=0 (not before!)
In the second case immediately after StepCount=4 (not before StepCount=0)
![Image](https://github.com/user-attachments/assets/d11edafb-fbc9-4bc5-b0a5-96094c082fbf)

**Environment (please complete the following information):**
- Unity Version: Unity 6000.0.26f1
- OS + version: Windows 11
- _ML-Agents version_: release_22 / 3.0.0
- _Torch version_: 2.2.2+cu121
- _Environment_: Crawler


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Inconsistent order of execution and StepCount #6190

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Inconsistent order of execution and StepCount #6190

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions