Update Gymnasium to v1.0.0 #1837

pseudo-rnd-thoughts · 2024-02-13T17:12:50Z

This PR updates SB3 to Gymnasium v1.0; see the release notes for all the changes.

closes #2023

SB3 contrib PR: Stable-Baselines-Team/stable-baselines3-contrib#261
RL Zoo PR: DLR-RM/rl-baselines3-zoo#475

Motivation and Context

Gymnasium is the core API used in SB3, so it would be helpful for SB3 to use the latest version. SB3 also provides a great testing ground to check that the Gymnasium release works as intended.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Note: You can run most of the checks using make commit-checks.

Note: we are using a maximum length of 127 characters per line

pseudo-rnd-thoughts · 2024-02-13T18:36:40Z

There are only two main issues to resolve, plus the need to rewrite VecVideoRecorder:

As we have removed Env.__getattr__, we need to switch to Env.get_wrapper_attr
Registration of external environments (e.g., Atari) is not automatic, so a hack was added to fix it
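
The first point can be sketched with a small compatibility helper. This is only an illustration: `InnerEnv` and `NewStyleWrapper` are toy stand-ins that mimic a wrapper without implicit attribute forwarding, and `get_env_attr` is a hypothetical helper, not code from this PR. The only real API assumed is that environments expose `get_wrapper_attr(name)`.

```python
class InnerEnv:
    """Toy stand-in for a base environment (not a Gymnasium class)."""

    max_steps = 200


class NewStyleWrapper:
    """Toy stand-in for a wrapper that does NOT forward attributes implicitly."""

    def __init__(self, env):
        self.env = env

    def get_wrapper_attr(self, name):
        # Look on this wrapper first, then walk down the wrapper stack.
        if name in self.__dict__ or hasattr(type(self), name):
            return getattr(self, name)
        if hasattr(self.env, "get_wrapper_attr"):
            return self.env.get_wrapper_attr(name)
        return getattr(self.env, name)


def get_env_attr(env, name):
    """Hypothetical helper: prefer get_wrapper_attr, fall back to plain getattr."""
    if hasattr(env, "get_wrapper_attr"):
        return env.get_wrapper_attr(name)
    return getattr(env, name)


wrapped = NewStyleWrapper(NewStyleWrapper(InnerEnv()))
print(hasattr(wrapped, "max_steps"))       # direct attribute access no longer works
print(get_env_attr(wrapped, "max_steps"))  # explicit lookup through the stack does
```

Because the helper only checks for the presence of `get_wrapper_attr`, the same call site should also work for objects that still expose attributes directly.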

pseudo-rnd-thoughts · 2024-02-13T19:10:20Z

The CI seems to have failed due to reasons unrelated to the version change

araffin · 2024-02-14T09:42:52Z

Thanks for the PR =)

As we have removed Env.__getattr__, then we need to update to use Env.get_wrapper_attr

if possible (and if not too hacky), I would add backward compat changes to handle both gymnasium 0.29 and 1.x.

pseudo-rnd-thoughts · 2024-02-14T10:28:23Z

Thanks for the PR =)

No worries, all the errors seem expected and no unexpected bugs are found

As we have removed Env.__getattr__, then we need to update to use Env.get_wrapper_attr

if possible (and if not too hacky), I would add backward compat changes to handle both gymnasium 0.29 and 1.x.

I believe the changes made should be backward compatible. Only VecVideoRecorder still needs to be fully updated / rewritten.

Kallinteris-Andreas · 2024-02-16T06:00:04Z

Assuming an environment that used gymnasium==0.29 is not broken by the gymnasium==1.0 update, it should just work without any additional compatibility work in SB3

araffin · 2024-02-16T07:33:13Z

Assuming an environment that used gymnasium==0.29 is not broken by the gymnasium==1.0 update, it should just work without any additional compatibility work in SB3

I'm talking about allowing people to use 0.29 with SB3.

I believe the changes made should be backward compatible.

mmh, I would double check the getattr() part, I remember it was warning the user

pseudo-rnd-thoughts · 2024-02-16T08:50:52Z

mmh, I would double check the getattr() part, I remember it was warning the user

If the code works with 1.0.0a1 then it will work with 0.29 but possibly not the other way around

pseudo-rnd-thoughts · 2024-02-19T14:04:31Z

@araffin I believe I have fixed all the issues except for updating VecVideoRecorder.
I don't know how SB3 works internally; would you be able to get one of your devs to update that?

araffin · 2024-02-19T14:26:20Z

I don't know how SB3 works internally; would you be able to get one of your devs to update that?

Currently, there is only one active dev (me...). Quentin (@qgallouedec) is helping me with answering questions and doing code reviews; for the rest, we have to rely on the community.

In the meantime, you could try running tests in SB3 contrib and RL Zoo with this branch, that should unveil other bugs/issues.

Kallinteris-Andreas · 2024-04-01T20:12:09Z

I tested this (from https://stable-baselines3.readthedocs.io/en/master/guide/examples.html#record-a-video):

import gymnasium as gym
from stable_baselines3.common.vec_env import VecVideoRecorder, DummyVecEnv

env_id = "CartPole-v1"
video_folder = "logs/videos/"
video_length = 100

vec_env = DummyVecEnv([lambda: gym.make(env_id, render_mode="rgb_array")])

obs = vec_env.reset()

# Record the video starting at the first step
vec_env = VecVideoRecorder(vec_env, video_folder,
                       record_video_trigger=lambda x: x == 0, video_length=video_length,
                       name_prefix=f"random-agent-{env_id}")

vec_env.reset()
for _ in range(video_length + 1):
    action = [vec_env.action_space.sample()]
    obs, _, _, _ = vec_env.step(action)
# Save the video
vec_env.close()

and I get this error

python test.py
Traceback (most recent call last):
  File "/home/intelligence-lab-pc4/Documents/kalli/test_mjc3/test.py", line 17, in <module>
    vec_env.reset()
  File "/home/intelligence-lab-pc4/Documents/kalli/test_mjc3/stable-baselines3/stable_baselines3/common/vec_env/vec_video_recorder.py", line 66, in reset
    self.start_video_recorder()
  File "/home/intelligence-lab-pc4/Documents/kalli/test_mjc3/stable-baselines3/stable_baselines3/common/vec_env/vec_video_recorder.py", line 78, in start_video_recorder
    self.video_recorder.capture_frame()
    ^^^^^^^^^^^^^^^^^^^
  File "/home/intelligence-lab-pc4/Documents/kalli/test_mjc3/stable-baselines3/stable_baselines3/common/vec_env/base_vec_env.py", line 420, in __getattr__
    return self.getattr_recursive(name)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/intelligence-lab-pc4/Documents/kalli/test_mjc3/stable-baselines3/stable_baselines3/common/vec_env/base_vec_env.py", line 445, in getattr_recursive
    attr = getattr(self.venv, name)
           ^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'DummyVecEnv' object has no attribute 'video_recorder'

I am using gymnasium==1.0.0a1 and SB3 at this PR's latest commit (as of March).
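
One plausible reading of the traceback is that `self.video_recorder` was never assigned (the monitor import was commented out earlier in this PR), so the lookup fell through to the wrapper's `__getattr__` and failed on the wrapped env instead. The toy classes below are hypothetical stand-ins, not SB3 code; they only reproduce the shape of that error message.

```python
class ToyVecEnv:
    """Toy stand-in for DummyVecEnv."""


class ToyVecWrapper:
    """Toy stand-in for a VecEnv wrapper that forwards unknown attributes."""

    def __init__(self, venv):
        self.venv = venv

    def __getattr__(self, name):
        # Only called when normal attribute lookup on the wrapper fails.
        return getattr(self.venv, name)


class ToyRecorder(ToyVecWrapper):
    def start(self):
        # video_recorder was never set on this object, so the lookup is
        # forwarded to the wrapped env, which does not have it either.
        return self.video_recorder


try:
    ToyRecorder(ToyVecEnv()).start()
except AttributeError as err:
    message = str(err)
    print(message)
```

The error names the inner env rather than the recorder wrapper, which is why the message points at `DummyVecEnv` even though the missing attribute belongs to the recorder.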

pseudo-rnd-thoughts · 2024-04-03T09:17:16Z

Only VecVideoRecorder still needs to be fully updated / rewritten.

@Kallinteris-Andreas Yes, this is the only part of the PR that still needs to be done.
I'm tempted to just copy and paste the old video recorder in as a solution. I might try to do this this evening.

qgallouedec · 2024-04-03T13:05:31Z

Feel free to ping me if necessary

pseudo-rnd-thoughts · 2024-10-09T12:20:59Z

If I understand correctly, SB3 has its own vector environment, VecEnv, and doesn't use the Gymnasium vector environment, so it is unaffected by this change.

araffin · 2024-10-09T17:48:26Z

is there anything blocking a new SB3 release

There are different things blocking:

time and priority from my side
testing with SB3 contrib and RL Zoo
backward compat and tests with 0.28.1
if possible, remove the copy-pasted VideoRecorder, or at least bring it to SB3 standards in terms of documentation/typing/tests

Kallinteris-Andreas · 2024-10-12T05:05:27Z

I will (try to) test backcompat for gymnasium==0.29.1.
Is there a reason you need the VideoRecorder to be inside SB3? Can't you just use the one directly from Gymnasium?
Unfortunately my time machine no longer works, so I cannot give you more time.

araffin · 2024-10-12T15:42:46Z

Is there a reason you need the VideoRecorder to be inside SB3? Can't you just use the one directly from Gymnasium?

we don't: #1837 (comment)

pseudo-rnd-thoughts · 2024-10-12T15:51:49Z

Is there a reason you need the VideoRecorder to be inside SB3? Can't you just use the one directly from Gymnasium?

SB3 doesn't use Gymnasium's VectorEnv; it has its own VecEnv and therefore its own wrappers, such as VecVideoRecorder. That wrapper is largely a wrapper around Gymnasium's old monitor code. However, as that code was removed in v1.0, SB3 needs either the original code, a stripped-down version, or a complete rewrite of the wrapper.

araffin · 2024-11-02T10:14:54Z

I've opened two other PRs:

and integrated the video recorder inside the VecVideoRecorder.

unit tests are passing except for trained agents in the RL Zoo (new version of envs).

What is missing for this PR is to update the changelog.

Kallinteris-Andreas · 2024-11-03T17:45:58Z

Are the unit tests failing for the same version of the environments?

araffin · 2024-11-04T07:22:35Z

Are the unit tests failing for the same version of the environments?

it's just that the env LunarLander-v2 doesn't exist anymore.

qgallouedec

Looks good :)

araffin · 2024-11-04T13:05:43Z

I've commented out the lunar lander envs in DLR-RM/rl-baselines3-zoo#475 (I need to update the trained agents), but there is still one failure: the parking-v0 obs space from highway-env was updated (float64 instead of float32) but the version number is the same.

araffin · 2024-11-05T13:49:35Z

@pseudo-rnd-thoughts there is another weird bug that just popped up with gymnasium v1: https://github.com/DLR-RM/rl-baselines3-zoo/actions/runs/11664748980/job/32475927244?pr=475

The saved inf is no longer the same?

pseudo-rnd-thoughts · 2024-11-05T14:27:06Z

@araffin Thanks for reporting that. I'm reading the logs, and I'm a bit confused by what is happening.
You are loading a CartPole-v1 model and finding that the observation space is different?
It seems to be related to Farama-Foundation/Gymnasium#1092
There was a time when we avoided inf bounds due to the environment checker, but I think this was reverted as we realised that the max int value causes more issues than inf.

Apologies for not noting this in the release notes. I'm not sure what we do about it now, any thoughts?
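
The mismatch discussed here can be illustrated with a toy model: a space saved with a large finite bound is not equal to one with inf bounds, so a strict equality check at load time rejects it. `ToyBox` and `spaces_match` are hypothetical stand-ins, not Gymnasium's Box or SB3's space check, and using the largest finite float32 as the old bound is an assumption made for the sake of the example.

```python
import math
import struct
from dataclasses import dataclass

# Largest finite float32 value, computed without NumPy (bit pattern 0x7F7FFFFF).
FLOAT32_MAX = struct.unpack("<f", bytes.fromhex("ffff7f7f"))[0]


@dataclass(frozen=True)
class ToyBox:
    """Toy stand-in for a Box space: just bounds and a dtype name."""

    low: tuple
    high: tuple
    dtype: str = "float32"


def spaces_match(saved: ToyBox, current: ToyBox) -> bool:
    """Strict equality, in the spirit of a load-time sanity check."""
    return saved == current


# Space stored with the model (finite bound) vs. space of the freshly created env.
saved_space = ToyBox(low=(-FLOAT32_MAX,), high=(FLOAT32_MAX,))
current_space = ToyBox(low=(-math.inf,), high=(math.inf,))

print(spaces_match(saved_space, current_space))    # bounds differ
print(spaces_match(current_space, current_space))  # identical spaces
```

The same kind of strict comparison would also flag a dtype change, such as a space moving from float32 to float64 without a version bump.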

araffin · 2024-11-05T16:02:05Z

You are loading a CartPole-v1 model and finding that the observation space is different?

yes, from

stable-baselines3/stable_baselines3/common/base_class.py

Lines 715 to 716 in 8f0b488

# Check if given env is valid
check_for_correct_spaces(env, data["observation_space"], data["action_space"])

some good sanity checks.

I see, then I need to update the trained agent (save and load it while overriding the obs space).

araffin · 2024-11-05T16:20:08Z

Should be fixed in DLR-RM/rl-trained-agents@eb1bd43, but in the meantime I found another issue.
Is there a way to resize the render window when using ale-py? (the render on my machine is way too big and the window cannot be adjusted, a quick fix is to use SubprocVecEnv and opencv display but that's not so nice...)

pseudo-rnd-thoughts · 2024-11-05T16:24:12Z

Glad the other issue is solved.

For ale-py, I'm surprised this is a new problem. Another user has recently asked for changes to the render size. I will investigate adding an argument for specifying the render window size.

pseudo-rnd-thoughts added 5 commits February 13, 2024 17:09

Update Gymnasium to v1.0.0a1

08e5f9a

Comment out gymnasium.wrappers.monitor (todo update to VideoRecord)

f73c08e

Fix ruff warnings

08d3ac9

Register Atari envs

eb55500

Update getattr to Env.get_wrapper_attr

686d1a0

pseudo-rnd-thoughts added 3 commits February 13, 2024 18:39

Reorder imports

da48aed

Fix seed order

b063f94

Fix collecting max_steps

6e11f93

pseudo-rnd-thoughts mentioned this pull request Feb 19, 2024

Update gymnasium dependencies to 1.0.0a1 Farama-Foundation/PettingZoo#1184

Open

Merge branch 'master' into gymnasium-1.0.0a1

7958dba

pseudo-rnd-thoughts mentioned this pull request Feb 28, 2024

Projects updated to v1.0.0 Farama-Foundation/Gymnasium#944

Open

20 tasks

Merge branch 'master' into gymnasium-1.0.0a1

d7ed302

Kallinteris-Andreas mentioned this pull request Mar 25, 2024

[Question] Problems with Collisions in Pusher-v4 Farama-Foundation/Gymnasium#950

Closed

pseudo-rnd-thoughts added 4 commits April 3, 2024 22:26

Copy and paste video recorder to prevent the need to rewrite the vec vide recorder wrapper

39f0900

Use typing.List rather than list

2f403da

Merge branch 'master' into gymnasium-1.0.0a1

1f8c554

Fix env attribute forwarding

c32e198

araffin reviewed Apr 4, 2024

araffin mentioned this pull request Oct 10, 2024

[Feature Request] request title jax API and gymnax #2020

Closed

2 tasks

araffin mentioned this pull request Oct 19, 2024

[Feature Request] When are you planning to upgrade to Gymnasium v1.0.0 #2023

Closed

2 tasks

araffin added the help wanted Help from contributors is welcomed label Oct 29, 2024

araffin added 3 commits November 2, 2024 08:33

Merge branch 'master' into gymnasium-1.0.0a1

3bf93fb

Fix github CI yaml

3b48d27

Run gym 0.29.1 on python 3.10

0f97c3b

araffin mentioned this pull request Nov 2, 2024

Add support for gymnasium v1.0 Stable-Baselines-Team/stable-baselines3-contrib#261

Merged

15 tasks

araffin added 2 commits November 2, 2024 10:08

Update lower bounds

1b10cef

Integrate video recorder

45cd5f8

araffin mentioned this pull request Nov 2, 2024

Add support for gymnasium v1.0 DLR-RM/rl-baselines3-zoo#475

Merged

13 tasks

araffin added 2 commits November 3, 2024 18:31

Remove ordered dict

cba9a2c

Update changelog

df5fdaa

araffin requested a review from qgallouedec November 3, 2024 18:00

qgallouedec approved these changes Nov 4, 2024

araffin approved these changes Nov 4, 2024

araffin merged commit 8f0b488 into DLR-RM:master Nov 4, 2024
4 checks passed
