behavior of on_episode_end() in the ReplayBuffer class when I am only adding a select number of time-steps per interaction. #29
WilderLavington
started this conversation in
General
Replies: 1 comment 1 reply
-
@WilderLavington The answer is perhaps "No", but I recommend that you call it. Generally speaking, we developers assume users call In detail, |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I am currently using the Replay buffer class for storing image data and wanted to know what the effects of calling on_episode_end() for separate buffers of examples. In particular, I am storing examples with reward = 1 in one buffer and reward equal to zero in another. In this environment, reward is equal to 1 only at the end of the episode, and thus I only add one "negative" transition tuple to my replay buffer per episode. In this case, do I still need to call on_episode_end()?
Beta Was this translation helpful? Give feedback.
All reactions