We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
In _sample_action of OffPolicyAlgorithm class, self.predict function is called. But episode_start flag is never set for any off policy algorithms.
_sample_action
OffPolicyAlgorithm
self.predict
episode_start
No response
The text was updated successfully, but these errors were encountered:
Hello, that's correct because there is current only RecurrentPPO that make use of states (LSTM states) and episode starts (to reset the states).
RecurrentPPO
states
Sorry, something went wrong.
No branches or pull requests
🐛 Bug
In
_sample_action
ofOffPolicyAlgorithm
class,self.predict
function is called. Butepisode_start
flag is never set for any off policy algorithms.To Reproduce
No response
Relevant log output / Error message
No response
System Info
No response
Checklist
The text was updated successfully, but these errors were encountered: