-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] RAINBOW #622
Comments
Any updates about the rainbow implementation? |
Contributions are welcomed ;) (if you do so, please read the contributing guide from SB3-Contrib, it explains how to test new algorithms) EDIT: QR-DQN is available in SB3-Contrib, and double DQN is also available if needed (currently as an exercise) |
Hi there, I hope you're having a great day. I was reading through #487 and it leads me here. I see that the double-DQN is now available as an exercise. I looked through sb3 and sb3-contrib repositories, but I couldn't find any code or reference to it. Would you be able to point me in the direction of the material or pseudo code that explains the implementation of double-DQN in sb3? I'd really appreciate it. Thank you! |
Hi, |
If you read this issue and the linked one, you will see components that have been implemented but not fully tested.
Yes, we need help for #1622 to debug and be able to reproduce the original results (and maybe re-write this implementation). As I wrote in #1622 (comment), the plan is to keep SB3 DQN implementation as-is (vanilla) but have a RAINBOW implementation with all the tricks too on the side. |
Can we still use implementation of DDQN, dueling dqn , priorized replay from stable baselines 2? |
Important Note: We do not do technical support, nor consulting and don't answer personal questions per email.
Please post your question on the RL Discord, Reddit or Stack Overflow in that case.
🚀 Feature
Implement RAINBOW (DQN + all extensions: noisy-net, double dqn, dueling, c51, prioritized experience replay).
closes #487 if it is implemented.
Motivation
This would allow to have DQN with all extensions while keeping DQN code simple.
The text was updated successfully, but these errors were encountered: