pip install -r requirements.txt
python env.py
- Python 3.10+
- NumPy
- PyTorch
- Gymnasium
-
What is Q-Learning?
-
A Model-Free Reinforcement Learning algorithm to learn the Quality value of taking an Action in a particular State.
-
Following the Bellman update equation, we can train an agent to take high-quality actions that maximize the cumulative reward it receives.
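The Bellman update referred to here can be written in its standard form (reconstructed, since the original equation image is not in this text; symbols are the conventional ones: learning rate α, discount factor γ):

```latex
Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \left[ r_{t+1} + \gamma \max_{a'} Q(s_{t+1}, a') - Q(s_t, a_t) \right]
```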
-
We construct a Quality table indexed by state and action, and iteratively update its entries with the Bellman update as rewards are observed.
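A minimal sketch of the tabular update in NumPy (sizes, hyperparameter values, and the `q_update` helper are illustrative, not from this repo):

```python
import numpy as np

# Hypothetical sizes and hyperparameters for illustration
n_states, n_actions = 16, 4
alpha, gamma = 0.1, 0.99  # learning rate and discount factor (assumed values)

Q = np.zeros((n_states, n_actions))  # the Quality table

def q_update(state, action, reward, next_state, terminated):
    """One tabular Q-learning (Bellman) update."""
    # No bootstrapping from a terminal state
    target = reward if terminated else reward + gamma * Q[next_state].max()
    Q[state, action] += alpha * (target - Q[state, action])

# One example transition: Q[0, 1] moves a step toward the target
q_update(state=0, action=1, reward=1.0, next_state=2, terminated=False)
```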
-
-
Applying Deep Learning
-
Instead of storing a table of Q-values, use a neural network to approximate the Q function.
Why? When dealing with extremely large or continuous state spaces, storing the Quality function in a table is no longer feasible.
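A sketch of such a function approximator in PyTorch (the class name, layer sizes, and dimensions are illustrative assumptions, not this repo's actual network):

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Approximates Q(s, .): maps a state vector to one Q-value per action."""
    def __init__(self, state_dim, n_actions, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),  # one output per action
        )

    def forward(self, state):
        return self.net(state)

# Illustration: a batch of 32 states with 8 features each (sizes assumed)
q_net = QNetwork(state_dim=8, n_actions=4)
q_values = q_net(torch.randn(32, 8))  # shape: (32, 4)
```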
-
Replay Buffer
- Represents the agent's memory
- Store transitions on every step (state, action, reward, next_state, terminated)
- Circular insertion
- Samples batches of transitions for neural network training
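The bullets above can be sketched as a small class (names and structure are my own, not necessarily this repo's implementation):

```python
import random
from collections import namedtuple

Transition = namedtuple("Transition", "state action reward next_state terminated")

class ReplayBuffer:
    """Fixed-capacity memory with circular insertion and uniform sampling."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.memory = []
        self.pos = 0  # next write index; wraps around once the buffer is full

    def push(self, state, action, reward, next_state, terminated):
        if len(self.memory) < self.capacity:
            self.memory.append(None)
        self.memory[self.pos] = Transition(state, action, reward, next_state, terminated)
        self.pos = (self.pos + 1) % self.capacity  # circular insertion

    def sample(self, batch_size):
        """Uniformly sample a batch of stored transitions for training."""
        return random.sample(self.memory, batch_size)

    def __len__(self):
        return len(self.memory)

# Illustration: store a few transitions, then sample a training batch
buf = ReplayBuffer(capacity=1000)
for t in range(4):
    buf.push([float(t)], 0, 1.0, [float(t + 1)], False)
batch = buf.sample(2)  # list of 2 random Transition tuples
```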
-
New Update Equation:
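The new objective (reconstructed here in its standard DQN form, since the original equation image is not in this text; θ are the online network's parameters and θ⁻ the target network's):

```latex
L(\theta) = \mathbb{E}\left[ \left( r + \gamma \max_{a'} Q(s', a'; \theta^{-}) - Q(s, a; \theta) \right)^{2} \right]
```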
-
Note: Mean Squared Error is used for the loss, with gradients computed by backpropagation and applied via Stochastic Gradient Descent
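One training step on a sampled batch might look like this (stand-in networks, batch sizes, and tensor shapes are assumed for illustration):

```python
import torch
import torch.nn as nn

gamma = 0.99
q_net = nn.Linear(4, 2)       # stand-in Q-network: 4 state features, 2 actions
target_net = nn.Linear(4, 2)  # stand-in target network (a delayed copy of q_net)
optimizer = torch.optim.SGD(q_net.parameters(), lr=1e-3)

# A batch of 8 transitions, as sampled from the replay buffer (shapes assumed)
states = torch.randn(8, 4)
actions = torch.randint(0, 2, (8, 1))
rewards = torch.randn(8)
next_states = torch.randn(8, 4)
terminated = torch.zeros(8)

q_sa = q_net(states).gather(1, actions).squeeze(1)  # Q(s, a) for the taken actions
with torch.no_grad():
    next_max = target_net(next_states).max(dim=1).values  # max_a' Q(s', a')
    target = rewards + gamma * (1 - terminated) * next_max

loss = nn.functional.mse_loss(q_sa, target)  # Mean Squared Error
optimizer.zero_grad()
loss.backward()    # backpropagation
optimizer.step()   # Stochastic Gradient Descent step
```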
-
-
Modifications
-
Double Deep Q Networks
Purpose: Stabilize training by reducing the overestimation of Q-values (the online network selects the next action; the target network evaluates it)
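The Double DQN target swaps the single max for a select-then-evaluate split; a sketch with stand-in networks (shapes and names assumed):

```python
import torch
import torch.nn as nn

gamma = 0.99
online_net = nn.Linear(4, 2)  # stand-in online Q-network
target_net = nn.Linear(4, 2)  # stand-in target network

next_states = torch.randn(8, 4)
rewards = torch.randn(8)
terminated = torch.zeros(8)

with torch.no_grad():
    # Double DQN: the online network *chooses* the next action...
    best_actions = online_net(next_states).argmax(dim=1, keepdim=True)
    # ...but the target network *evaluates* it, curbing overestimation.
    next_q = target_net(next_states).gather(1, best_actions).squeeze(1)
    target = rewards + gamma * (1 - terminated) * next_q
```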
-
Double Dueling Deep Q Networks
Purpose: Faster convergence by separately estimating the state value V(s) and per-action advantages A(s, a)
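The dueling head can be sketched as follows (class name, layer sizes, and the mean-subtraction aggregation follow the standard formulation, not necessarily this repo's exact code):

```python
import torch
import torch.nn as nn

class DuelingQNetwork(nn.Module):
    """Splits the head into a state-value stream V(s) and an advantage stream A(s, a)."""
    def __init__(self, state_dim, n_actions, hidden=128):
        super().__init__()
        self.features = nn.Sequential(nn.Linear(state_dim, hidden), nn.ReLU())
        self.value = nn.Linear(hidden, 1)              # V(s)
        self.advantage = nn.Linear(hidden, n_actions)  # A(s, a)

    def forward(self, state):
        h = self.features(state)
        v = self.value(h)
        a = self.advantage(h)
        # Q(s, a) = V(s) + A(s, a) - mean_a A(s, a); the mean-subtraction
        # makes the V/A decomposition identifiable
        return v + a - a.mean(dim=1, keepdim=True)

# Illustration: same input/output shapes as a plain Q-network (sizes assumed)
net = DuelingQNetwork(state_dim=8, n_actions=4)
q = net(torch.randn(32, 8))  # shape: (32, 4)
```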
-
References:
- Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D. & Riedmiller, M. (2013). Playing Atari with Deep Reinforcement Learning.
- van Hasselt, H., Guez, A. & Silver, D. (2015). Deep Reinforcement Learning with Double Q-learning.
- Wang, Z., Schaul, T., Hessel, M., van Hasselt, H., Lanctot, M. & de Freitas, N. (2015). Dueling Network Architectures for Deep Reinforcement Learning.