Skip to content

Latest commit

 

History

History
8 lines (7 loc) · 228 Bytes

File metadata and controls

8 lines (7 loc) · 228 Bytes

Chapter 11 — Policy Gradient Fundamentals (REINFORCE)

Quickstart:

pip install -r ch11_policy_gradient/requirements.txt
pytest -q ch11_policy_gradient/tests
python -m ch11_policy_gradient.scripts.run_bandit_demo