Skip to content

Latest commit

 

History

History
28 lines (26 loc) · 818 Bytes

README.md

File metadata and controls

28 lines (26 loc) · 818 Bytes

RL-Coursera

서튼 교수 강화학습 교재 공부

  • Sutton & Barto, Reinforcement learning: an introduction, 2e
  • 질문(?)과 코멘트(!) 작성
  • 시뮬레이션 코드 리팩토링 (CleanRL 처럼)

Contents

  1. introduction

part I. tabular solution methods

  1. multi-armed bandits
  2. finite Markov decision processes
  3. dynamic programming
  4. Monte Carlo methods
  5. temporal-difference learning
  6. n-step bootstrapping
  7. planning and learning with tabular methods

part II. approximate solution methods

  1. on-policy prediction with approximation
  2. on-policy control with approximation
  3. off-policy methods with approximation
  4. eligibility traces
  5. policy gradient methods

part III. looking deeper

  1. psychology
  2. neuroscience
  3. applications and case studies 17.frontiers