Skip to content

hoodong/RL-Sutton

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 

Repository files navigation

RL-Coursera

서튼 교수 강화학습 교재 공부

  • Sutton & Barto, Reinforcement learning: an introduction, 2e
  • 질문(?)과 코멘트(!) 작성
  • 시뮬레이션 코드 리팩토링 (CleanRL 처럼)

Contents

  1. introduction

part I. tabular solution methods

  1. multi-armed bandits
  2. finite Markov decision processes
  3. dynamic programming
  4. Monte Carlo methods
  5. temporal-difference learning
  6. n-step bootstrapping
  7. planning and learning with tabular methods

part II. approximate solution methods

  1. on-policy prediction with approximation
  2. on-policy control with approximation
  3. off-policy methods with approximation
  4. eligibility traces
  5. policy gradient methods

part III. looking deeper

  1. psychology
  2. neuroscience
  3. applications and case studies 17.frontiers

About

Coursera RL lectures

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published