TD-Lambda-Online-Learning

In this project we are going to implement td lambda learning. There are two approaches to implement the td lambda learning: 1. Forward or the theoretical view (Offline learning) 2.Backward or practical view (Online Learning) Here we used simple random walk envoironment to see the performance of td lambda with different lambda numbers and different alphas. Since the actual value of the states are available for us, we used RMS to calculate the error rate. In the sutton's book it is mentioned that the backward view is equivalent to the forward view. So this implementation is enough in most of the cases.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
Untitled.ipynb		Untitled.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TD-Lambda-Online-Learning

About

Releases

Packages

Languages

mahyarsadeghi/TD-Lambda-Online-Learning

Folders and files

Latest commit

History

Repository files navigation

TD-Lambda-Online-Learning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages