Skip to content

Conversation

GrosSacASac
Copy link
Owner

No description provided.

much better with dealing with negative rewards,
much worse reduceStateAndActionSeeAllDistance)(learn)
better with learnWithAverage
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant