Pessimistic Agents are ask-for-help reinforcement learning agents that offer guarantees of:
- Eventually outperforming the mentor
- Eventually stopping querying the mentor
- Never causing unprecedented events to happen, with arbitrarily high probability
In this repository, we investigate their behaviour in the faithful setting, and explore approximations that allow them to be used in real-world RL problems.
Overview - see individual README.md files for more detail.
We introduce a tractable implementation of Pessimistic Agents. We approximate the Bayesian world and mentor models with a distribution over the epistemic uncertainty of Q values. By acting on a pessimistic (low) quantile of this distribution, we demonstrate the expected behaviour and safety results for a Pessimistic Agent.
| Work | Status |
|---|---|
| Finite state Q Table proof of concept | |
| Continuous deep Q learning implementation | |
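As a rough illustration of the quantile idea only (not the repository's actual API; the `PessimisticQTable` class and its parameters are hypothetical), a tabular agent might keep an ensemble of Q tables as a stand-in for the epistemic distribution, act on a low quantile of it, and defer to the mentor when even the best pessimistic value looks poor:

```python
import numpy as np

class PessimisticQTable:
    """Sketch: an ensemble of Q tables stands in for a distribution over
    the epistemic uncertainty of Q values; the agent acts on a low
    quantile of it, and asks the mentor when all actions look bad."""

    def __init__(self, n_states, n_actions, n_estimates=10, quantile=0.1,
                 lr=0.1, gamma=0.99, rng=None):
        self.q = np.zeros((n_estimates, n_states, n_actions))
        self.quantile = quantile
        self.lr, self.gamma = lr, gamma
        self.rng = rng or np.random.default_rng()

    def pessimistic_q(self, state):
        # Low quantile across the ensemble axis: the pessimistic Q values.
        return np.quantile(self.q[:, state, :], self.quantile, axis=0)

    def act(self, state, mentor, defer_threshold=0.0):
        q_pess = self.pessimistic_q(state)
        if q_pess.max() < defer_threshold:
            # Every action looks too risky under pessimism: ask for help.
            return mentor(state), True
        return int(q_pess.argmax()), False

    def update(self, s, a, r, s_next):
        # Independently noisy updates keep the ensemble members spread
        # out, so the quantile reflects remaining epistemic disagreement.
        for i in range(len(self.q)):
            target = r + self.gamma * self.q[i, s_next].max()
            self.q[i, s, a] += self.lr * self.rng.random() * (target - self.q[i, s, a])
```

The deferral flag returned by `act` lets a caller track how often the mentor is queried, which should decay over time if the guarantees above hold.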
Implement and investigate a faithful representation of a Bayesian Pessimistic Agent.
| Work | Status |
|---|---|
| Environment | |
| Agent | |
On hold; some progress has been made in implementing the environment and mentor models.
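For the faithful (Bayesian) setting, conjugate updates keep exact posteriors tractable in finite MDPs. A minimal sketch of one such world model, assuming a finite state space (the `DirichletWorldModel` name and interface are illustrative, not the code in this repository):

```python
import numpy as np

class DirichletWorldModel:
    """Exact Bayesian model of a finite MDP's transitions: one Dirichlet
    posterior per (state, action) pair, updated by counting transitions."""

    def __init__(self, n_states, n_actions, prior=1.0, rng=None):
        # Dirichlet concentration parameters, initialised to prior pseudo-counts.
        self.alpha = np.full((n_states, n_actions, n_states), prior)
        self.rng = rng or np.random.default_rng()

    def update(self, s, a, s_next):
        # Conjugacy makes the posterior update a simple count increment.
        self.alpha[s, a, s_next] += 1.0

    def sample_transitions(self, s, a):
        # One plausible transition distribution drawn from the posterior;
        # a pessimistic agent can evaluate plans against many such draws
        # and keep the worst-case (low-quantile) value.
        return self.rng.dirichlet(self.alpha[s, a])
```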
Apply the pessimism approximation to neural-network-based deep Q-learning RL agents.
| Work | Status |
|---|---|
| DQN proof of concept | |
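One plausible way to carry the quantile approximation over to deep Q learning is an ensemble of Q networks whose disagreement proxies epistemic uncertainty. This sketch assumes PyTorch; the `EnsembleDQN` name and architecture are invented for illustration, not confirmed by the repository:

```python
import torch
import torch.nn as nn

class EnsembleDQN(nn.Module):
    """An ensemble of Q networks; acting on a low quantile of their
    predictions approximates pessimism over epistemic uncertainty."""

    def __init__(self, obs_dim, n_actions, n_members=5, quantile=0.1):
        super().__init__()
        self.members = nn.ModuleList([
            nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(),
                          nn.Linear(64, n_actions))
            for _ in range(n_members)
        ])
        self.quantile = quantile

    def forward(self, obs):
        # Stack per-member Q estimates: (n_members, batch, n_actions).
        qs = torch.stack([m(obs) for m in self.members])
        # Pessimistic Q values: a low quantile across the ensemble axis.
        return torch.quantile(qs, self.quantile, dim=0)

    def act(self, obs):
        # Greedy action under the pessimistic Q values for one observation.
        with torch.no_grad():
            return int(self.forward(obs.unsqueeze(0)).argmax(dim=-1))
```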
With Anaconda:

```bash
conda env create -f torch_env_cpu.yml
```