Upgrade to RLLib 1.3 and improvements around amTFT #12

Manuscrit · 2022-01-21T13:26:20Z

No description provided.

Remove use lock_replay during training (must not use it in LTFT). Create submodule marltoolbox.utils.log. Move methods to summarize a model into an helper class. use before_init_loss instead of after_init (policy class factory arg).

…mall fixes.

Fix some tests. Add augmented R2D2. Add examples with R2D2. Add some end to end tests for amTFT vs exploiter, meta game and R2D2.

Fix speed performance issue in entropy computation. Some refactoring of configs and hyperparameters (DQN, R2D2, LOLA-PG).

Tune HP for R2D2. Few corrections. Few style changes.

Partial refactoring of the coin game envs tests. Add logging & plot of exploration temperature.

…than 2. Add rolling average for the LOLA-PG reward centering and normalization.

- punishment helped in CGs - customizable matrix game - coop coins log in vectorized MCPCG

…, replicator dynamic)

Add the "punishment helped" option in vectorized_ssd_mm_coin_game.py. Add new plots by defaults in cross an self play evaluation. Add script to plot bar chart summary figure.

…ns (instead of 2 or 3 for LOLA-Exact and instead of 2 for SOS-Exact)

Renaming module tune_analysis into exp_analysis. SOS: add a few logs Welfare_coordinator: fix ordering in the set of welfare function sets. Env mixing: started adding NPlayersNContinuousActionsInfoMixin (not debugged) Env: started adding the SimpleBargaining env (not debugged) Env: removed the MixedMotiveCoinGame (not the MCPCG/SSDMMCG env) LOLA-PG: allow option to filter runs by speeds achieved LOLA-PG: both players can pick the same coin always Meta games: add the negotiation environment. Meta games: add the uniform solver. Scripts: add script to add create bar charts Scripts: update script to compute mean and std err from saved results. Scripts: add script to plot meta policies & joint meta policies experiment_analysis module: added a few helper functions FrozenTunePolicy: add callback to call policy reset at the start of each episode. PSRO: add PSRO hardcoded PSRO: started to add PSRO from OpenSpiel

Add the OpenSpiel version of the SimpleBargaining env. Add logs to compute the % of coordination in the welfare_coordination.py Add script to generate the scatter plots

Add cross experiments between meta solvers. Add evaluation of exploitability.

Maxime Riché and others added 20 commits April 15, 2021 15:28

Upgrade to use RLLib master (almost v1.3).

9f8a1b2

Remove use lock_replay during training (must not use it in LTFT). Create submodule marltoolbox.utils.log. Move methods to summarize a model into an helper class. use before_init_loss instead of after_init (policy class factory arg).

Use _compute_actions_helper instead of compute_actions in policies. S…

4e0bf57

…mall fixes.

Fixing some tests, _comute_action_helper, update_target

9888b75

Fix amTFT exploiter.

5599bf6

Fix some tests. Add augmented R2D2. Add examples with R2D2. Add some end to end tests for amTFT vs exploiter, meta game and R2D2.

Support LSTM in amTFT.

00927ab

Fix speed performance issue in entropy computation. Some refactoring of configs and hyperparameters (DQN, R2D2, LOLA-PG).

Add meta game exp with LOLA-Exact.

07c35cc

Tune HP for R2D2. Few corrections. Few style changes.

Add several options to stabilize the LOLA-PG training.

c43019c

Fix bug in cross_play evaluator.

c1051f0

Partial refactoring of the coin game envs tests. Add logging & plot of exploration temperature.

Add SSDMMCG vectorized.

67c8f79

Fix amTFT: how the environment is used during the rollouts

6f8f7e4

Change LOLA-Exact to make it work with discrete actions space larger …

9dd9738

…than 2. Add rolling average for the LOLA-PG reward centering and normalization.

Add stuff to the environments:

028be50

- punishment helped in CGs - customizable matrix game - coop coins log in vectorized MCPCG

Add meta game experiments with various meta solvers (like: alpha-rank…

46ff4b0

…, replicator dynamic)

Add more meta algorithms in meta game experiment.

c65be87

Add the "punishment helped" option in vectorized_ssd_mm_coin_game.py. Add new plots by defaults in cross an self play evaluation. Add script to plot bar chart summary figure.

Adapt LOLA-Exact and SOS-Exact to work with matrix games with N actio…

92ae8b4

…ns (instead of 2 or 3 for LOLA-Exact and instead of 2 for SOS-Exact)

Improve PSRO hardcoded (less warnings).

02ae601

Add the OpenSpiel version of the SimpleBargaining env. Add logs to compute the % of coordination in the welfare_coordination.py Add script to generate the scatter plots

Add options in plot bar and plot scatter.

4586a79

Add cross experiments between meta solvers. Add evaluation of exploitability.

Add PG for amTFT. Fix training of amTFT.

e618724

Merge branch 'master' into anonymization

2517a68

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Upgrade to RLLib 1.3 and improvements around amTFT #12

Upgrade to RLLib 1.3 and improvements around amTFT #12

Uh oh!

Manuscrit commented Jan 21, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Upgrade to RLLib 1.3 and improvements around amTFT #12

Are you sure you want to change the base?

Upgrade to RLLib 1.3 and improvements around amTFT #12

Uh oh!

Conversation

Manuscrit commented Jan 21, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants