This project provides a simulation of multi-armed bandit problems.
This implementation is based on the following paper: Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages (https://arxiv.org/abs/2308.14350).
Compatible with Python 3.6 and above. The required libraries are listed below.
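For readers new to the setting, the sketch below shows the standard UCB1 rule on a Bernoulli bandit, which is the baseline that upper confidence bound methods build on. It is not the generalized weighted average variant from the paper and it is not code from this repository. The names `ucb1_select` and `run_bernoulli_bandit`, the arm probabilities, and the seed are hypothetical and chosen only for illustration.

```python
import math
import random


def ucb1_select(counts, sums, t):
    """Pick an arm with the standard UCB1 rule at step t (1-based).

    counts[a] is the number of pulls of arm a, sums[a] its total reward.
    """
    # Pull every arm once before the confidence bound is defined.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    # Score = empirical mean + exploration bonus sqrt(2 ln t / n).
    scores = [
        sums[a] / counts[a] + math.sqrt(2.0 * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(counts)), key=scores.__getitem__)


def run_bernoulli_bandit(probs, steps, seed=0):
    """Simulate UCB1 on Bernoulli arms with success probabilities `probs`."""
    rng = random.Random(seed)
    counts = [0] * len(probs)
    sums = [0.0] * len(probs)
    for t in range(1, steps + 1):
        arm = ucb1_select(counts, sums, t)
        reward = 1.0 if rng.random() < probs[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    return counts, sums


if __name__ == "__main__":
    # Hypothetical two-armed problem used only for illustration.
    counts, sums = run_bernoulli_bandit([0.1, 0.9], steps=2000)
    print("pulls per arm:", counts)
```

With well-separated arms such as these, most pulls should concentrate on the better arm as the number of steps grows.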
- NumPy
- matplotlib
- scikit-optimize
$ pip install numpy
$ pip install matplotlib
$ pip install scikit-optimize
$ python compare_stochastic_mab.py
The results are displayed as below.
For more information, see compare_stochastic_mab.py.
$ python compare_survival_mab.py
The results are displayed as below.
For more information, see compare_survival_mab.py.
The three experiments described in the paper can be executed with the following commands.
$ python experiment1.py
$ python experiment2.py
$ python experiment3.py
This is free and open-source software licensed under the 3-clause BSD license.