The goal of this repository is to use Ray's RLlib library to solve the game of 2048. The primary focus is not to dive into RL theory and application, but rather to showcase the tools that come with Ray for monitoring and scaling training in a multi-container Docker environment.
The 2048 environment used in this repository comes from HERE.
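For context, the core mechanic any 2048 environment must implement is sliding and merging tiles. Below is a minimal sketch of the left-merge rule for a single row; it is illustrative only and is not the repository's actual environment code:

```python
def merge_row_left(row):
    """Slide one row of a 2048 board to the left and merge tiles.

    Non-zero tiles slide left; each pair of equal adjacent tiles merges
    once per move, doubling in value. Returns the new row and the score
    gained (the sum of newly created tile values).
    """
    tiles = [t for t in row if t != 0]  # drop empty cells
    merged, score, i = [], 0, 0
    while i < len(tiles):
        if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
            merged.append(tiles[i] * 2)  # merge a matching pair
            score += tiles[i] * 2
            i += 2  # skip both merged tiles
        else:
            merged.append(tiles[i])
            i += 1
    # pad with zeros back to the original row length
    return merged + [0] * (len(row) - len(merged)), score
```

For example, `merge_row_left([2, 2, 4, 0])` yields `([4, 4, 0, 0], 4)`: the two 2s merge into a 4 (scoring 4) and the existing 4 slides left behind it.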
- [ ] Render the game board for inspection.
- [ ] Use curriculum learning to periodically make the game board larger or the "2048" target higher.
- [ ] Build a sample Grafana dashboard.
- [ ] Create/mount a training config YAML file.
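The curriculum-learning item above could be driven by a simple schedule keyed on recent performance. A hypothetical sketch, where the reward thresholds and the meaning of each level (board size, merge target) are made-up placeholders:

```python
def curriculum_level(mean_reward, thresholds=(50.0, 150.0, 300.0)):
    """Map recent mean episode reward to a curriculum task level.

    The thresholds here are hypothetical. Each level could correspond to
    a larger board or a higher merge target, e.g. level 0 -> 4x4 board
    targeting 256, level 3 -> 4x4 board targeting 2048.
    """
    level = 0
    for t in thresholds:
        if mean_reward >= t:
            level += 1  # agent has "graduated" past this threshold
    return level
```

RLlib supports this pattern natively via its curriculum/task-settable environment utilities, where a callback inspects training results and calls back into the environment to change the task.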
Assuming you have docker-compose installed:
- Stand up the container stack using `docker-compose up`. If this works, the Ray web services and Tensorboard should be active.
- In another terminal, run `docker-compose exec ray-rllib python gym_env/train.py` to start the training process. Alternatively, to test rendering, run `docker-compose exec ray-rllib python gym_env/basic.py` to visualize random actions.
- When you are done, use `ctrl+c` to stop the process (both the training and the stack), and use `docker-compose down` to spin down the project containers.
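For orientation, a training entry point like `gym_env/train.py` typically boils down to initializing Ray and launching an RLlib algorithm via Tune. The sketch below is a guess at the general shape, not the repository's actual script; the `"Gym2048-v0"` environment id and all config values are assumptions:

```python
def make_config(num_workers=2):
    """Build a classic Tune-style RLlib config dict (hypothetical values)."""
    return {
        "env": "Gym2048-v0",       # assumed id; the repo registers its own 2048 env
        "num_workers": num_workers,  # parallel rollout workers
        "framework": "torch",
    }


def main():
    # Ray imports are kept inside main() so the config above can be
    # inspected without Ray installed.
    import ray
    from ray import tune

    ray.init()  # inside the container, this joins/starts the local Ray cluster
    tune.run(
        "PPO",                      # assumed algorithm choice
        config=make_config(),
        stop={"training_iteration": 100},
    )


if __name__ == "__main__":
    main()
```

Running this through `docker-compose exec` (as above) is what makes the metrics appear in the Ray dashboard, Prometheus, and Tensorboard services described below.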
Additionally, visit the web services described below to monitor the training process.
Upon starting this stack, the following web services will be made available:
- Ray Dashboard: `localhost:8265` - Used to monitor Ray clusters, including error logs, hardware utilization, etc.
- Ray Monitoring: `localhost:8080` - A Prometheus endpoint containing various metrics captured by Ray.
- Prometheus: `localhost:8080/metrics` - Prometheus endpoint with some default metrics.
- Grafana: `localhost:3000` - A monitoring tool that parses the Prometheus endpoints and visualizes their metrics.
- Tensorboard: `localhost:6006` - Visualization tool for monitoring training progress and experimentation.
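If you want to consume the Prometheus endpoints above programmatically rather than through Grafana, the plain-text exposition format they serve is easy to parse. A minimal sketch that skips `# HELP`/`# TYPE` comment lines and ignores metric labels for brevity (the sample metric name in the usage note is illustrative):

```python
def parse_prometheus_text(text):
    """Parse Prometheus text exposition format into {metric_name: value}.

    Handles simple gauge/counter lines of the form:
        metric_name{label="x"} 42.5
    Comment lines and labels are skipped; repeated names keep the last value.
    """
    metrics = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and HELP/TYPE comments
        name_part, _, value = line.rpartition(" ")
        name = name_part.split("{", 1)[0]  # drop any {label="..."} suffix
        metrics[name] = float(value)
    return metrics
```

For instance, fetching `localhost:8080` and feeding the response body to this function would give you a dict of current Ray metric values to log or alert on.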