
Specification Statement

Title: Benchmarking Models for Predicting League of Legends Game Outcomes

Objective: Benchmark a range of machine learning models that predict the outcome of League of Legends (LoL) games from in-game data, and identify the most accurate and efficient model based on features such as the number of kills, gold earned, and other relevant in-game statistics.

Context: League of Legends is a multiplayer online battle arena (MOBA) game in which two teams, the blue team and the red team, compete against each other. The game features three lanes, a jungle area, and five distinct roles for players. The primary objective is to destroy the opposing team's Nexus to secure victory.

Feasibility: To ensure the project's feasibility, a Proof of Concept (PoC) has been built and is accessible on GitHub here.

Group Composition: The project will be conducted by a group of 5 people, each responsible for different aspects of the benchmarking process, including data preprocessing, model training, evaluation, and documentation.

The group is made up of:

Datasets:

This dataset was created by scraping the official Riot API. It comprises 200,000 games with 109 features each, including in-game statistics such as kills, deaths, assists, gold earned, and other relevant information.
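As a hedged sketch of how individual games can be pulled from the Riot API, the snippet below builds a request against the public match-v5 endpoint. The routing region, URL pattern, and `X-Riot-Token` header follow Riot's developer documentation; the API key and match ID shown are placeholders, not values from this project.

```python
def match_url(region: str, match_id: str) -> str:
    """Build the match-v5 endpoint URL for one game.

    `region` is a routing value such as "europe" or "americas",
    not an individual server shard.
    """
    return f"https://{region}.api.riotgames.com/lol/match/v5/matches/{match_id}"


def auth_headers(api_key: str) -> dict:
    """Riot authenticates requests via the X-Riot-Token header."""
    return {"X-Riot-Token": api_key}


# Hypothetical match ID for illustration; a real scrape would iterate
# over IDs returned by the match-list endpoint and rate-limit requests.
url = match_url("europe", "EUW1_1234567890")
headers = auth_headers("RGAPI-your-key-here")
```

In practice the scraper would pair this with an HTTP client (e.g. `requests`), respect Riot's rate limits, and flatten each match JSON into the 109 tabular features.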

Url

Models to Benchmark: The following models will be benchmarked to predict the outcome of LoL games:

  1. Logistic Regression: A baseline model to establish a performance benchmark.
  2. Decision Trees: To capture non-linear relationships in the data.
  3. Random Forest: An ensemble method to improve the performance of decision trees.
  4. Gradient Boosting Machines (GBM): Including XGBoost and LightGBM for robust performance.
  5. Support Vector Machines (SVM): With different kernel functions to handle complex data patterns.
  6. Neural Networks: Including Multi-Layer Perceptrons (MLP) and Convolutional Neural Networks (CNN) for capturing intricate data relationships.
  7. k-Nearest Neighbors (k-NN): To evaluate the performance of instance-based learning.
  8. Naive Bayes: To assess the performance of probabilistic classifiers.
  9. AdaBoost: An ensemble method to boost the performance of weak classifiers.
  10. CatBoost: A gradient boosting library that handles categorical features efficiently.
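Most of the models above are available directly in scikit-learn, so the benchmark can be organized as a single dictionary of candidates; this is a sketch assuming scikit-learn, with the hyperparameter values shown being illustrative defaults rather than the project's tuned settings. XGBoost, LightGBM, and CatBoost ship as separate packages, and a CNN/MLP beyond scikit-learn's `MLPClassifier` would need a deep-learning framework.

```python
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import (RandomForestClassifier,
                              GradientBoostingClassifier,
                              AdaBoostClassifier)
from sklearn.svm import SVC
from sklearn.neural_network import MLPClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.naive_bayes import GaussianNB

# Candidate models keyed by name. XGBoost, LightGBM, and CatBoost would be
# appended from their own packages (xgboost, lightgbm, catboost) when installed.
MODELS = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "decision_tree": DecisionTreeClassifier(),
    "random_forest": RandomForestClassifier(n_estimators=200),
    "gradient_boosting": GradientBoostingClassifier(),
    "svm_rbf": SVC(kernel="rbf", probability=True),
    "mlp": MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=500),
    "knn": KNeighborsClassifier(n_neighbors=15),
    "naive_bayes": GaussianNB(),
    "adaboost": AdaBoostClassifier(),
}
```

Keeping the candidates in one dictionary lets the training and evaluation steps loop over every model with identical code.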

Evaluation Metrics: The performance of each model will be evaluated using the following metrics:

  • Accuracy: The proportion of correctly predicted game outcomes.
  • Precision: The proportion of correctly predicted positive outcomes (wins) out of all predicted positive outcomes.
  • Recall: The proportion of correctly predicted positive outcomes out of all actual positive outcomes.
  • F1-Score: The harmonic mean of precision and recall.
  • AUC-ROC: The area under the Receiver Operating Characteristic curve to evaluate the model's ability to distinguish between positive and negative outcomes.
  • Confusion Matrix: To visualize the performance of the model in terms of true positives, true negatives, false positives, and false negatives.
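The metric suite above maps directly onto scikit-learn's `sklearn.metrics` module; the helper below is a minimal sketch (the function name `evaluate` and the toy labels are illustrative, not part of the project's codebase).

```python
import numpy as np
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, roc_auc_score, confusion_matrix)

def evaluate(y_true, y_pred, y_prob):
    """Compute the benchmark's metric suite for one model.

    y_prob is the predicted probability of the positive class
    (here, a win), needed for AUC-ROC.
    """
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "precision": precision_score(y_true, y_pred),
        "recall": recall_score(y_true, y_pred),
        "f1": f1_score(y_true, y_pred),
        "auc_roc": roc_auc_score(y_true, y_prob),
        "confusion_matrix": confusion_matrix(y_true, y_pred),
    }

# Toy example: 1 = win, 0 = loss
y_true = np.array([1, 0, 1, 1, 0, 0])
y_pred = np.array([1, 0, 0, 1, 0, 1])
y_prob = np.array([0.9, 0.2, 0.4, 0.8, 0.1, 0.6])
scores = evaluate(y_true, y_pred, y_prob)
```

Reporting all six quantities per model makes the final comparison table a single loop over the `MODELS` being benchmarked.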

Methodology:

  1. Data Preprocessing: Clean and preprocess the dataset to handle missing values and outliers, and normalize the features.
  2. Feature Engineering: Create new features that may improve the predictive power of the models.
  3. Model Training: Train each model using the preprocessed datasets.
  4. Hyperparameter Tuning: Optimize the hyperparameters of each model using techniques such as grid search or random search.
  5. Evaluation: Evaluate the performance of each model using the specified evaluation metrics.
  6. Comparison: Compare the performance of all models to identify the most accurate and efficient model for predicting LoL game outcomes.
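Steps 1, 3, 4, and 5 above can be sketched as a single scikit-learn pipeline; this is an assumption-laden illustration on synthetic data (the feature columns stand in for the real 109 features, and the grid is a placeholder, not the project's tuning grid).

```python
import numpy as np
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

# Synthetic stand-in for the real dataset: 5 columns playing the role of
# features like kills, deaths, assists, gold earned, and creep score.
rng = np.random.default_rng(42)
X = rng.normal(size=(1000, 5))
y = (X[:, 0] + 0.5 * X[:, 3] + rng.normal(scale=0.5, size=1000) > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Normalization (step 1) and training (step 3) chained in one pipeline,
# so the scaler is fit only on each training fold during tuning (step 4).
pipe = Pipeline([("scale", StandardScaler()),
                 ("clf", LogisticRegression())])
search = GridSearchCV(pipe, {"clf__C": [0.1, 1.0, 10.0]},
                      cv=5, scoring="accuracy")
search.fit(X_train, y_train)

# Step 5: evaluate the tuned model on the held-out split.
test_accuracy = search.score(X_test, y_test)
```

Swapping the `"clf"` step for each candidate model (with its own parameter grid) turns this sketch into the full benchmark loop of step 6.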

Deliverables:

  1. Benchmarking Report: A detailed report comparing the performance of all models.
  2. Codebase: A repository containing the code used for data preprocessing, model training, and evaluation.
  3. Presentation: A presentation summarizing the findings and recommendations.
