This repository contains the submission notebook and helper-functions file for the CogPilot data challenge hosted by the US Department of the Air Force - MIT Artificial Intelligence (AI) Accelerator. Our team placed 4th in this international challenge.
There are 2 tasks in this challenge, and our submission focuses on Task 1: classifying the difficulty level (1-4) of a flight simulation run using only physiological metrics. Submissions are evaluated on F1 score and Area Under the ROC Curve (AUC) between the predicted and actual difficulty levels.
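As a reference for how these two metrics are computed, here is a minimal sketch using scikit-learn on entirely made-up labels and probabilities (the challenge's actual evaluation harness is not part of this repository):

```python
import numpy as np
from sklearn.metrics import f1_score, roc_auc_score

# Hypothetical predictions for illustration: 8 runs, difficulty levels 1-4.
y_true = np.array([1, 2, 3, 4, 1, 2, 3, 4])
y_pred = np.array([1, 2, 3, 3, 1, 2, 4, 4])

# Hypothetical class-probability matrix (rows sum to 1), one column per level.
rng = np.random.default_rng(0)
y_proba = rng.dirichlet(np.ones(4), size=8)

# Macro-averaged F1 over the four difficulty classes.
f1 = f1_score(y_true, y_pred, average="macro")

# Multiclass AUC via one-vs-rest averaging over the classes.
auc = roc_auc_score(y_true, y_proba, average="macro", multi_class="ovr")
print(f"F1={f1:.3f}  AUC={auc:.3f}")
```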
Create a new virtual environment using the supplied requirements.txt to ensure you have the required packages.
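On a Unix-like system, the setup might look like the following (assuming `python3` is on your PATH; the install step is guarded so the snippet does not fail when run outside the repository root):

```shell
# Create and activate an isolated environment.
python3 -m venv .venv
. .venv/bin/activate

# Install the pinned dependencies from the repository root, if present.
if [ -f requirements.txt ]; then
    pip install -r requirements.txt
fi
```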
Once the virtual environment is set up, the folder structure is as such:
├── Main
│ ├── Code
│ │ ├── Submission.ipynb
│ │ ├── helperfunction.py
│ ├── dataPackage
│ │ ├── task-ils
│ │ ├── task-rest
│ │ ├── EvalSet_StartEndTimes.csv
│ ├── dataPackageEval
│ │ ├── EvalSet_StartEndTimes.csv
We primarily use the sktime package for time-series classification via machine learning. The documentation can be found here.
We also utilised pycaret to train the final ensemble model, as it provided a fast and convenient way to rapidly test multiple classifiers on non-time-series data. Documentation can be found here.
Utilising the time-series classification APIs available in sktime, we methodically trained classifiers on each physiological signal and tested the models against 20% holdout data across 5 folds. (Certain datasets were too large to feasibly train on local machines; for these we conducted only 1 fold of testing, as each fold requires up to 10 hours to run.)
Through this process, we optimised the hyperparameters and selected the best classifier for each physiological signal.
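The holdout procedure above can be sketched as follows. The snippet uses synthetic data and a scikit-learn RandomForestClassifier as a stand-in for the actual sktime classifiers (which expose the same fit/score interface); the data shapes and model choice here are illustrative assumptions, not the values used in the notebook:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedShuffleSplit

# Synthetic stand-in for one physiological signal's feature matrix:
# 100 runs x 20 summary features, difficulty labels 1-4.
rng = np.random.default_rng(42)
X = rng.normal(size=(100, 20))
y = rng.integers(1, 5, size=100)

# Five independent 80/20 splits, mirroring the 20% holdout across 5 folds.
splitter = StratifiedShuffleSplit(n_splits=5, test_size=0.2, random_state=0)
scores = []
for train_idx, test_idx in splitter.split(X, y):
    clf = RandomForestClassifier(n_estimators=50, random_state=0)
    clf.fit(X[train_idx], y[train_idx])
    scores.append(clf.score(X[test_idx], y[test_idx]))
print("mean holdout accuracy:", np.mean(scores))
```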
Following this, the classifier with optimised hyperparameters was used to generate prediction probabilities for each physiological signal. The predictions from the five holdout sets were combined into one table. Here is an example generated for the ECG signal:
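Stacking the per-fold probability tables can be sketched with pandas as below; the column names and table sizes are hypothetical, not the exact ones used in the notebook:

```python
import numpy as np
import pandas as pd

# Hypothetical per-fold probability tables for the ECG signal:
# 5 folds of 3 held-out runs each, with one probability column per level.
rng = np.random.default_rng(1)
folds = []
for fold in range(5):
    proba = rng.dirichlet(np.ones(4), size=3)
    df = pd.DataFrame(proba, columns=["ecg_p1", "ecg_p2", "ecg_p3", "ecg_p4"])
    df.insert(0, "subject", [f"S{fold * 3 + i:02d}" for i in range(3)])
    df.insert(1, "difficulty", rng.integers(1, 5, size=3))
    df.insert(2, "run", 1)
    folds.append(df)

# Stack the five holdout tables into a single prediction table.
ecg_table = pd.concat(folds, ignore_index=True)
print(ecg_table.head())
```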
This was repeated for all signals. The probability tables were joined on the subject, difficulty, and run columns to form a large ensembled table.
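The join step works like the following sketch (two tiny hypothetical per-signal tables; the real tables have more rows and one probability column per difficulty level):

```python
import pandas as pd

# Two hypothetical per-signal probability tables sharing the key columns.
keys = {"subject": ["S01", "S02"], "difficulty": [1, 2], "run": [1, 1]}
ecg = pd.DataFrame({**keys, "ecg_p1": [0.7, 0.1], "ecg_p2": [0.3, 0.9]})
pupil = pd.DataFrame({**keys, "pupil_p1": [0.6, 0.2], "pupil_p2": [0.4, 0.8]})

# An inner join on the shared keys builds the ensembled feature table.
ensembled = ecg.merge(pupil, on=["subject", "difficulty", "run"], how="inner")
print(ensembled)
```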
With the optimised classifiers and hyperparameters from the previous section, we trained the classifiers on the entire training set for each physiological signal. These models were then used on the evaluation datasets to generate probabilities in the same format as above, which were again ensembled across all physiological signals.
This yields a training set of ensembled probabilities and an evaluation set of ensembled probabilities, each covering all physiological signals.
With the ensembled training table, we utilised pycaret to determine feature importance. Pupil diameter had the largest predictive power when validated against holdouts of the training data, with features such as gaze direction, respiration, and ECG also being important:
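The idea behind the importance ranking can be sketched without pycaret by fitting a tree ensemble and reading off its impurity-based importances. The feature names, data, and planted correlation below are purely illustrative assumptions:

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

# Synthetic ensembled table: one stand-in feature per signal (names are
# illustrative). A correlation with difficulty is planted in pupil_p so the
# importance ranking has something to find.
rng = np.random.default_rng(7)
n = 200
y = rng.integers(1, 5, size=n)
X = pd.DataFrame({
    "pupil_p": y / 4 + rng.normal(scale=0.1, size=n),  # correlated with difficulty
    "gaze_p": rng.random(n),
    "resp_p": rng.random(n),
    "ecg_p": rng.random(n),
})

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
importance = pd.Series(clf.feature_importances_, index=X.columns)
importance = importance.sort_values(ascending=False)
print(importance)
```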
We hypothesize that pupil diameter is important because higher-difficulty landings have poorer visibility, and a pilot's pupils naturally dilate in poor visibility to gather more light. Similarly, we hypothesize that gaze direction is important because pilots look at either the instruments or out of the window more depending on visibility conditions as landing difficulty increases. ECG and respiration reflect the pilot's stress levels, and it is logical that changing the landing difficulty results in changes to these signals.
Utilising the 4 most important signals (pupil diameter, gaze direction, respiration, and ECG) for the training set, we obtained the accuracy, F1, and AUC scores below (note that results can vary from run to run due to random seeds):
The model, fully trained on the 4 most important signals, was applied to the evaluation dataset, and we are now awaiting the results of the competition.