RARE Challenge - Computer Vision for Early Cancer Detection

Introduction

The RARE Challenge focuses on developing computer-aided detection (CADe) systems for early cancer detection in low-prevalence scenarios. In clinical practice, early-stage cancers are rare and often overshadowed by normal findings, making model training and evaluation particularly challenging. This repository provides a baseline implementation for training and evaluating deep learning models for the detection of neoplasia in Barrett’s Esophagus (BE) using a class-imbalanced dataset.

Challenge Motivation

Detecting early-stage neoplasia in BE is crucial for timely interventions. Missed detections can lead to late-stage cancer progression, significantly lowering survival rates. The challenge lies in effectively handling the severe class imbalance, ensuring models achieve both high sensitivity and specificity in real-world clinical settings.

Repository Structure

├── configs/                     # YAML configuration files for different model setups
├── dataset.py                   # Dataset class for loading and preprocessing data
├── evaluate.py                  # Script for evaluating models and performing bootstrap analysis
├── metrics.py                   # Custom metrics for model evaluation
├── output/                      # Directory for storing experiment outputs (logs, models, results)
├── README.md                    # Project documentation
├── requirements.txt             # Python dependencies for the project
├── run_baselines.sh             # Shell script to run all baseline experiments
├── train.py                     # Script for training models
└── Dockerfile                   # Dockerfile for containerizing the project

Explanation of Key Files

configs/: Contains YAML files specifying model configurations (e.g., model type, batch size, learning rate).
dataset.py: Implements the RareTestSet class for loading and preprocessing the dataset.
evaluate.py: Handles model evaluation, including generating predictions and performing bootstrap analysis for metrics.
metrics.py: Defines custom metrics, such as AUROC and AUPRC, for evaluating model performance.
run_baselines.sh: Automates the execution of baseline experiments for all configurations.
train.py: Main script for training models using the configurations provided in the configs/ directory.
Dockerfile: Used to build a Docker image for running the project in a containerized environment.
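
The bootstrap analysis mentioned for evaluate.py can be illustrated with a generic percentile bootstrap. The sketch below is not taken from the repository: the function names `auroc` and `bootstrap_ci`, the number of resamples, and the toy data are all assumptions chosen for illustration, and the actual scripts may compute their intervals differently. It uses only the standard library and a pairwise AUROC, which is easy to read but slow for large test sets.

```python
import random


def auroc(labels, scores):
    """AUROC as the probability that a random positive outscores a random
    negative, with ties counted as 0.5 (illustrative, O(n_pos * n_neg))."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_ci(labels, scores, metric=auroc, n_resamples=1000,
                 alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for `metric`.

    Hypothetical helper: resamples cases with replacement and redraws any
    resample that contains only one class, which can happen often when
    positives are rare.
    """
    if len(set(labels)) < 2:
        raise ValueError("both classes must be present in the data")
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_resamples:
        idx = [rng.randrange(n) for _ in range(n)]
        sample_labels = [labels[i] for i in idx]
        if len(set(sample_labels)) < 2:
            continue  # resample lacks a class; draw again
        stats.append(metric(sample_labels, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_resamples)]
    upper = stats[min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))]
    return lower, upper


if __name__ == "__main__":
    # Toy, made-up data with few positives (not challenge data).
    labels = [0] * 16 + [1] * 4
    scores = [0.05 * i for i in range(16)] + [0.5, 0.7, 0.85, 0.9]
    low, high = bootstrap_ci(labels, scores, n_resamples=200)
    print(f"AUROC = {auroc(labels, scores):.3f}, 95% CI = [{low:.3f}, {high:.3f}]")
```

With very few positive cases the interval can be wide, which is one reason a confidence interval is more informative than a single point estimate in low-prevalence evaluation.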

Running Baseline Experiments with Docker

To build the Docker image and run all baseline experiments in a container on a Windows device, follow these steps:

Build the Docker image:
```
docker build -t rare-challenge .
```
Run the Docker container:
```
docker run --rm --env-file .env -v $(pwd):/app rare-challenge bash run_baselines.sh
```
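This command mounts the current directory at /app in the container and runs the run_baselines.sh script. Set the container's environment variables (e.g., the dataset path) in the .env file passed via --env-file. The $(pwd) substitution assumes PowerShell or a Bash-compatible shell; in the Windows Command Prompt, use %cd% instead.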
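
Docker's --env-file option expects one KEY=value pair per line. As a minimal sketch of how a script inside the container might pick up such a variable, the helper below reads a dataset path from the environment. The variable name `DATASET_PATH` and the function `resolve_dataset_path` are hypothetical; check the repository's scripts for the names they actually read.

```python
import os
from pathlib import Path


def resolve_dataset_path(var_name="DATASET_PATH", env=None):
    """Return the dataset path stored in an environment variable.

    `var_name` is a hypothetical name used for illustration only.
    `env` defaults to os.environ and can be replaced with a dict for testing.
    """
    env = os.environ if env is None else env
    value = env.get(var_name)
    if not value:
        raise RuntimeError(
            f"{var_name} is not set; define it in the .env file passed to "
            "docker run via --env-file"
        )
    return Path(value)


if __name__ == "__main__":
    # Simulate the variable being set, as --env-file would do in the container.
    print(resolve_dataset_path(env={"DATASET_PATH": "/data/rare"}))
```

Failing early with a clear message is a deliberate choice here: a missing variable otherwise tends to surface later as a less obvious file-not-found error.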

Questions

For any questions or contributions, feel free to open an issue or contact us via e-mail.
