MMDetection-DS

A foundational codebase for dual-spectrum image object detection, based on the MMDetection3.1.0 framework code.

Related

Main Features

Customized aligned dual-spectrum dataset configuration files
Customized dual-spectrum dataset preprocessing configuration
Customized dual-stream network structure

Background

Recent dual-spectrum object detection works claim to have achieved high accuracy on multiple dual-spectrum datasets, but none have open-sourced their code, making their methods difficult to reproduce and learn from.

Additionally, current object detection works heavily utilize various optimization tools, making it challenging to compete with SOTA methods that incorporate numerous optimization tools. Frameworks like Detection2 and MMDetection are highly extensible and user-friendly, allowing easy use of various optimization tools, but their extensive abstraction and encapsulation make them challenging for beginners to learn.

Therefore, we hope to open-source a dual-spectrum object detection codebase to facilitate better learning and research on dual-spectrum object detection tasks.

1. Environment Setup

Please follow the steps below to configure the runtime environment:

1. First, clone this codebase

git clone --depth 1 <this repo url> mmdetection-ds
cd mmdetection-ds

2. Create an environment and install dependencies

Create and activate a new conda environment

conda create -n openmmlab python=3.8 -y
conda activate openmmlab

Install PyTorch, the default installation is the CPU version. For the GPU version, please refer to 1. CUDA-Toolkit version 11.8 or above, 2. Other lower versions

conda install pytorch torchvision -c pytorch  
# It is recommended not to use this command directly, but to choose the appropriate version based on the links above

pip install -U openmim
mim install "mmcv>=2.0.0"
pip install -v -e .

2. Dataset Preparation

Aligned dual-spectrum (RGB+IR) datasets

LLVIP Dataset

LLVIP Dataset: https://bupt-ai-cz.github.io/LLVIP/

Note: If you do not want to use the following script to convert annotation files or find it inconvenient to use the LLVIP provided dataset download method, you can directly use our backup (including converted annotations), download link: https://huggingface.co/datasets/UserNae3/LLVIP

Dataset Download and Annotation Conversion:

The annotation files provided by LLVIP are in VOC format. Although they provide a format conversion script, all annotations are in one folder without separation.

We provide a script that uses their conversion script to automatically generate corresponding annotation files based on the training and test set splits of the dataset.

First, download the LLVIP dataset zip file LLVIP.zip and extract it to a folder, then run the following script

# We use the typer library, so you need to install it first
pip install typer

cd dataset/ # Enter the dataset/ folder of this project
python dct.py llvip --data_dir path_to_extracted_LLVIP_dataset

It will generate COCO format annotation files for the training and test sets and save them in the coco_annotations folder.

FLIR Dataset

FLIR Dataset: https://www.flir.com/oem/adas/adas-dataset-form/

The FLIR dataset has multiple versions. The aligned dual-spectrum dataset generally uses the version from Zhang et al. Paper, Dataset, hereafter referred to as the FLIR dataset.

Note: If you do not want to use the following script to convert or find it inconvenient to use the dataset download method provided by Zhang et al., you can directly use our backup (including converted annotations), download link: https://huggingface.co/datasets/UserNae3/FLIR_aligned

Dataset Conversion:

Similar to the LLVIP dataset, the FLIR dataset also provides VOC format annotation files. We provide a script that calls their conversion script to automatically generate corresponding annotation files based on the training and test set splits of the dataset. First, download the FLIR dataset zip file FLIR.zip and extract it to a folder, then run the following script

cd dataset/ # Enter the dataset/ folder of this project
python dct.py flir --data_dir path_to_extracted_FLIR_dataset

Additionally, in some past papers, the dog category is generally removed from the FLIR dataset due to its small quantity. We also provide a script to remove specific categories from COCO annotations. This is also included in the backup we provided, which includes annotations with the dog category removed.

cd dataset/ # Enter the dataset/ folder of this project
python python dct.py coco-rc --help # View usage help

3. Related Configuration Files and Models

Training Configuration Files: aligned_dual_spectrum_od

Dual-Spectrum Dataset Configuration Files: aligned_dual_spectrum_dataset.py

Dual-Spectrum Dataset Preprocessing Configuration Files: aligned_dual_spectrum_preprocess.py

Example Model: ts_backbone.py

4. More

For more resources, refer to

Multispectral-Pedestrian-Detection-Resource: https://github.com/CalayZhou/Multispectral-Pedestrian-Detection-Resource

5. Copyright

The copyright of the models, datasets, and optimization tools involved in this code belongs to the original authors. Please refer to the relevant literature to understand the related copyright information.

Name		Name	Last commit message	Last commit date
Latest commit History 2,620 Commits
.circleci		.circleci
.dev_scripts		.dev_scripts
.github		.github
configs		configs
dataset		dataset
demo		demo
docker		docker
docs		docs
mmdet		mmdet
projects		projects
requirements		requirements
resources		resources
tests		tests
tools		tools
.gitignore		.gitignore
.owners.yml		.owners.yml
.pre-commit-config-zh-cn.yaml		.pre-commit-config-zh-cn.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
CITATION.cff		CITATION.cff
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
MM-README-en.md		MM-README-en.md
MM-README_zh-CN.md		MM-README_zh-CN.md
README-zh.md		README-zh.md
README.md		README.md
dataset-index.yml		dataset-index.yml
model-index.yml		model-index.yml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MMDetection-DS

1. Environment Setup

1. First, clone this codebase

2. Create an environment and install dependencies

2. Dataset Preparation

LLVIP Dataset

FLIR Dataset

3. Related Configuration Files and Models

4. More

5. Copyright

About

Releases

Packages

Contributors 405

Languages

License

echoniuniu/DSOD

Folders and files

Latest commit

History

Repository files navigation

MMDetection-DS

1. Environment Setup

1. First, clone this codebase

2. Create an environment and install dependencies

2. Dataset Preparation

LLVIP Dataset

FLIR Dataset

3. Related Configuration Files and Models

4. More

5. Copyright

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 405

Languages

Packages