Official PyTorch implementation of "StackMFF: End-to-end Multi-Focus Image Stack Fusion Network"
Our paper has been accepted by Applied Intelligence (APIN).
Existing end-to-end multi-focus image fusion networks work well for merging two images but suffer from various types of image degradation when applied to image stacks. To address this issue, we propose a novel 3D CNN approach for end-to-end fusion of entire multi-focus image stacks. Furthermore, we develop a synthesis pipeline based on monocular depth estimation to create a large-scale training dataset. To improve fusion quality, we introduce a 3D coordinate adaptive enhancement module that amplifies and emphasizes high-frequency information. Additionally, we establish a benchmark for the multi-focus image stack fusion task and release a comprehensive toolbox containing 12 different algorithms to facilitate consistent evaluation and comparison. Extensive experiments show that our method effectively fuses multi-focus image stacks while reducing image degradation, achieving state-of-the-art fusion quality and processing speed.
- Proposes the first network specifically designed for multi-focus image stack fusion.
- Introduces a novel pipeline for synthesizing image stacks based on depth estimation.
- Establishes a benchmark for the multi-focus image stack fusion task.
- Releases a multi-focus image stack fusion toolbox containing 12 algorithms.
- Achieves state-of-the-art fusion performance and processing speed.
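For intuition, the stack-as-volume idea can be sketched in a few lines of PyTorch. The toy model below is not the StackMFF architecture; it only illustrates fusing a multi-focus stack by predicting per-pixel focus weights with 3D convolutions:

```python
import torch
import torch.nn as nn

class Toy3DFusion(nn.Module):
    """Illustrative only: fuses an N-image focus stack into one image
    by convolving over the stack (depth) dimension."""
    def __init__(self):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv3d(1, 16, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv3d(16, 1, kernel_size=3, padding=1),
        )

    def forward(self, stack):  # stack: (B, 1, N, H, W)
        # Per-pixel focus weights, normalized across the N stack slices
        weights = torch.softmax(self.body(stack), dim=2)
        return (weights * stack).sum(dim=2)  # fused image: (B, 1, H, W)

fused = Toy3DFusion()(torch.rand(1, 1, 8, 64, 64))  # 8-image grayscale stack
print(fused.shape)  # torch.Size([1, 1, 64, 64])
```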
- pytorch 1.13.0
- torchvision 0.14.0
- opencv-python 4.9.0.80
- pillow 10.2.0
- tensorboardX 2.6.2.2
- tqdm 4.66.2
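All dependencies can be installed in one step, assuming the standard PyPI package names:

pip install torch==1.13.0 torchvision==0.14.0 opencv-python==4.9.0.80 pillow==10.2.0 tensorboardX==2.6.2.2 tqdm==4.66.2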
python predict_dataset.py --model_path checkpoint/checkpoint.pth --stack_basedir_path "data/Datasets_StackMFF/4D-Light-Field/image stack"
python predict.py --model_path checkpoint/checkpoint.pth --stack_path "data/Datasets_StackMFF/4D-Light-Field/image stack/boxes"
Download the validation set of Open Images V7 (the original dataset used to build our training data) and put all images into 'data/OpenImagesV7'.
Split the Open Images V7 validation set into training and validation subsets by running the following command:
python split_dataset.py
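If you need to reproduce the split manually, the sketch below shows the general idea; the source directory name and the 80/20 ratio are assumptions, and split_dataset.py is authoritative:

```python
import random, shutil
from pathlib import Path

src = Path("data/OpenImagesV7/validation")  # assumed source directory
images = sorted(src.glob("*.jpg"))
random.seed(0)
random.shuffle(images)

cut = int(0.8 * len(images))  # assumed 80/20 train/test ratio
for subset, files in (("train", images[:cut]), ("test", images[cut:])):
    out = Path("data/OpenImagesV7") / subset
    out.mkdir(parents=True, exist_ok=True)
    for f in files:
        shutil.copy(f, out / f.name)
```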
Use Metric3D to generate depth maps (8-bit, values in 0-255) for all images, and put them into 'data/OpenImagesV7/train_depth' and 'data/OpenImagesV7/test_depth', respectively. Then generate multi-focus image stacks with depth-adapted multi-focus simulation (DAMS) by running:
python make_dataset.py
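Conceptually, DAMS picks a series of focal planes and blurs each pixel in proportion to its depth distance from the current plane. The OpenCV sketch below illustrates this idea only; the slice count, kernel sizes, thresholds, and file names are illustrative, not the parameters used by make_dataset.py:

```python
import cv2
import numpy as np

def simulate_focus_stack(image, depth, n_slices=8, max_blur=9):
    """Toy depth-adapted multi-focus simulation: for each focal plane,
    blur pixels in proportion to their depth distance from that plane."""
    stack = []
    for i in range(n_slices):
        focal = (i + 0.5) / n_slices * 255.0  # focal plane in 8-bit depth units
        dist = np.abs(depth.astype(np.float32) - focal) / 255.0
        out = image.astype(np.float32).copy()
        for k in range(3, max_blur + 1, 2):  # increasingly large odd Gaussian kernels
            # Pixels farther from the focal plane receive larger blur kernels
            mask = (dist > (k - 1) / max_blur)[..., None]
            blurred = cv2.GaussianBlur(image, (k, k), 0).astype(np.float32)
            out = np.where(mask, blurred, out)
        stack.append(out.astype(np.uint8))
    return stack

img = cv2.imread("example.jpg")                               # hypothetical image path
dep = cv2.imread("example_depth.png", cv2.IMREAD_GRAYSCALE)   # 8-bit Metric3D depth map
stack = simulate_focus_stack(img, dep)
```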
The training code will be released after the article is accepted.
We have prepared all the evaluation datasets for you; they can be downloaded here. Put Datasets_StackMFF under data/Datasets_StackMFF.
If you want to make your own evaluation dataset, please refer to the following:
Download the 4D-Light-Field dataset, put full_data.zip under ./data/4D-Light-Field_Gen, and run the following commands under ./data/4D-Light-Field_Gen:
unzip full_data.zip
python LF2hdf5.py --base_dir ./full_data --output_dir ./LF
python FS_gen.py --LF_path ./LF/HCI_LF_trainval.h5 --output_dir ./FS
python save_AiF.py ./FS/HCI_FS_trainval.h5 ./FS
python save_stack.py ./FS/HCI_FS_trainval.h5 ./FS
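For reference, focal stacks are commonly synthesized from a light field by shift-and-add refocusing: each sub-aperture view is shifted in proportion to its angular offset and the results are averaged, bringing one depth plane into focus per shift amount. The NumPy sketch below illustrates that principle; FS_gen.py may use a different formulation:

```python
import numpy as np

def refocus(light_field, alpha):
    """Shift-and-add refocusing.
    light_field: (U, V, H, W, 3) array of sub-aperture views.
    alpha: refocus parameter selecting which depth plane is sharp."""
    U, V, H, W, _ = light_field.shape
    cu, cv = (U - 1) / 2.0, (V - 1) / 2.0
    acc = np.zeros((H, W, 3), dtype=np.float64)
    for u in range(U):
        for v in range(V):
            # Shift each view by its angular offset scaled by alpha, then average
            du = int(round(alpha * (u - cu)))
            dv = int(round(alpha * (v - cv)))
            acc += np.roll(light_field[u, v], shift=(du, dv), axis=(0, 1))
    return (acc / (U * V)).astype(light_field.dtype)

# Focal stack: one refocused image per disparity value
lf = np.random.rand(9, 9, 64, 64, 3)
stack = [refocus(lf, a) for a in np.linspace(-1.5, 1.5, 8)]
```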
Download FlyingThings3D_FS under ./data/FlyingThings3D/ and unzip the dataset.
Download Middlebury_FS under 'data/Middlebury' and unzip the dataset.
Download the Mobile Depth dataset under 'data/Mobile_Depth_Gen' and run the following commands under ./data/Mobile_Depth_Gen:
mkdir Photos_Calibration_Results
mv depth_from_focus_data2.zip Photos_Calibration_Results
cd Photos_Calibration_Results
unzip ./depth_from_focus_data2.zip
mv calibration/metal calibration/metals
mv calibration/GT calibration/zeromotion
mv calibration/GTSmall calibration/smallmotion
mv calibration/GTLarge calibration/largemotion
cd ..
unzip depth_from_focus_data3.zip
If you find this work useful for your research, please consider citing our papers:
@article{XIE2025110238,
title = {Multi-focus image fusion with visual state space model and dual adversarial learning},
journal = {Computers and Electrical Engineering},
volume = {123},
pages = {110238},
year = {2025},
issn = {0045-7906},
doi = {10.1016/j.compeleceng.2025.110238},
url = {https://www.sciencedirect.com/science/article/pii/S0045790625001818},
author = {Xinzhe Xie and Buyu Guo and Peiliang Li and Shuangyan He and Sangjun Zhou},
keywords = {Deep learning, Multi-focus image fusion, Adversarial learning, Fine-tuning technology, Visual state space model}
}
@article{xie2025stackmff,
title={StackMFF: end-to-end multi-focus image stack fusion network},
author={Xie, Xinzhe and Jiang, Qingyan and Chen, Dong and Guo, Buyu and Li, Peiliang and Zhou, Sangjun},
journal={Applied Intelligence},
volume={55},
number={6},
pages={503},
year={2025},
publisher={Springer}
}
@article{xie2024swinmff,
title={SwinMFF: toward high-fidelity end-to-end multi-focus image fusion via swin transformer-based network},
author={Xie, Xinzhe and Guo, Buyu and Li, Peiliang and He, Shuangyan and Zhou, Sangjun},
journal={The Visual Computer},
pages={1--24},
year={2024},
publisher={Springer}
}
@inproceedings{xie2024underwater,
title={Underwater Three-Dimensional Microscope for Marine Benthic Organism Monitoring},
author={Xie, Xinzhe and Guo, Buyu and Li, Peiliang and Jiang, Qingyan},
booktitle={OCEANS 2024-Singapore},
pages={1--4},
year={2024},
organization={IEEE}
}