This repository offers the official implementation of InteractReID in PyTorch.
We introduce a novel interactive person retrieval framework for Sketch ReID. Inspired by CLIP's powerful cross-modal semantic alignment capability, we perform Task-oriented Knowledge Adaptation to transfer knowledge from pre-trained CLIP to the downstream Sketch ReID task. Then, to enable interactive sketch person retrieval with the user's text feedback, we leverage CLIP's joint vision-text embedding space to learn a pseudo-word token that accurately captures the sketch's semantics, achieving explicit sketch-text compositionality for optimal composed semantic mining.
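The sketch-text composition idea can be sketched roughly as follows. This is an illustrative PyTorch snippet, not the paper's exact design: the module structure, feature dimensions, and insertion position are all assumptions made for demonstration.

```python
import torch
import torch.nn as nn

class SketchTokenLearner(nn.Module):
    """Illustrative module: maps a sketch's global image feature to a
    single pseudo-word token living in the text token embedding space.
    Dimensions (512) are assumptions; they would follow the CLIP variant used."""
    def __init__(self, feat_dim=512, token_dim=512):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(feat_dim, feat_dim),
            nn.ReLU(),
            nn.Linear(feat_dim, token_dim),
        )

    def forward(self, sketch_feat):
        # sketch_feat: (B, feat_dim) global feature from the image encoder
        return self.proj(sketch_feat)  # (B, token_dim) pseudo-word token

def compose(pseudo_token, text_tokens, insert_pos=1):
    # Splice the pseudo-word token into the text token sequence, so the
    # text encoder effectively reads "a <sketch> wearing a red jacket ...".
    # text_tokens: (B, L, token_dim); returns (B, L + 1, token_dim).
    return torch.cat(
        [text_tokens[:, :insert_pos],
         pseudo_token.unsqueeze(1),
         text_tokens[:, insert_pos:]],
        dim=1,
    )

learner = SketchTokenLearner()
sketch_feat = torch.randn(2, 512)       # dummy sketch features
text_tokens = torch.randn(2, 10, 512)   # dummy feedback-text token embeddings
composed = compose(learner(sketch_feat), text_tokens)
print(composed.shape)  # torch.Size([2, 11, 512])
```

The composed sequence can then be fed through the text encoder to obtain a single query embedding that fuses sketch content with the user's textual feedback.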
- All experiments are conducted on Nvidia RTX 4090 (24GB) GPUs.
- Python = 3.8
- The required packages are listed in `requirements.txt`. You can install them using:

```
pip install -r requirements.txt
```
- Download the CUHK-PEDES dataset from here, the ICFG-PEDES dataset from here, and the RSTPReid dataset from here. Tri-PEDES is a combination of CUHK-PEDES, ICFG-PEDES, and RSTPReid.
- Download the annotation json files from here.
- Download the pretrained CLIP checkpoint from here and save it in path `checkpoint/`.
- CUHK-PEDES

Organize them in your dataset folder as follows:

```
|-- dataset/
|   |-- CUHK-PEDES/
|       |-- imgs
|           |-- cam_a
|           |-- cam_b
|           |-- ...
|       |-- train_reid.json
|       |-- test_reid.json
|       |-- val_reid.json
|-- others/
```

- ICFG-PEDES
Organize them in your dataset folder as follows:
```
|-- dataset/
|   |-- ICFG-PEDES/
|       |-- imgs
|           |-- test
|           |-- train
|       |-- train_reid.json
|       |-- test_reid.json
|       |-- val_reid.json
|-- others/
```

- RSTPReid
Organize them in your dataset folder as follows:
```
|-- dataset/
|   |-- RSTPReid/
|       |-- imgs
|       |-- train_reid.json
|       |-- test_reid.json
|       |-- val_reid.json
|-- others/
```
In `config/TriPEDES_pretrain.yaml`, set the dataset path and the CLIP checkpoint path.
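A hypothetical fragment of what this configuration might look like is shown below; the actual key names may differ in the released file, so check `config/TriPEDES_pretrain.yaml` itself for the real field names.

```yaml
# Illustrative only -- key names are assumptions, not the file's real schema.
dataset_root: /path/to/dataset
clip_checkpoint: checkpoint/your_clip_model.pt
```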
You can start the fine-tuning process of Task-oriented Knowledge Adaptation by using the following command:
```
bash adaptation.sh
```

After knowledge adaptation for CLIP, you can train the vision-to-text converting network by using the following command:
```
bash tokenlearning.sh
```

If you need to test your trained model directly, you can use the following command:
```
bash model_test.sh
```

If you find this paper useful, please consider starring 🌟 this repo and citing 📑 our paper:
```bibtex
@inproceedings{InteractReID,
  author    = {Wu, Xinyi and Chen, Cuiqun and Zeng, Hui and Cai, Zhiping and Du, Bo and Ye, Mang},
  title     = {Interactive Sketch-based Person Re-Identification with Text Feedback},
  year      = {2025},
  booktitle = {International Conference on Multimedia and Expo (ICME)},
  numpages  = {9},
}
```
This code is distributed under the MIT License.
