
question about Model D training #11

Closed
jetyingjia opened this issue Jan 29, 2024 · 5 comments

@jetyingjia

Awesome work, congratulations!
I have some questions about the Model D training.
1. In this model, you pre-train with [Mask, Concept] pairs. Does "concept" mean the text embeddings (2,560 categories)? If so, how are concepts obtained for the 1B masks?
2. The paper mentions 2.25TB of image embeddings. How is this data used?

@PhyscalX
Collaborator

PhyscalX commented Jan 29, 2024

Hi, @jetyingjia

  1. Each mask has a pre-computed image embedding, which is used to encode the (log) target via encode_tgt(...); see the sketch after this list.
  2. The 2.25TB image embedding database contains 1B embeddings for the 1B masks, used as described in 1.
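
Here is a minimal sketch of that target-encoding step, assuming the target is a temperature-scaled softmax of similarities between a mask's image embedding and the 2,560 concept (text) embeddings. The names below (encode_concept_target, text_embeds, tau) are illustrative, not the repo's actual encode_tgt(...) signature:

```python
import torch
import torch.nn.functional as F

def encode_concept_target(mask_embeds, text_embeds, tau=0.01):
    """Project pre-computed image embeddings onto the concept vocabulary
    and return soft target distributions over the 2,560 concepts.

    mask_embeds: (B, D) embeddings looked up from the offline database.
    text_embeds: (2560, D) concept (text) embeddings.
    tau: softmax temperature; 0.01 is an assumed value, not from the paper.
    """
    mask_embeds = F.normalize(mask_embeds, dim=-1)
    text_embeds = F.normalize(text_embeds, dim=-1)
    logits = mask_embeds @ text_embeds.t() / tau  # (B, 2560) similarities
    return F.softmax(logits, dim=-1)              # soft targets for a KL loss
```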

BTW, it would take roughly 60 days to compute 1B EVA-CLIP-E embeddings using 8 NVIDIA A100s 😅.
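
For scale, that estimate works out to roughly 24 embeddings per GPU-second:

```python
# Back-of-envelope check of the 60-day figure (numbers from the comment above).
num_embeddings = 1_000_000_000       # one EVA-CLIP-E embedding per mask
gpu_seconds = 8 * 60 * 86_400        # 8 A100s for 60 days
print(num_embeddings / gpu_seconds)  # ~24 embeddings per GPU-second
```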

@jetyingjia
Author

Hi, @PhyscalX
1. So Model D's classification branch target is the concept distribution (the image embedding projected to 2,560-dimensional logits), not region pseudo labels (which many papers use, e.g. OWL)?
2. Is the idea of learning a concept distribution used or recommended by other papers?
Thank you!

@PhyscalX
Collaborator

  1. We have clarified in Sec. 3.1 that we use a KL divergence loss; a minimal sketch follows this list.
  2. This method is used by many CLIP-based distillation papers (e.g. RegionCLIP, a modified Faster R-CNN
    for open-vocabulary classification). However, it is challenging to integrate this method into SAM with 1B masks.
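
A minimal sketch of such a KL objective in standard PyTorch, assuming soft targets like the ones sketched earlier in the thread (illustrative, not the repo's released loss code):

```python
import torch
import torch.nn.functional as F

def concept_kl_loss(pred_logits, target_dist):
    """KL divergence between the classification branch's predicted concept
    distribution and the pre-computed CLIP-derived target distribution.

    pred_logits: (B, 2560) raw logits from the model's concept head.
    target_dist: (B, 2560) soft targets (probabilities summing to 1).
    """
    log_pred = F.log_softmax(pred_logits, dim=-1)
    # F.kl_div expects log-probabilities as input and probabilities as target.
    return F.kl_div(log_pred, target_dist, reduction="batchmean")
```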

@jetyingjia
Author

@PhyscalX
Good idea. Do you have a plan to release the full project (including training)? I would like to fine-tune this model on my own datasets.

@PhyscalX
Collaborator

Refer to issue #5; currently, we have no plan to release the full code.
Instead, we have released the visual prompter and the losses for pre-training and fine-tuning.
