Thank you for your excellent work and the insights shared in your research. I have been exploring the Semantics-Guided Geometry Sampler, and I couldn't find any released code related to this component.
If the code is already available, could you kindly advise me on where I might find it? If it hasn’t been released yet, I’d greatly appreciate it if you could share any updates on its availability. I suspect the implementation might reside in the build_vision_tower function within the multimodal_encoder module. However, it seems that the multimodal_encoder file is not provided in the available resources.
Looking forward to your response. Thank you again for your time and contribution to the field.