-
ManipTrans Public
Forked from ManipTrans/ManipTransPython GNU General Public License v3.0 UpdatedMar 31, 2025 -
Qwen2.5-Omni Public
Forked from QwenLM/Qwen2.5-OmniQwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Jupyter Notebook Apache License 2.0 UpdatedMar 30, 2025 -
ProLIP-1 Public
Forked from astra-vision/ProLIPCLIP's Visual Embedding Projector is a Few-shot Cornucopia
Shell UpdatedMar 30, 2025 -
dita-ot Public
Forked from dita-ot/dita-otDITA Open Toolkit — the open-source publishing engine for content authored in the Darwin Information Typing Architecture.
Java Apache License 2.0 UpdatedMar 28, 2025 -
Skip-DiT Public
Forked from OpenSparseLLMs/Skip-DiT✈️ Accelerating Vision Diffusion Transformers with Skip Branches.Python Apache License 2.0 UpdatedMar 28, 2025 -
-
-
deep-searcher Public
Forked from zilliztech/deep-searcherOpen Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Python Apache License 2.0 UpdatedMar 27, 2025 -
EVolSplat Public
Forked from Miaosheng1/EVolSplatofficial code of CVPR2025 Evolsplat
Python Other UpdatedMar 27, 2025 -
OpenSDI Public
Forked from iamwangyabin/OpenSDIOfficial repository for CVPR 2025 paper: OpenSDI: Spotting Diffusion-Generated Images in the Open World
Python UpdatedMar 26, 2025 -
LPOSS Public
Forked from vladan-stojnic/LPOSSCode for LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation (CVPR2025)
Python MIT License UpdatedMar 26, 2025 -
surg-3m Public
Forked from visurg-ai/surg-3mOfficial repository for the paper "Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings".
Python UpdatedMar 26, 2025 -
TokenHSI Public
Forked from liangpan99/TokenHSI[CVPR 2025] TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
UpdatedMar 26, 2025 -
deepfake-detection Public
Forked from yermandy/deepfake-detectionPython MIT License UpdatedMar 26, 2025 -
PanoGS Public
Forked from zhaihongjia/PanoGS[CVPR 2025] PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
Apache License 2.0 UpdatedMar 26, 2025 -
AvatarArtist Public
Forked from ant-research/AvatarArtist[CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.
Python Apache License 2.0 UpdatedMar 26, 2025 -
-
PAVE Public
Forked from dragonlzm/PAVEThis repo holds the implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR2025)
Python UpdatedMar 26, 2025 -
CAFe Public
Forked from haoyu-bu/CAFeCode for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"
Python Apache License 2.0 UpdatedMar 26, 2025 -
-
diffusion-4k Public
Forked from zhang0jhon/diffusion-4k[CVPR 2025] Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
Python UpdatedMar 26, 2025 -
-
-
Change3D Public
Forked from zhuduowang/Change3DThe official code of Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.
Python UpdatedMar 25, 2025 -
BoNeSS-ST Public
Forked from KonstantinPakulev/BoNeSS-ST[CVPR 2025] Good Keypoints for the Two-View Geometry Estimation Problem. The reference implementation of the paper.
UpdatedMar 25, 2025 -
Splat-LOAM Public
Forked from rvp-group/Splat-LOAM2D Gaussian Splatting based LiDAR Odometry And Mapping
BSD 3-Clause "New" or "Revised" License UpdatedMar 25, 2025 -
CoMP-MM Public
Forked from SliMM-X/CoMP-MMOfficial repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"
Python Apache License 2.0 UpdatedMar 25, 2025 -
beyond-accuracy Public
Forked from visinf/beyond-accuracyBeyond Accuracy: What Matters in Designing Well-Behaved Models?
Python Apache License 2.0 UpdatedMar 25, 2025 -
Linear-MoE Public
Forked from OpenSparseLLMs/Linear-MoEPython Apache License 2.0 UpdatedMar 25, 2025 -
NexusGS Public
Forked from USMizuki/NexusGS[CVPR'25] NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting
JavaScript UpdatedMar 25, 2025