Skip to content
@FoundationVision

FoundationVision

Bytedance's opensource FoundationVision models

Welcome to FoundationVision @ ByteDance!

Introduction 👋

Hello! This is the GitHub space for the FoundationVision @ ByteDance.

We are dedicated to exploring the frontiers of multimodal intelligence, with the ultimate goal of building Artificial General Intelligence systems (AGI).

Our research focuses on deep learning and multimodal intelligence. We are particularly interested in:

  • Visual Foundation Models, Generative Pretrained Models and Large Language Models.
  • Multimodal Foundation Models and Representation Learning.
  • Open World Interaction via Unified Multi-modal generation and understanding.
  • Large-scale Multi-modal generative Pretraining and Alignment.

Our group strives to push the boundaries of multimodal intelligence and has produced highly influential works in the field, including:

Popular repositories Loading

  1. VAR VAR Public

    [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

    Jupyter Notebook 8.5k 547

  2. ByteTrack ByteTrack Public

    [ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

    Python 5.8k 1.1k

  3. LlamaGen LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1.9k 91

  4. Infinity Infinity Public

    [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    Python 1.5k 82

  5. GLEE GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1.2k 74

  6. Waver Waver Public

    Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

    752 81

Repositories

Showing 10 of 20 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…