Run Janus by DeepSeek with SkyPilot

SkyPilot is a framework for running AI and batch workloads on any infra, offering unified execution, high cost savings, and high GPU availability.

DeepSeek-Janus on SkyPilot

On Jan 27, 2025, DeepSeek AI released Janus. It outperforms state-of-the-art vision-language models such as LLaVA and supports a variety of vision-language tasks, including image generation and Q&A.

This guide walks through how to run and host the models on any infrastructure, ranging from a local GPU workstation to a Kubernetes cluster or public clouds (15+ clouds supported).

Step 0: Bring any infra

Install SkyPilot on your local machine:

pip install 'skypilot-nightly[all]'

Pick one of the following, depending on the infra you want to run on:

If your local machine or cluster has a GPU: you can run SkyPilot directly on existing machines.

If you want to use clouds (15+ clouds are supported): see the docs for details.
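Before moving on, it can help to confirm which infra SkyPilot can actually reach. The sketch below assumes a POSIX shell; `sky check` is the SkyPilot command that probes credentials for each cloud and Kubernetes, and the `sky_installed` variable is only an illustrative name.

```shell
# Report whether the SkyPilot CLI is installed; if so, list the enabled infra.
if command -v sky >/dev/null 2>&1; then
  sky_installed=yes
  # `sky check` probes credentials for each cloud and for Kubernetes.
  sky check || echo "sky check reported a problem; see its output above" >&2
else
  sky_installed=no
  echo "sky not found on PATH; run: pip install 'skypilot-nightly[all]'" >&2
fi
```

Any cloud or cluster reported as enabled becomes a candidate when launching in Step 1.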

Step 1: Run it with SkyPilot

Now it's time to run Janus with SkyPilot. Commands may vary based on the GPUs available to you.

Run Janus (1.5B) with:

sky launch janus_1.5b.yaml \
  -c janus \
  --env HF_TOKEN=YOUR_HUGGING_FACE_API_TOKEN 

or run Janus Pro (7B) with:

sky launch januspro_7b.yaml \
  -c janus \
  --env HF_TOKEN=YOUR_HUGGING_FACE_API_TOKEN 

The launch command lists the machines available to you, with hourly pricing, and marks the one it chose:

...

Considered resources (1 node):
-----------------------------------------------------------------------------------------------------------------
 CLOUD        INSTANCE                   vCPUs   Mem(GB)   ACCELERATORS   REGION/ZONE        COST ($)   CHOSEN   
-----------------------------------------------------------------------------------------------------------------
 Kubernetes   4CPU--16GB--1L4             4       16        L4:1           gke-cluster        0.00          ✔    
 RunPod       1x_L4_SECURE               4       24        L4:1           CA                 0.44                
 GCP          g2-standard-4              4       16        L4:1           us-east4-a         0.70                
 AWS          g6.xlarge                  4       16        L4:1           us-east-1          0.80                
 AWS          g5.xlarge                  4       16        A10G:1         us-east-1          1.01                
 Fluidstack   A100_PCIE_80GB::1          28      120       A100-80GB:1    ARIZONA_USA        1.80                
 RunPod       1x_A100-80GB_SECURE        8       80        A100-80GB:1    CA                 1.99                
 Paperspace   A100-80G                   12      80        A100-80GB:1    East Coast (NY2)   3.18                
 Azure        Standard_NV36ads_A10_v5    36      440       A10:1          eastus             3.20                
 Azure        Standard_NC24ads_A100_v4   24      220       A100-80GB:1    eastus             3.67                
 GCP          a2-ultragpu-1g             12      170       A100-80GB:1    us-central1-a      5.03                
 Azure        Standard_ND96asr_v4        96      900       A100:8         eastus             27.20               
 GCP          a2-highgpu-8g              96      680       A100:8         us-central1-a      29.39               
 AWS          p4d.24xlarge               96      1152      A100:8         us-east-1          32.77               
-----------------------------------------------------------------------------------------------------------------
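The launch commands in Step 1 pass the token inline. As an alternative, here is a minimal sketch of a wrapper that forwards `HF_TOKEN` from your local environment, relying on the bare `--env NAME` form of `sky launch`. The function name `launch_janus` and the `SKY_BIN` override are hypothetical and are not part of SkyPilot.

```shell
# Hypothetical helper, not part of SkyPilot: launch a Janus task YAML while
# forwarding HF_TOKEN from the local environment.
# SKY_BIN is an assumed override; set SKY_BIN=echo to print the command
# instead of running it.
launch_janus() {
  yaml="$1"
  if [ -z "${HF_TOKEN:-}" ]; then
    echo "HF_TOKEN is not set; export your Hugging Face token first" >&2
    return 1
  fi
  # Make sure the token is visible to the sky process.
  export HF_TOKEN
  # A bare `--env HF_TOKEN` tells sky to copy the local value to the cluster.
  ${SKY_BIN:-sky} launch "$yaml" -c janus --env HF_TOKEN
}
```

For example, `launch_janus janus_1.5b.yaml` launches the 1.5B model, and `SKY_BIN=echo` turns the call into a dry run. Failing early on a missing token avoids provisioning a machine only to have the model download fail.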

Step 2: Access the deployed server

Once the server is up, the launch logs in your terminal print a public Gradio URL that you can open directly:

Running on public URL: https://xxxxxx.gradio.live

Alternatively, get the IP address of the deployed instance with:

echo `sky status --ip janus`
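To turn that IP into something you can paste into a browser, a small helper along these lines may be convenient. This is a sketch under assumptions: `janus_url` is a hypothetical name, 7860 is Gradio's default port, and the task YAML may use a different port or may not open it to the outside at all.

```shell
# Hypothetical helper: build a URL for the Gradio app from an IP address.
# The port defaults to 7860 (Gradio's default); pass a second argument
# if the deployment listens elsewhere.
janus_url() {
  ip="$1"
  port="${2:-7860}"
  printf 'http://%s:%s\n' "$ip" "$port"
}

# Usage (requires a running cluster named "janus"):
#   janus_url "$(sky status --ip janus)"
```

If the resulting URL does not respond, the public `gradio.live` link from the launch logs is the simpler route.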

Example Prompts

A blue sky, vast fluffy clouds surrounding, enchanting, immortal, dynamic motion, cinematic, Unreal Engine 5 and Octane Render, highly detailed, photorealistic, natural colors, epic atmosphere, and breathtaking realism
