Skip to content

Commit f29cf6f

Browse files
authored
Merge branch 'main' into promptless/add-serverless-integrations-page
2 parents 562a8f3 + 74776a3 commit f29cf6f

27 files changed

+778
-369
lines changed

hub/public-endpoint-reference.mdx

Lines changed: 109 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ The following models are currently available:
1313

1414
| Model | Description | Endpoint URL | Type | Price |
1515
|-------|-------------|----------|------|-------|
16+
| **IBM Granite-4.0-H-Small** | A 32B parameter long-context instruct model. | `https://api.runpod.ai/v2/granite-4-0-h-small/` | Text | \$0.01 per 1000 tokens |
1617
| **Qwen3 32B AWQ** | The latest LLM in the Qwen series, offering advancements in reasoning, instruction-following, agent capabilities, and multilingual support. | `https://api.runpod.ai/v2/qwen3-32b-awq/` | Text | \$0.01 per 1000 tokens |
1718
| **Flux Dev** | Offers exceptional prompt adherence, high visual fidelity, and rich image detail. | `https://api.runpod.ai/v2/black-forest-labs-flux-1-dev/` | Image | \$.02 per megapixel |
1819
| **Flux Schnell** | Fastest and most lightweight FLUX model, ideal for local development, prototyping, and personal use. | `https://api.runpod.ai/v2/black-forest-labs-flux-1-schnell/` | Image | \$.0024 per megapixel |
@@ -24,7 +25,11 @@ The following models are currently available:
2425
| **Seedream 4.0 Edit** | New-generation image editing with unified generation and editing architecture. | `https://api.runpod.ai/v2/seedream-v4-edit/` | Image | \$0.027 per megapixel
2526
| **Seedream 3.0** | Native high-resolution bilingual image generation (Chinese-English). | `https://api.runpod.ai/v2/seedream-3-0-t2i/` | Image | \$0.03 per megapixel
2627
| **Nano Banana Edit** | Google's state-of-the-art image editing model. | `https://api.runpod.ai/v2/nano-banana-edit/` | Image | \$0.027 per megapixel
28+
| **InfiniteTalk** | Audio-driven video generation model that creates talking or singing videos from a single image and audio input. | `https://api.runpod.ai/v2/infinitetalk/` | Video | \$0.25 per video generation |
29+
| **Kling v2.1 I2V Pro** | Professional-grade image-to-video with enhanced visual fidelity. | `https://api.runpod.ai/v2/kling-v2-1-i2v-pro/` | Video | \$0.36 per 5 seconds of video |
2730
| **Seedance 1.0 Pro** | High-performance video generation with multi-shot storytelling. | `https://api.runpod.ai/v2/seedance-1-0-pro/` | Video | \$0.62 per 5 seconds of video |
31+
| **SORA 2 I2V** | OpenAI's Sora 2 is a video and audio generation model. | `https://api.runpod.ai/v2/sora-2-i2v/` | Video | \$0.40 per video generation |
32+
| **SORA 2 Pro I2V** | OpenAI's Sora 2 Pro is a professional-grade video and audio generation model. | `https://api.runpod.ai/v2/sora-2-pro-i2v/` | Video | \$1.20 per video generation |
2833
| **WAN 2.5** | Image-to-video generation model. | `https://api.runpod.ai/v2/wan-2-5/` | Video | \$0.50 per 5 seconds of video
2934
| **WAN 2.2 I2V 720p LoRA** | Open-source video generation with LoRA support. | `https://api.runpod.ai/v2/wan-2-2-t2v-720-lora/` | Video | \$0.35 per 5 seconds of video
3035
| **WAN 2.2 I2V 720p** | Open-source AI video generation model that uses a diffusion transformer architecture for image-to-video generation. | `https://api.runpod.ai/v2/wan-2-2-i2v-720/` | Video | \$0.30 per 5 seconds of video
@@ -101,6 +106,43 @@ Flux Schnell is optimized for speed and real-time applications:
101106
Flux Schnell is optimized for speed and works best with lower step counts. Using higher values may not improve quality significantly.
102107
</Warning>
103108

109+
### IBM Granite-4.0-H-Small
110+
111+
IBM Granite-4.0-H-Small is a 32B parameter long-context instruct model.
112+
113+
```json
114+
{
115+
"input": {
116+
"messages": [
117+
{
118+
"role": "system",
119+
"content": "You are a helpful assistant. Please ensure responses are professional, accurate, and safe."
120+
},
121+
{
122+
"role": "user",
123+
"content": "What is Runpod?"
124+
}
125+
],
126+
"sampling_params": {
127+
"max_tokens": 512,
128+
"temperature": 0.7,
129+
"seed": -1,
130+
"top_k": -1,
131+
"top_p": 1
132+
}
133+
}
134+
}
135+
```
136+
137+
| Parameter | Type | Required | Default | Range | Description |
138+
|-----------|------|----------|---------|-------|-------------|
139+
| `messages` | array | Yes | - | - | Array of message objects with role and content. |
140+
| `sampling_params.max_tokens` | integer | No | 512 | - | Maximum number of tokens to generate. |
141+
| `sampling_params.temperature` | float | No | 0.7 | 0.0-1.0 | Controls randomness in generation. Lower values make output more deterministic. |
142+
| `sampling_params.seed` | integer | No | -1 | - | Seed for reproducible results. The default value (-1) will generate a random seed. |
143+
| `sampling_params.top_k` | integer | No | -1 | - | Restricts sampling to the top K most probable tokens. |
144+
| `sampling_params.top_p` | float | No | 1 | 0.0-1.0 | Nucleus sampling threshold. |
145+
104146
### Qwen3 32B AWQ
105147

106148
Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
@@ -394,6 +436,30 @@ Seedream 4.0 Edit provides advanced image editing capabilities with the same uni
394436
| `enable_safety_checker` | boolean | No | true | Enable content safety checking. |
395437

396438

439+
### InfiniteTalk
440+
441+
InfiniteTalk is an audio-driven video generation model that creates talking or singing videos from a single image and audio input.
442+
443+
```json
444+
{
445+
"input": {
446+
"prompt": "a cartoon computer talking",
447+
"image": "https://image.runpod.ai/assets/meigen-ai/poddy.jpg",
448+
"audio": "https://image.runpod.ai/assets/meigen-ai/audio.wav",
449+
"size": "480p",
450+
"enable_safety_checker": true
451+
}
452+
}
453+
```
454+
455+
| Parameter | Type | Required | Default | Description |
456+
|-----------|------|----------|---------|-------------|
457+
| `prompt` | string | Yes | - | Text description of the desired video. |
458+
| `image` | string | Yes | - | URL of the source image to animate. |
459+
| `audio` | string | Yes | - | URL of the audio file to drive the animation. |
460+
| `size` | enum | Yes | "480p" | Output video resolution. Valid options are `480p` and `720p`. |
461+
| `enable_safety_checker` | boolean | No | true | Enable content safety checking. |
462+
397463
### Kling v2.1 I2V Pro
398464

399465
Kling 2.1 Pro generates videos from static images with additional control parameters.
@@ -445,6 +511,48 @@ Seedance 1.0 Pro is a high-performance video generation model with multi-shot st
445511
| `size` | string | No | "1920x1080" | Video dimensions. |
446512
| `image` | string | No | "" | Optional source image URL for image-to-video generation. |
447513

514+
### SORA 2 I2V
515+
516+
OpenAI's Sora 2 is a video and audio generation model.
517+
518+
```json
519+
{
520+
"input": {
521+
"prompt": "Action: The mechs headlamps flicker a few times. It then slowly and laboriously pushes itself up with a damaged mechanical arm, sparks flying from gaps in its armor. Ambient Sound: Distant, continuous explosions (low rumbles); the sizzle and crackle of short-circuiting electricity from within the mech; heavy, grinding metallic sounds as the mech rises; faint, garbled static from a damaged comms unit.Character Dialogue: (Processed mechanical voice, weary but firm) `No retreat.`",
522+
"image": "https://image.runpod.ai/assets/sora-2-i2v/example.jpeg",
523+
"duration": 4
524+
}
525+
}
526+
```
527+
528+
| Parameter | Type | Required | Default | Description |
529+
|-----------|------|----------|---------|-------------|
530+
| `prompt` | string | Yes | - | Text description of the desired video, including action, ambient sound, and character dialogue. |
531+
| `image` | string | Yes | - | URL of the source image to animate. |
532+
| `duration` | integer | Yes | 4 | Video duration in seconds. Valid options: 4, 8, or 12. |
533+
534+
### SORA 2 Pro I2V
535+
536+
OpenAI's Sora 2 Pro is a professional-grade video and audio generation model.
537+
538+
```json
539+
{
540+
"input": {
541+
"prompt": "Action: She opened her hands\n\nAmbient Sound: The soft crackling of the dying fire in the oven; a high-pitched, happy little ding sound from the timer; the warm, persistent sizzle of butter melting on a nearby stovetop.\n\nCharacter Dialogue: (Voice is high-pitched, bubbly, and enthusiastic) \"Welcome to my bakery!\"\n\n\n",
542+
"image": "https://image.runpod.ai/assets/sora-2-pro-i2v/example.jpeg",
543+
"size": "720p",
544+
"duration": 4
545+
}
546+
}
547+
```
548+
549+
| Parameter | Type | Required | Default | Description |
550+
|-----------|------|----------|---------|-------------|
551+
| `prompt` | string | Yes | - | Text description of the desired video, including action, ambient sound, and character dialogue. |
552+
| `image` | string | Yes | - | URL of the source image to animate. |
553+
| `size` | string | No | "720p" | Output video resolution. |
554+
| `duration` | integer | Yes | 4 | Video duration in seconds. Valid options: 4, 8, or 12. |
555+
448556
### Whisper V3 Large
449557

450558
Whisper V3 Large is a state-of-the-art automatic speech recognition model that transcribes audio to text.
@@ -737,4 +845,4 @@ An open-source video generation model for creating 720p videos from text prompts
737845
| `flow_shift` | integer | No | 5 | - | Controls the motion flow in the generated video. |
738846
| `seed` | integer | No | -1 | - | Provide a seed for reproducible results. The default value (-1) will generate a random seed. |
739847
| `enable_prompt_optimization` | boolean | No | false | - | Whether to automatically optimize the prompt. |
740-
| `enable_safety_checker` | boolean | No | true | - | Whether to run safety checks on the output. |
848+
| `enable_safety_checker` | boolean | No | true | - | Whether to run safety checks on the output. |

pods/choose-a-pod.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: Choose a Pod
3-
description: "Learn how to select the right Pod by evaluating your resource requirements."
3+
description: "Select the right Pod by evaluating your resource requirements."
44
sidebar_position: 3
55
---
66

0 commit comments

Comments
 (0)