[WIP][cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch #2095

anton-ubi · 2025-12-04T17:59:09Z

Problem
Hosts with GPU capabilities (ALLOW_GPU=true) incorrectly reject frames that have zero GPU memory requirements. This prevents efficient resource utilization where GPU-enabled hosts should accept both GPU and CPU-only workloads.

Root Cause
The getGpuJobs() method in CoreUnitDispatcher calls removeGpu() when no GPU-specific jobs are found. This method reduces all host resources (CPU cores, memory, GPU resources) to "reserve space for future GPU frames." When normal frames are subsequently evaluated, they fail resource checks due to these artificially reduced limits.

The problematic flow:

Look for GPU jobs → none found
Call removeGpu() → reduces idleCores (-100) and idleMemory (-4GB)
Evaluate normal frames → rejected due to insufficient resources
Call restoreGpu() → too late, frames already rejected

Solution
Add a configuration property dispatcher.gpu.skip_resource_reservation that disables GPU resource reservation when set to true. Default is false to maintain backward compatibility.

Configuration
The new behavior is controlled by environment variable CUEBOT_DISPATCHER_SKIP_GPU_RESERVATION:

Default (false): Preserves existing behavior
Skip (true): Disables resource reservation, allows full resource utilization

Recap

Before fix: GPU hosts may reject CPU-only frames despite having sufficient resources
After fix: GPU hosts efficiently handle both GPU and CPU-only workloads when optimization enabled.

Backward compatibility is preserved.
This resolves the counter-intuitive behavior where GPU-capable hosts reject valid CPU-only frames.

DiegoTavares · 2025-12-15T18:59:54Z

Looking good so far. It looks like you might have forgotten to add the new property to the test/resource's opencue.properties:

nested exception is org.springframework.beans.BeanInstantiationException: Failed to instantiate [com.imageworks.spcue.dispatcher.CoreUnitDispatcher]: Constructor threw exception; nested exception is java.lang.IllegalStateException: Required key 'dispatcher.gpu.skip_resource_reservation' not found

anton-ubi · 2025-12-20T08:27:32Z

It's now ready for review. Thanks for the hint.

DiegoTavares · 2026-01-05T16:24:47Z

I think you might have accidentally marked this PR as draft again.

anton-ubi · 2026-01-05T18:49:02Z

It's actually on purpose. I've been testing it again on our end lately and I'm not seeing the expected behavior it was supposed to fix. I'll keep you posted of any advancement.

Apologies for the noise.

anton-ubi added 2 commits December 4, 2025 12:57

skip_resource_reservation

e2b7b1c

Merge branch 'master' into skip_gpu_reservation

a5df98e

anton-ubi changed the title ~~skip_resource_reservation~~ Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch Dec 4, 2025

anton-ubi added 2 commits December 5, 2025 11:19

Merge branch 'master' into skip_gpu_reservation

38e07bb

Merge branch 'master' into skip_gpu_reservation

ee3bed2

anton-ubi added 2 commits December 20, 2025 03:02

add dispatcher.gpu.skip_resource_reservation to test properties

992427c

Merge branch 'master' into skip_gpu_reservation

e701ef2

anton-ubi changed the title ~~Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch~~ [cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch Dec 20, 2025

anton-ubi marked this pull request as ready for review December 20, 2025 08:18

anton-ubi requested review from DiegoTavares, lithorus and ramonfigueiredo as code owners December 20, 2025 08:18

Merge branch 'master' into skip_gpu_reservation

fa61560

anton-ubi marked this pull request as draft December 25, 2025 00:22

DiegoTavares approved these changes Jan 5, 2026

View reviewed changes

anton-ubi changed the title ~~[cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch~~ [WIP][cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch Jan 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP][cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch #2095

[WIP][cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch #2095

Uh oh!

anton-ubi commented Dec 4, 2025 •

edited

Loading

Uh oh!

DiegoTavares commented Dec 15, 2025

Uh oh!

anton-ubi commented Dec 20, 2025

Uh oh!

DiegoTavares commented Jan 5, 2026

Uh oh!

anton-ubi commented Jan 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[WIP][cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch #2095

Are you sure you want to change the base?

[WIP][cuebot][FIX] Fix GPU Resource Reservation Preventing Non-GPU Frame Dispatch #2095

Uh oh!

Conversation

anton-ubi commented Dec 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DiegoTavares commented Dec 15, 2025

Uh oh!

anton-ubi commented Dec 20, 2025

Uh oh!

DiegoTavares commented Jan 5, 2026

Uh oh!

anton-ubi commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

anton-ubi commented Dec 4, 2025 •

edited

Loading

anton-ubi commented Jan 5, 2026 •

edited

Loading