memory : fix broken batch splits for recurrent cache #14575

Merged · 1 commit merged into master on Jul 8, 2025

Conversation

compilade
Collaborator

Splits producing more than one ubatch per batch for recurrent models were broken with #14512.

This could cause segfaults and possibly other problems when using any ubatch size smaller than a processed batch with a recurrent model (e.g. Mamba, Mamba-2, etc.).

(I first noticed this when getting a SEGFAULT with Mamba after updating #14139 to a commit after #14512 was merged)

This fixes it by moving the completeness check after the ubatch split loop.
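To illustrate the fix, here is a minimal sketch (not the actual llama.cpp code; `split_batch`, its parameters, and the returned ubatch-size list are hypothetical) of the pattern described above: a loop splits a batch into ubatches, and the completeness check runs only after the loop, so batches that need more than one ubatch are not rejected mid-split.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical sketch of the split pattern: consume a batch of n_tokens
// in ubatches of at most n_ubatch tokens each, returning the ubatch sizes.
static std::vector<uint32_t> split_batch(uint32_t n_tokens, uint32_t n_ubatch) {
    std::vector<uint32_t> ubatch_sizes;
    uint32_t n_done = 0;
    while (n_done < n_tokens) {
        const uint32_t n = std::min(n_ubatch, n_tokens - n_done);
        ubatch_sizes.push_back(n);
        n_done += n;
    }
    // The completeness check belongs here, after the split loop.
    // Checking it inside the loop would fail on any batch that
    // produces more than one ubatch, since the batch is only
    // partially consumed on the earlier iterations.
    assert(n_done == n_tokens);
    return ubatch_sizes;
}
```

For example, a batch of 10 tokens with a ubatch size of 4 splits into three ubatches of sizes 4, 4, and 2; only after the third iteration is the batch fully consumed.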


Splits producing more than one ubatch per batch for recurrent models
were broken with #14512.

This fixes it by moving the completeness check after the ubatch split loop.
@compilade compilade requested a review from ggerganov July 8, 2025 01:28
@compilade compilade added the bugfix fixes an issue or bug label Jul 8, 2025
@ggerganov ggerganov merged commit bb4f7a9 into master Jul 8, 2025
48 checks passed
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Jul 8, 2025
* origin/master:
model : fix hunyuan moe chat template (ggml-org#14584)
model : add SmolLM3 (ggml-org#14581)
memory : fix broken batch splits for recurrent cache (ggml-org#14575)
vulkan : fix rope with partial rotation and non-cont src (ggml-org#14582)
server: Add ability to mount server at prefix (ggml-org#14544)
model : add hunyuan moe (ggml-org#14425)
vulkan: increase timeout for CI (ggml-org#14574)
cuda : fix rope with partial rotation and non-cont src (ggml-org#14580)
CUDA: add bilinear interpolation for upscale (ggml-org#14563)
musa: fix build warnings (unused variable) (ggml-org#14561)
llama : fix incorrect minicpm3 v_states shape (ggml-org#14571)
llama : remove ggml_cont where possible (ggml-org#14568)
qnixsynapse pushed a commit to menloresearch/llama.cpp that referenced this pull request Jul 10, 2025
Splits producing more than one ubatch per batch for recurrent models
were broken with ggml-org#14512.

This fixes it by moving the completeness check after the ubatch split loop.