-
Notifications
You must be signed in to change notification settings - Fork 1
Pull requests: auroralabs-loci/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
UPSTREAM PR #22129: Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE
#1361
opened Apr 20, 2026 by
loci-dev
Loading…
UPSTREAM PR #22102: fix: GLM-DSA crash in llama-tokenize when using vocab_only
#1360
opened Apr 19, 2026 by
loci-dev
Loading…
UPSTREAM PR #22101: mtmd: add granite-speech support (ibm-granite/granite-4.0-1b-speech)
#1359
opened Apr 19, 2026 by
loci-dev
Loading…
UPSTREAM PR #22070: ggml-cuda: gate native ue4m3 conversion to sm_90+
#1358
opened Apr 18, 2026 by
loci-dev
Loading…
UPSTREAM PR #22066: sycl: Battlemage (BMG) optimizations — AOT, Q5_K reorder, PAD stride fix, new ops, oneMKL routing
#1357
opened Apr 18, 2026 by
loci-dev
Loading…
UPSTREAM PR #21647: ci: add android arm64 build and release
#1355
opened Apr 17, 2026 by
loci-dev
Loading…
UPSTREAM PR #21071: hexagon: optimize HMX matmul operations
#1351
opened Apr 15, 2026 by
loci-dev
Loading…
UPSTREAM PR #21652: Prevent the sum of the dequantized activation in q8_1 from overflowing
#1350
opened Apr 15, 2026 by
loci-dev
Loading…
UPSTREAM PR #21870: common: skip reasoning budget sampler when no budget is requested
#1349
opened Apr 14, 2026 by
loci-dev
Loading…
UPSTREAM PR #21821: llama : add --hugepages for HugeTLB-backed weight loading (Linux)
#1347
opened Apr 13, 2026 by
loci-dev
Loading…
UPSTREAM PR #21554: hexagon: optimization for HMX mat_mul
#1346
opened Apr 12, 2026 by
loci-dev
Loading…
UPSTREAM PR #21787: vulkan: fix output corruption on GCN 2.0/3.0 (Vulkan 1.2)
#1345
opened Apr 12, 2026 by
loci-dev
Loading…
UPSTREAM PR #21753: vulkan: Support asymmetric FA in coopmat2 path
#1344
opened Apr 11, 2026 by
loci-dev
Loading…
UPSTREAM PR #21344: gfx1151 nwarps, tile sizing to curb VGPR pressure
#1342
opened Apr 10, 2026 by
loci-dev
Loading…
UPSTREAM PR #21431: vulkan: Tweak Xe2 warptile configuration
#1341
opened Apr 10, 2026 by
loci-dev
Loading…
UPSTREAM PR #21597: SYCL: fix multi-GPU system RAM exhaustion by using Level Zero allocations
#1340
opened Apr 8, 2026 by
loci-dev
Loading…
7 tasks done
UPSTREAM PR #21421: mtmd: add Gemma 4 audio conformer encoder support
#1336
opened Apr 6, 2026 by
loci-dev
Loading…
9 tasks done
UPSTREAM PR #21216: common : simplify autoparser tagged parser rules
#1335
opened Apr 6, 2026 by
loci-dev
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.