-
Notifications
You must be signed in to change notification settings - Fork 14.7k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
nix: fix nix develop .#python-scripts
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#19218
opened Jan 30, 2026 by
teto
Loading…
ggml-cpu: split across kv for faster TG
ggml
changes relating to the ggml tensor library for machine learning
#19209
opened Jan 30, 2026 by
am17an
Loading…
ggml-virtgpu: make the code thread safe
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#19204
opened Jan 30, 2026 by
kpouget
Loading…
ggml-cpu: optimize q4_0_q8_0 scales using Zvfhmin
ggml
changes relating to the ggml tensor library for machine learning
#19196
opened Jan 30, 2026 by
ixgbe
Loading…
Remove pipeline cache mutexes
ggml
changes relating to the ggml tensor library for machine learning
#19195
opened Jan 30, 2026 by
nikhilJain17
Loading…
Refactor changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
vk_semaphore to be a shared_ptr object
ggml
#19193
opened Jan 30, 2026 by
sredman
Loading…
Bump cmake max version (needed for Windows on Snapdragon builds)
build
Compilation issues
documentation
Improvements or additions to documentation
#19188
opened Jan 29, 2026 by
max-krasnyansky
Loading…
model: support Longcat-Flash (help wanted)
help wanted
Needs help from the community
model
Model specific
python
python script changes
ggml-backend: fix async set/get fallback sync
ggml
changes relating to the ggml tensor library for machine learning
#19179
opened Jan 29, 2026 by
JohannesGaessler
Loading…
ggml: optimize ggml_vec_dot_mxfp4_q8_0 dot product on ARM SVE
ggml
changes relating to the ggml tensor library for machine learning
#19171
opened Jan 29, 2026 by
jiangshhh
Loading…
common : convert string contents to arrays if template requires typed content
jinja parser
Issues related to the jinja parser
#19156
opened Jan 28, 2026 by
ownia
Loading…
common: fix 32-bit overflow for avoiding year 2038 problem
#19148
opened Jan 27, 2026 by
GermanAizek
Loading…
[WIP]ggml-hexagon: flash-attn opt - part2
ggml
changes relating to the ggml tensor library for machine learning
llama: Add option to merge gate and exp weights
model
Model specific
python
python script changes
#19139
opened Jan 27, 2026 by
am17an
Loading…
ggml: aarch64: Implement SVE in Gemm q4_k 8x8 q8_k Kernel
ggml
changes relating to the ggml tensor library for machine learning
#19132
opened Jan 27, 2026 by
abhijain1204fujitsu
Loading…
Unified Delta Net
model
Model specific
python
python script changes
#19125
opened Jan 27, 2026 by
ymcki
Loading…
ggml-cpu: add RVV repack GEMM and GEMV for quantization types
ggml
changes relating to the ggml tensor library for machine learning
#19121
opened Jan 26, 2026 by
taimur-10x
Loading…
server: print actual model name in 'model not found" error
examples
server
#19117
opened Jan 26, 2026 by
teto
Loading…
Check if ctx or model is null before calling sampler
examples
#19101
opened Jan 26, 2026 by
gagankonana
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-01-27.