-
Notifications
You must be signed in to change notification settings - Fork 12.8k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
test-opt: allow slight inprecision
testing
Everything test related
#15503
opened Aug 22, 2025 by
JohannesGaessler
Loading…
[CANN]Rope repeat Optimization
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15501
opened Aug 22, 2025 by
noemotiovon
Loading…
ggml : introduce semantic versioning
ggml
changes relating to the ggml tensor library for machine learning
vulkan: Rewrite synchronization to allow some overlap between nodes
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15489
opened Aug 21, 2025 by
jeffbolznv
Loading…
tests: Generate unique input values for count_equal
testing
Everything test related
#15487
opened Aug 21, 2025 by
jeffbolznv
Loading…
Fix incorrect causual attention mask caused by M-Rope
examples
#15474
opened Aug 21, 2025 by
rujialiu
Loading…
CUDA: Accelerate MXFP4 table lookup using changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
__byte_perm
ggml
#15451
opened Aug 20, 2025 by
Qeeweew
Loading…
ggml WebGPU: add support for quantization types
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#15440
opened Aug 20, 2025 by
reeselevine
Loading…
CANN: optimize rope cache
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15436
opened Aug 20, 2025 by
hipudding
Loading…
llama: use FA + max. GPU layers by default
examples
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
script
Script related
#15434
opened Aug 19, 2025 by
JohannesGaessler
Loading…
vulkan: optimize mul_mat_id loading row ids into shared memory
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15427
opened Aug 19, 2025 by
jeffbolznv
Loading…
support interns1-mini
python
python script changes
#15412
opened Aug 19, 2025 by
RunningLeon
Loading…
Thinking model disabled assistant prefill
examples
server
#15404
opened Aug 18, 2025 by
gabe-l-hart
Loading…
vulkan : support ggml_mean
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15393
opened Aug 18, 2025 by
Acly
Loading…
fix: Add conditional compilation for OpenCL 2.0 compatibility
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15383
opened Aug 18, 2025 by
baonudesifeizhai
Loading…
vulkan: optimize mxfp4
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15363
opened Aug 16, 2025 by
lovedheart
Loading…
aLoRA Support
examples
python
python script changes
server
#15327
opened Aug 14, 2025 by
gabe-l-hart
Loading…
1 task done
OpenCL: add fused group_norm/norm, mul, add
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
testing
Everything test related
#15314
opened Aug 14, 2025 by
rmatif
Loading…
Add OpenVINO backend
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.