Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

kv-cache : support layer reuse
#15504 opened Aug 22, 2025 by ggerganov Loading…
test-opt: allow slight inprecision testing Everything test related
#15503 opened Aug 22, 2025 by JohannesGaessler Loading…
[CANN]Rope repeat Optimization Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#15501 opened Aug 22, 2025 by noemotiovon Loading…
ggml : introduce semantic versioning ggml changes relating to the ggml tensor library for machine learning
#15499 opened Aug 22, 2025 by danbev Draft
model : gpt-oss add response_format support
#15494 opened Aug 22, 2025 by aldehir Loading…
Model: Add support for Seed-OSS python python script changes script Script related
#15490 opened Aug 21, 2025 by pwilkin Draft
vulkan: Rewrite synchronization to allow some overlap between nodes ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#15489 opened Aug 21, 2025 by jeffbolznv Loading…
tests: Generate unique input values for count_equal testing Everything test related
#15487 opened Aug 21, 2025 by jeffbolznv Loading…
mtmd : support Kimi VL model examples python python script changes
#15458 opened Aug 20, 2025 by ngxson Loading…
CUDA: Accelerate MXFP4 table lookup using __byte_perm ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#15451 opened Aug 20, 2025 by Qeeweew Loading…
ggml WebGPU: add support for quantization types ggml changes relating to the ggml tensor library for machine learning python python script changes
#15440 opened Aug 20, 2025 by reeselevine Loading…
CANN: optimize rope cache Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#15436 opened Aug 20, 2025 by hipudding Loading…
llama: use FA + max. GPU layers by default examples ggml changes relating to the ggml tensor library for machine learning python python script changes script Script related
#15434 opened Aug 19, 2025 by JohannesGaessler Loading…
vulkan: optimize mul_mat_id loading row ids into shared memory ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#15427 opened Aug 19, 2025 by jeffbolznv Loading…
support interns1-mini python python script changes
#15412 opened Aug 19, 2025 by RunningLeon Loading…
rpc : reuse compute graphs ggml changes relating to the ggml tensor library for machine learning
#15405 opened Aug 18, 2025 by rgerganov Draft
vulkan : support ggml_mean ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#15393 opened Aug 18, 2025 by Acly Loading…
fix: Add conditional compilation for OpenCL 2.0 compatibility ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#15383 opened Aug 18, 2025 by baonudesifeizhai Loading…
vulkan: optimize mxfp4 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#15363 opened Aug 16, 2025 by lovedheart Loading…
aLoRA Support examples python python script changes server
#15327 opened Aug 14, 2025 by gabe-l-hart Loading…
1 task done
OpenCL: add fused group_norm/norm, mul, add ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend testing Everything test related
#15314 opened Aug 14, 2025 by rmatif Loading…
Add OpenVINO backend devops improvements to build systems and github actions documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#15307 opened Aug 14, 2025 by wine99 Draft
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.