Pull requests: ggml-org/llama.cpp


Pull requests list

Support jina-reranker-v3 cross-encoder architecture
Labels: python, testing
#22576 opened May 1, 2026 by Dampish0 Draft
hexagon: enable non-contiguous row tensor support for unary ops
Labels: ggml, Hexagon
#22574 opened May 1, 2026 by aparmp-quic Contributor
llama-quant : fix --tensor-type when default qtype is overriden
#22572 opened May 1, 2026 by ddh0 Contributor
Swap out F16 for BF16 in Q8_1 activations to avoid overflowing values
Labels: ggml, Nvidia GPU
#22571 opened May 1, 2026 by bartowski1182 Contributor Draft
[Draft] feat: implement paged KV cache and attention
Labels: examples, ggml, model, Nvidia GPU, testing
#22569 opened Apr 30, 2026 by matiaslin Contributor Draft
ggml-cpu: fix msvc c2440 cast error for m512bh and m256bh in sgemm.cpp
Labels: ggml
#22568 opened Apr 30, 2026 by nanodan52
devops: SYCL: upgraded the default compute-runtime version
Labels: devops
#22567 opened Apr 30, 2026 by WizardlyBump17 Contributor
fix: consistent memory breakdown for models loaded with no_alloc
Labels: testing
#22566 opened Apr 30, 2026 by giladgd Contributor
cmake: fix MATH_LIBRARY check on Windows MSVC
Labels: ggml
#22564 opened Apr 30, 2026 by ServeurpersoCom Contributor
server : avoid checkpoint data host copies
Labels: examples, server
#22558 opened Apr 30, 2026 by ggerganov Member
ggml-virtgpu: fix transitive dependency in headers
Labels: ggml
#22557 opened Apr 30, 2026 by Juste-Leo2
kleidiai : update to v1.24.0 and use release archive
Labels: ggml
#22549 opened Apr 30, 2026 by chaxu01 Collaborator
server: support Vertex AI compatible API
Labels: examples, python, server
#22545 opened Apr 30, 2026 by ngxson Contributor Draft
docs : update speculative decoding parameters after refactor (#22397)
Labels: documentation
#22539 opened Apr 30, 2026 by ggerganov Member Draft
fix: CUDA device PCI bus ID de-dupe OOMing (ignoring other 3 gpus entirely)
Labels: ggml, Nvidia GPU
#22533 opened Apr 29, 2026 by lucyknada
[Model] Support MiniCPM-V 4.6
Labels: documentation, examples, python
#22529 opened Apr 29, 2026 by tc-mb Contributor
sycl: Add optional USM system allocations
Labels: documentation, ggml, SYCL
#22526 opened Apr 29, 2026 by ifdu
ggml-cpu: optimize ggml_gemm_q4_K_8x8_q8_K interleaving/staging for AVX-512 (and AVX2)
Labels: ggml
#22525 opened Apr 29, 2026 by HyeongiJeon
Programmatic Dependent Launch (PDL) for more performance on newer NVIDIA GPUs (Hopper+)
Labels: ggml, Nvidia GPU
#22522 opened Apr 29, 2026 by aendk Contributor Draft
mtmd : add Nemotron 3 Nano Omni support (parakeet)
Labels: examples, python
#22520 opened Apr 29, 2026 by danbev Member Draft