Pull requests: ggml-org/llama.cpp

Pull requests list

ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions
Labels: ggml
#12154 opened Mar 2, 2025 by remyoudompheng
Some portability improvements from trying to build with Visual Studio 2017
Labels: examples, ggml
#12150 opened Mar 2, 2025 by mgroeber9110
Vulkan: Add DP4A MMQ and Q8_1 quantization shader
Labels: ggml, Vulkan
#12135 opened Mar 1, 2025 by 0cc4m Draft
6 tasks
Server: openai-style lookup decoding
Labels: examples, python, server
#12127 opened Mar 1, 2025 by eeroel
sycl: cleanup oneDNN related code
Labels: documentation, ggml, SYCL
#12097 opened Feb 27, 2025 by sgeor255 Draft
opencl:Fix profile-related errors
Labels: ggml
#12095 opened Feb 27, 2025 by simon886212
cmake : fix undefined reference errors for std::filesystem in ggml (#12092)
Labels: ggml
#12094 opened Feb 27, 2025 by hbuxiaofei
vulkan: subgroup size test
Labels: ggml, Vulkan
#12087 opened Feb 26, 2025 by daniandtheweb Draft
Cache based tokenization for the server input prompts
Labels: demo, examples, server
#12067 opened Feb 25, 2025 by vnicolici
[WIP]backend: Integrating QNN (Qualcomm AI Engine Direct) as a dedicated backend for Qualcomm NPUs
Labels: build, ggml
#12063 opened Feb 25, 2025 by chraac Draft
Updated readme file for rpc server
Labels: examples
#12052 opened Feb 24, 2025 by SudaisAlam
llama-tts
Labels: examples
#12042 opened Feb 23, 2025 by marcoStocchi
tool-call: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars
Labels: examples, python, script, server, testing
#12034 opened Feb 22, 2025 by ochafik
2 of 3 tasks
Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention
Labels: ggml, Nvidia GPU
#12032 opened Feb 22, 2025 by hjc4869
server webui easy config selection
Labels: demo, examples, server
#12031 opened Feb 22, 2025 by poulphunter
common: add -jf / --json-schema-file flag
#12011 opened Feb 21, 2025 by ochafik
llama : add xcframework build script
Labels: devops, examples, script
#11996 opened Feb 21, 2025 by danbev
deepseek r1 series debug log warning fix and chat template support
Labels: testing
#11994 opened Feb 21, 2025 by swordow
ggml-cpu: add arm64 CPU feature check for OpenBSD, FreeBSD
Labels: ggml
#11939 opened Feb 18, 2025 by brad0
rpc: check op supporting
Labels: ggml
#11923 opened Feb 17, 2025 by thxCode