Pull requests: jundot/omlx
Pull requests list
feat(download): add concurrent model downloads and per-download worker settings
#758 opened Apr 14, 2026 by uncle9x9
5 tasks done
feat: PlanarQuant3 KV cache + DFlash speculative decoding
#757 opened Apr 14, 2026 by sooth
6 tasks
fix(reranker): align JinaForRanking with Jina Reranker V3 listwise scoring
#745 opened Apr 13, 2026 by j-huang-rj
fix(oq): chunked load/quantize and streaming VLM sanitizer for huge MoE models
#737 opened Apr 12, 2026 by yohann-bearzi
feat: add single_model_mode to force unload before load
#730 opened Apr 12, 2026 by jroth1111
[Performance] add hot cache only mode and optimize memory usage
#701 opened Apr 10, 2026 by RepublicOfKorokke
4 of 14 tasks
feat(admin): add hotswappable engine package management
#679 opened Apr 9, 2026 by 0xClandestine
Add support for ParoQuant and custom quantization method loading
#209 opened Mar 13, 2026 by liang2kl