13 Jun 23:27

mudler

4d3d54d

v4.4.3 Latest

Latest

What's Changed

Other Changes

chore: ⬆️ Update CrispStrobe/CrispASR to d745bda4386ae0f9d1d2f23fff8ec95d76428221 by @localai-bot in #10260
docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #10259
chore: ⬆️ Update antirez/ds4 to d881f2a05e8ff6bec001315a36b794b4aa310173 by @localai-bot in #10262
chore: ⬆️ Update mudler/parakeet.cpp to 9db92be63179a27201d3b88d5d40c545b2ac48ae by @localai-bot in #10263
feat(react-ui): add Indonesian language support by @dedyf5 in #10266
chore: ⬆️ Update ggml-org/llama.cpp to 4c6595503fe45d5a39f88d194e270f64c7424677 by @localai-bot in #10261
feat(backend): locate-anything-cpp (open-vocabulary object detection via ggml) by @localai-bot in #10264
fix(router): production-ready request router + auto-size batch for embedding/rerank by @richiejp in #10104
chore(model gallery): 🤖 add 1 new models via gallery agent by @localai-bot in #10270
feat(parakeet-cpp): enable GGML_CUDA_GRAPHS in the cublas build by @localai-bot in #10273
fix(darwin): publish sherpa-onnx and speaker-recognition images for darwin/arm64 by @localai-bot in #10275
fix(crispasr): write piper TTS WAV at the model's native sample rate by @localai-bot in #10277
feat(crispasr): bundle espeak-ng and add piper TTS voices to the gallery by @localai-bot in #10283
chore: ⬆️ Update mudler/parakeet.cpp to b8012f11e5269126eddb7f4fd02f891a2ccc29b0 by @localai-bot in #10281
docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #10279
fix(mlx): route vision-language models to the mlx-vlm backend by @localai-bot in #10274
fix(darwin): fix vibevoice-cpp build linkage + fail-safe go backend packaging by @localai-bot in #10276
fix(agents): emit chat event timestamps in milliseconds (#9867) by @aniruddh909 in #10243
fix(realtime): keep transcription model on a language-only session.update by @localai-bot in #10295
chore: ⬆️ Update mudler/locate-anything.cpp to 92c1682da792c1e8a5dec91acc2be4b02c742ded by @localai-bot in #10282
fix(config): backend-gate the top_k=40 sampler default (#6632) by @localai-bot in #10285
feat(gallery): add 60 piper TTS voices across 42 languages (Phase 2) by @localai-bot in #10296
fix(deps): bump cogito to fix MCP image-result panic (#10101) by @localai-bot in #10294
fix(neutts): pin torchaudio to match torch (fixes undefined symbol) (#9798) by @localai-bot in #10292
fix(gallery): make opus a meta backend for platform auto-selection (#9813) by @localai-bot in #10291
chore(model gallery): 🤖 add 1 new models via gallery agent by @localai-bot in #10298
fix(gallery): correct meta-backend definitions for platform auto-selection by @localai-bot in #10299
chore(model gallery): 🤖 add 1 new models via gallery agent by @localai-bot in #10302
ci(darwin): build the ds4 backend for darwin/arm64 (metal) by @localai-bot in #10303
fix(react-ui): stop Talk pipeline overflow and center collapsed-rail avatar by @localai-bot in #10305
chore(model gallery): 🤖 add 1 new models via gallery agent by @localai-bot in #10304
fix(react-ui): make agent chat timestamps format-agnostic (#9867) by @localai-bot in #10290
model: fix case-insensitive suffix matching and skip .bak files in ListFilesInModelPath by @pos-ei-don in #10306
fix(xsysinfo): container-aware total RAM detection (cgroup/lxcfs) (#8059) by @localai-bot in #10288
feat(distributed): declarative per-model scheduling via env/args by @localai-bot in #10308
feat(sherpa-onnx): add Kokoro TTS + multilingual Piper voices by @localai-bot in #10309
feat(omnivoice-cpp): add OmniVoice TTS backend (file + streaming, voice cloning + voice design) by @localai-bot in #10310
feat(i18n): add Korean (ko) translation by @moduvoice in #10312
feat(qwen3-tts-cpp): migrate to ServeurpersoCom/qwentts.cpp (streaming, speakers, voice design) by @localai-bot in #10316
feat(realtime): gate realtime pipeline voice models behind voice recognition by @localai-bot in #10319
chore: ⬆️ Update vllm-project/vllm cu130 wheel to 0.23.0 by @localai-bot in #10314
test(e2e): live-server voice-recognition gate test by @localai-bot in #10324

New Contributors

@aniruddh909 made their first contribution in #10243
@moduvoice made their first contribution in #10312

Full Changelog: v4.4.2...v4.4.3

Contributors

richiejp, pos-ei-don, and 4 other contributors

Assets 9

11 Jun 22:22

mudler

v4.4.2

58cdc05

v4.4.2

What's Changed

Other Changes

chore: ⬆️ Update ggml-org/llama.cpp to ac4cddeb0dbd778f650bf568f6f08344a06abe3a by @localai-bot in #10239
chore: ⬆️ Update CrispStrobe/CrispASR to 4b27392ffd0991a857594652cbb8b57e585bcd7b by @localai-bot in #10241
fix(vllm): parse tool_call function arguments before applying the chat template by @pos-ei-don in #10256
fix(cuda): install cuda-nvrtc-dev alongside the other CUDA dev packages by @pos-ei-don in #10257

Full Changelog: v4.4.1...v4.4.2

Contributors

pos-ei-don and localai-bot

Assets 9

11 Jun 16:33

mudler

v4.4.1

f618636

v4.4.1

What's Changed

Other Changes

docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #10245
chore: ⬆️ Update antirez/ds4 to 8384adf0f9fa0f3bb342dd925372de778b95b263 by @localai-bot in #10242
fix(vllm): restore compatibility with vLLM >= 0.22 (get_tokenizer moved to vllm.tokenizers) by @pos-ei-don in #10252
feat(realtime): stream the LLM / TTS / transcription pipeline stages by @localai-bot in #10176
docs: fix broken relref to realtime page by @localai-bot in #10255

New Contributors

@pos-ei-don made their first contribution in #10252

Full Changelog: v4.4.0...v4.4.1

Contributors

pos-ei-don and localai-bot

Assets 9

10 Jun 20:10

mudler

v4.4.0

fba8c9c

v4.4.0

🎉 LocalAI 4.4.0 Release! 🚀

LocalAI 4.4.0 is out!

This is a big, multimodal-and-distributed release. Two brand-new audio backends land - parakeet.cpp (NVIDIA NeMo Parakeet ASR) and CrispASR (a multi-architecture ASR and TTS engine) - alongside native object detection + segmentation (rfdetr-cpp), video understanding in llama-cpp, and LTX-2 video generation in stablediffusion-ggml. Distributed mode grows up: prefix-cache-aware routing is on by default, and file transfers become resumable. There's a new intelligent middleware layer for request routing, PII filtering and cloud-model proxying, a security hardening pass that closes a credential-leak class across every outbound HTTP client, an interactive local-ai chat CLI, RAG source citations for agents, and a long run of reasoning / tool-call streaming fixes.

📌 TL;DR

Area	Summary
🎙️ Two new ASR backends	`parakeet-cpp` (NeMo FastConformer TDT/CTC/RNNT, streaming, word/segment timestamps) and `crispasr` (many ASR architectures + TTS in one binary).
🧭 Intelligent Middleware	Capability-based model routing, PII detection/redaction, cloud-model proxies + a MITM proxy for subscription-auth Claude Code / Codex.
🛰️ Distributed v4	Prefix-cache-aware routing (on by default), NATS JWT auth + TLS/mTLS, worker registration-token enforcement, resumable HTTP file transfers, boot-time model prefetch, ds4 layer-split inference.
🎥 Video, both ways	Video input (understanding) in `llama-cpp` via mtmd, and video generation via LTX-2 in `stablediffusion-ggml`.
👁️ Detection + Segmentation	New native `rfdetr-cpp` backend (RF-DETR), 32 prebuilt GGUFs, bbox + per-detection PNG masks.
🔐 Outbound HTTP hardening	`pkg/httpclient` refuses cross-host credential-leaking redirects across every outbound client (GHSA-3mj3-57v2-4636).
🗣️ TTS per-request control	`instructions` + a generic `params` map plumbed end to end (Qwen3-TTS VoiceDesign / CustomVoice, Chatterbox).
💻 `local-ai chat`	Interactive terminal chat against a running server, with `/models`, `/model`, `/clear`.
📚 RAG citations	Agent answers now append a clickable `Sources:` block from the Knowledge Base.
🧠 Models	Gemma 4 QAT family + QAT-matched MTP speculative-decoding bundles, Ideogram4, LTX-2.3 22B GGUFs.

🚀 New Features & Major Enhancements

🎙️ Audio Gets Serious: Two New ASR Backends

This release doubles down on speech-to-text with two independent, cgo-less Go backends (purego, CGO_ENABLED=0), each shipping a full CI matrix, gallery importer and docs.

parakeet-cpp - NVIDIA NeMo Parakeet (#10084). Wraps parakeet.cpp, a C++/ggml port of NeMo Parakeet (FastConformer TDT/CTC/RNNT/hybrid) that matches the upstream PyTorch models on CPU. Text transcription, OpenAI-compatible word timestamps, and cache-aware streaming (16 kHz PCM chunks, <EOU>/<EOB> utterance boundaries). GGUFs for all 10 Parakeet models × 5 quants ship in mudler/parakeet-cpp-gguf. Follow-ups in this cycle made it production-grade:

Dynamic batching (#10112) - concurrent transcription requests are batched for throughput.
Real, NeMo-faithful segment timestamps (#10207) - words are grouped into segments exactly like NeMo's get_segment_offsets (sentence-punctuation boundaries by default, opt-in segment_gap_threshold silence splitting in encoder frames). Streaming FinalResult segments now carry start/end when the library exposes the ABI v4 JSON entry points.
nemotron-3.5-asr multilingual streaming (#10199) + per-request language selection.

crispasr - many architectures + TTS in one backend (#10099). Wraps CrispASR (a whisper.cpp/ggml fork, MIT) through its session C-ABI. One backend serves ASR or TTS depending on the loaded model, with the architecture auto-detected from the GGUF (or forced via backend:). The gallery gains 36 -crispasr entries (32 ASR + 4 TTS):

ASR (e2e-verified across Whisper / Parakeet / Moonshine): parakeet, canary, cohere, qwen3, voxtral, granite, fastconformer-ctc, wav2vec2, hubert, data2vec, glm-asr, kyutai-stt, firered-asr, moonshine, mimo-asr, and more.
TTS (all four e2e-verified to valid 24 kHz mono WAV): vibevoice, chatterbox, qwen3-tts CustomVoice, orpheus - via backend: / codec: / speaker: / voice: model options.

🧭 Intelligent Middleware: Routing, PII Filtering & Cloud Proxies

A new middleware layer (#9802) analyzes, routes, filters and transforms chat requests before they hit a model.

Capability-based routing. Requests are classified (e.g. via an ArchRouter-style model) and scored across the capabilities they may require, then routed to the smallest model that satisfies them - easy requests go to small specialized models, hard or uncertain ones to larger general-purpose models. Classified embeddings are reused via cosine similarity so similar requests skip re-classification.
PII filtering. Private information is detected per-pattern and can be redacted, rerouted, or blocked, with a streaming PII filter that preserves a buffered-emit invariant on /v1/chat/completions, Anthropic /v1/messages, and /v1/completions. A per-model PII pattern editor lives in the model config UI.
Cloud model proxies + MITM. Cloud models and a MITM proxy can take part in routing/filtering - send easy requests to local models and hard ones to the cloud, and use Claude Code / Codex subscriptions (OAuth) through the PII filter via the MITM proxy (subject to provider ToS). Emits proxy_connect + proxy_traffic audit events and restores its listener from runtime_settings.json on restart.

Usage stats are recorded end to end and surfaced in REST, the UI, and MCP. Outbound clients used by this path were also the trigger for the security pass below.

🛰️ Distributed Mode v4

Distributed mode keeps maturing across routing, security and resilience.

Prefix-cache-aware routing, on by default (#10071). Routing now biases toward the replica that already holds the relevant KV/prefix cache, as a load-guarded hint that never routes worse than today's round-robin. A generic prefix tree (pkg/radixtree) maps cumulative prompt-prefix hashes to nodes; core/services/nodes/prefixcache turns the rendered prompt into a deterministic xxhash chain and makes a filter-then-score decision (narrow to load-eligible replicas, then prefer the longest-prefix match), feeding a preferredNodeID into the existing atomic SELECT ... FOR UPDATE pick. Observations sync across frontends over NATS. Round-robin is the floor; disable with --distributed-prefix-cache=false.

NATS JWT auth + TLS/mTLS (#10159). Previously anyone with access to the NATS port could publish backend-install messages or agent jobs (an SSRF / accidental-exposure risk). This adds JWT authentication and TLS/mTLS options, with workers acquiring and auto-refreshing their NATS credentials. Complemented by worker file-transfer registration-token enforcement (#10183).

Resumable file transfers (#10109). Large model GGUFs over flaky/throttled links no longer restart from byte 0. The worker's PUT /v1/files/<key> honors Content-Range (308/416 resume semantics, X-Content-SHA256 binding, final-hash verification) and the master-side stager HEAD-probes for the last accepted offset and resumes, switching to an outer time budget (LOCALAI_FILE_TRANSFER_BUDGET, default 1h) with exponential backoff.

ds4 layer-split distributed inference (#10098). Manual layer-split inference for the ds4 backend: a coordinator owns layers 0:K and listens; workers dial in and own higher ranges, each loading only its slice of the GGUF (a new dependency-free ds4-worker binary, driven via local-ai worker ds4-distributed). Fully back-compatible when ds4_role is absent.

Operational glue. Boot-time gallery prefetch via LOCALAI_PREFETCH_MODELS (#10108); a gated X-LocalAI-Node response header for attribution (#9976); plus fixes: self-heal stale "model not loaded" routing (#10181), stage directory-based models to remote nodes (#10175), in-flight tracking for non-LLM methods - VAD, diarize, voice (#10238), reconciler survives frontend restarts (#9981), cross-replica OpCache sync (#9983), and the reinstall/upgrade UI no longer sticks on "reinstalling" (#10214).

🎥 Video, Both Directions

Video input / understanding in llama-cpp (#10216). Video-capable multimodal models (e.g. SmolVLM2-Video) can now be sent a video in a chat request, mirroring the existing image and audio paths. Tracks the upstream mtmd video landing (ggml-org/llama.cpp#24269); grpc-server.cpp forwards request->videos() into the mtmd files vector on both the template and non-template paths, and the React chat UI accepts video/*, renders an inline <video controls> player, and emits video_url content parts. allow_video is auto-gated by whether the loaded mmproj supports it. ffmpeg/ffprobe (already in the runtime image) extract frames.

Video generation via LTX-2 (#9980). stablediffusion-ggml wires audio_vae_path and embeddings_connectors_path through to the upstream LTX-2 fields, with a new gallery/ltx-ggml.yaml template (T2V / I2V / FLF2V recipes) and six LTX-2.3 22B GGUF gallery entries (dev + distilled, UD-Q4_K_M / Q4_K_M / Q8_0), each bundling the text encoder + video VAE + audio VAE + embeddings connectors. Follow-up fixes wi...

Contributors

richiejp, mudler, and 9 other contributors

Assets 9

30 May 21:23

mudler

v4.3.6

aee4611

v4.3.6

What's Changed

Other Changes

chore: ⬆️ Update ggml-org/llama.cpp to 22d66b567eef11cf2e9832f04db64ee0323a0fd0 by @localai-bot in #10080
security(http): refuse redirects on outbound clients via hardened pkg/httpclient by @richiejp in #10087
feat(parakeet-cpp): add NVIDIA NeMo Parakeet ASR backend (parakeet.cpp) by @localai-bot in #10084
chore: ⬆️ Update antirez/ds4 to e16ead1e29c81a67bbb64e5b001117679cf9ce6e by @localai-bot in #10076
chore: ⬆️ Update mudler/parakeet.cpp to 30a307553f1965ceb38a1a922069a71e7dd67bf3 by @localai-bot in #10092

Full Changelog: v4.3.5...v4.3.6

Contributors

richiejp and localai-bot

Assets 9

29 May 23:18

mudler

v4.3.5

7e4df67

v4.3.5

What's Changed

Bug fixes 🐛

fix: tool-call JSON leaks into content with stream+tools on tokenizer-template models (#10052) by @localai-bot in #10057
fix(openai): stop streaming tool-call double-emission when autoparser is active by @bozhouDev in #10055
fix(application): stop backend processes synchronously on shutdown by @richiejp in #10058
fix(functions): validate auto-detected XML tool-call names — robust glm-4.5/Hermes guard (#9722, supersedes #9940) by @localai-bot in #10059
fix(model): track intentional stops, stop misreading clean shutdowns as crashes by @richiejp in #10060

Exciting New Features 🎉

feat(reasoning): honor per-request reasoning_effort on chat completions by @localai-bot in #10082

Other Changes

chore: ⬆️ Update mudler/rf-detr.cpp to ecf64d7f7f20d73ebd906a983f398ed287256320 by @localai-bot in #10035
docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #10046
chore: ⬆️ Update antirez/ds4 to 22393e770ea8eb7501d8718d6f66c6374004e03f by @localai-bot in #10047
chore: ⬆️ Update leejet/stable-diffusion.cpp to 0e4ee04488159b81d95a9ffcd983a077fd5dcb77 by @localai-bot in #10048
chore: ⬆️ Update ggml-org/llama.cpp to 751ebd17a58a8a513994509214373bb9e6a3d66c by @localai-bot in #10049
chore: ⬆️ Update ikawrakow/ik_llama.cpp to 6eff055a0cc0e427a6849cfcb5de531b4b82d667 by @localai-bot in #10050
chore: ⬆️ Update ggml-org/whisper.cpp to c932729a304f7d9eb5354afa38624cfa86a780cf by @localai-bot in #10051
test(react-ui): cover models gallery empty-state reset flow by @Oceankj in #10019
test(utils): cover path verification, sanitization, and unique naming by @TLoE419 in #9978
chore(model gallery): 🤖 add 1 new models via gallery agent by @localai-bot in #10061
chore: ⬆️ Update ikawrakow/ik_llama.cpp to 8960c5ba5ee9db30ba838304373aa4dbec9f7cbd by @localai-bot in #10077
chore: ⬆️ Update vllm-project/vllm cu130 wheel to 0.22.0 by @localai-bot in #10079
chore(model-gallery): ⬆️ update checksum by @localai-bot in #10081
docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #10074
chore: ⬆️ Update mudler/rf-detr.cpp to 65c0ffcc9a9bc9dae38252f63d0417c9845a6cf7 by @localai-bot in #10075
chore: ⬆️ Update ggml-org/whisper.cpp to f24588a272ae8e23280d9c220536437164e6ed28 by @localai-bot in #10078

New Contributors

@bozhouDev made their first contribution in #10055
@Oceankj made their first contribution in #10019
@TLoE419 made their first contribution in #9978

Full Changelog: v4.3.4...v4.3.5

Contributors

richiejp, Oceankj, and 3 other contributors

Assets 9

28 May 21:22

mudler

v4.3.4

1c92b00

v4.3.4

What's Changed

Other Changes

fix(turboquant): guard upstream-only grpc-server fields for fork by @localai-bot in #10043

Full Changelog: v4.3.3...v4.3.4

Contributors

localai-bot

Assets 9

28 May 13:04

mudler

v4.3.3

b81a6d0

v4.3.3

What's Changed

Other Changes

chore: ⬆️ Update ikawrakow/ik_llama.cpp to 3bf7e836c2c5a895e8d12d3eb7e398ae7ab2f9ce by @localai-bot in #10037
chore(model-gallery): ⬆️ update checksum by @localai-bot in #10038
chore: ⬆️ Update ggml-org/llama.cpp to aa50b2c2ae91326d5aad956ceeb015d1d48e626b by @localai-bot in #10034
chore: ⬆️ Update leejet/stable-diffusion.cpp to 29ab511fc75f89fbab148665eab1a8e10a139a72 by @localai-bot in #10033
chore: ⬆️ Update ggml-org/whisper.cpp to 6dcdd6536456158667747f724d6bd3a2ceaa8d88 by @localai-bot in #10032
chore: ⬆️ Update antirez/ds4 to 072bc0feb187be5f374c08b16d0045e1ad7bc9bc by @localai-bot in #10036
fix(openresponses): populate Content and accept bare {role,content} items (#10039) by @Anai-Guo in #10040
perf(react-ui): code-split bundle, speed up coverage suite by @richiejp in #10042

Full Changelog: v4.3.2...v4.3.3

Contributors

richiejp, localai-bot, and Anai-Guo

Assets 4

27 May 22:06

mudler

v4.3.2

373dc44

v4.3.2

What's Changed

👒 Dependencies

chore(deps): bump github.com/nats-io/nats.go from 1.50.0 to 1.52.0 by @dependabot[bot] in #10003
chore(deps): bump github.com/aws/aws-sdk-go-v2/credentials from 1.19.15 to 1.19.17 by @dependabot[bot] in #10008
chore(deps): bump actions/stale from 10.2.0 to 10.3.0 by @dependabot[bot] in #10002
chore(deps): bump sentence-transformers from 5.5.0 to 5.5.1 in /backend/python/transformers by @dependabot[bot] in #10007
chore(deps): update transformers requirement from >=5.8.1 to >=5.9.0 in /backend/python/transformers by @dependabot[bot] in #10005
chore(deps): bump protobuf from 6.33.5 to 7.35.0 in /backend/python/transformers by @dependabot[bot] in #10004

Other Changes

feat(middleware): Model routing, PII filtering, Cloud model proxies by @richiejp in #9802
fix(intel): VRAM detection by @richiejp in #9944
feat(distributed): gated X-LocalAI-Node response header (middleware + wrapper) by @localai-bot in #9976
fix(distributed): persist per-model load info so reconciler survives frontend restart by @localai-bot in #9981
feat(stablediffusion-ggml): LTX-2 support + LTX-2.3 GGUF gallery entries by @localai-bot in #9980
fix(distributed): sync gallery OpCache + caches across frontend replicas by @localai-bot in #9983
fix(gallery/ltx-2.3): add diffusion_model flag to all variants by @mudler in #9986
fix(gallery/ltx-2.3): add vae_decode_only:false for i2v / flf2v by @mudler in #9987
fix(reasoning): stop leaking into content when autoparser is in pure-content mode by @localai-bot in #9991
fix(stablediffusion-ggml): mux LTX-2 audio into output MP4 by @localai-bot in #9990
feat(swagger): update swagger by @localai-bot in #9992
docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #9993
fix(streaming/tools): stop healing-marker stubs from gating off content by @localai-bot in #9999
chore: ⬆️ Update antirez/ds4 to ad0209f6a4b067574d2b4afe896c08c177156b31 by @localai-bot in #9996
chore: ⬆️ Update ikawrakow/ik_llama.cpp to b4e1d916c5ec7e75ea3c124dd090425a99fc613f by @localai-bot in #9995
chore: ⬆️ Update leejet/stable-diffusion.cpp to 1ceb5bd9df7784bcdf67dd9ed8bf0198b542ebc9 by @localai-bot in #9994
chore: ⬆️ Update ggml-org/whisper.cpp to e0fd1f6787a5bd4a4957dd97c5b64df882ee7b0c by @localai-bot in #9997
fix(streaming/tools): don't leak prefill-misclassified content as trailing reasoning chunk by @localai-bot in #10000
chore: ⬆️ Update ggml-org/llama.cpp to 35c9b1f39ebe5a7bb83986d64415a079218be78d by @localai-bot in #9998
chore(model gallery): 🤖 add 1 new models via gallery agent by @localai-bot in #10011
fix(dockerignore): exclude local-only artifacts from build context by @richiejp in #10015
chore(model gallery): 🤖 add 1 new models via gallery agent by @localai-bot in #10016
test: add Go + React UI coverage gates and fill test gaps by @richiejp in #9989
fix(qwen-asr): enable timestamp output when forced_aligner is configured by @fqscfqj in #10013
fix(nemo): extract Hypothesis.text for TDT/RNNT ASR models by @fqscfqj in #10012
chore: ⬆️ Update ikawrakow/ik_llama.cpp to d2da6da05c73aeb658a3d1751f386c24e6963856 by @localai-bot in #10020
chore: ⬆️ Update ggml-org/whisper.cpp to 27101c01dcac1676e2b6422256233cd0f1f9ae28 by @localai-bot in #10021
chore: ⬆️ Update ggml-org/llama.cpp to 0d18aaa9d1a8af3df9abccd828e22eeaac7f840b by @localai-bot in #10022
chore: ⬆️ Update leejet/stable-diffusion.cpp to 92dc7268fc4ffb0c0cc0bd52dfcefea91326e797 by @localai-bot in #10023
chore: ⬆️ Update antirez/ds4 to e8e8779b261c10f36ad6270ba732c8f0be5b62e3 by @localai-bot in #10024
UI: add 'Fits in my GPU' filter on Install Models by @siddimore in #10017
fix(react-ui): share single /api/operations poller across consumers by @localai-bot in #10029
feat(backend): rfdetr-cpp native object detection + segmentation backend by @localai-bot in #10028
fix(react-ui): polish 'Fits in my GPU' filter to use design-system Toggle by @localai-bot in #10030
fix(react-ui): force .check() on hidden Toggle input in fits-filter e2e by @localai-bot in #10031

New Contributors

@fqscfqj made their first contribution in #10013

Full Changelog: v4.3.1...v4.3.2

Contributors

richiejp, mudler, and 4 other contributors

Assets 9

25 May 07:28

mudler

v4.3.1

1dcd1ae

v4.3.1

What's Changed

Other Changes

Fix kokoros backend build break from Backend trait drift by @Copilot in #9972
chore: ⬆️ Update antirez/ds4 to f91c12b50a1448527c435c028bfc70d1b00f6c33 by @localai-bot in #9975
chore: ⬆️ Update ikawrakow/ik_llama.cpp to 9f7ba245ab41e118f03aa8dd5134d18a81159d02 by @localai-bot in #9973
chore: ⬆️ Update ggml-org/llama.cpp to 549b9d84330c327e6791fa812a7d60c0cf63572e by @localai-bot in #9974

Full Changelog: v4.3.0...v4.3.1

Contributors

localai-bot

Assets 9

Uh oh!

Releases: mudler/LocalAI

v4.4.3

What's Changed

Other Changes

New Contributors

Contributors

Uh oh!

v4.4.2

What's Changed

Other Changes

Contributors

Uh oh!

v4.4.1

What's Changed

Other Changes

New Contributors

Contributors

Uh oh!

v4.4.0

🎉 LocalAI 4.4.0 Release! 🚀

📌 TL;DR

🚀 New Features & Major Enhancements

🎙️ Audio Gets Serious: Two New ASR Backends

🧭 Intelligent Middleware: Routing, PII Filtering & Cloud Proxies

🛰️ Distributed Mode v4

🎥 Video, Both Directions

Contributors

Uh oh!

v4.3.6

What's Changed

Other Changes

Contributors

Uh oh!

v4.3.5

What's Changed

Bug fixes 🐛

Exciting New Features 🎉

Other Changes

New Contributors

Contributors

Uh oh!

v4.3.4

What's Changed

Other Changes

Contributors

Uh oh!

v4.3.3

What's Changed

Other Changes

Contributors

Uh oh!

v4.3.2

What's Changed

👒 Dependencies

Other Changes

New Contributors

Contributors

Uh oh!

v4.3.1

What's Changed

Other Changes

Contributors

Uh oh!