Skip to content

Unknown model architecture: qwen3 #1173

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
aropb opened this issue Apr 29, 2025 · 2 comments
Open

Unknown model architecture: qwen3 #1173

aropb opened this issue Apr 29, 2025 · 2 comments

Comments

@aropb
Copy link

aropb commented Apr 29, 2025

Description

LLamaSharp 0.23.0
CUDA
Windows
LLM: Qwen3-8B-Q5_K_M.gguf

Errors:
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'qwen3'
llama_model_load_from_file_impl: failed to load model

llama.cpp:
ggml-org/llama.cpp#12828

@aropb aropb changed the title llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'qwen3' Unknown model architecture: qwen3 Apr 29, 2025
@nipeone
Copy link

nipeone commented Apr 29, 2025

Because supporting qwen3 requires a relatively new version of llama.cpp, but currently the latest version of llama.cpp supported by LLamaSharp only reaches b026.

@sangyuxiaowu
Copy link
Contributor

wait #1179

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants