You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It was a versions issue, however I had pulled the latest code.
The problem was that since the last version I had installed, the location of the llama-cli file was changed (it used to be on the root directory, now it's on build/bin), so it didn't overwrite the one I was using to check the version or to do the actual inference.
Thank you, hope this helps someone else so I don't feel that dumb,
Name and Version
version: 3400 (97bdd26)
built with cc (Ubuntu 9.4.0-1ubuntu1~20.04.2) 9.4.0 for x86_64-linux-gnu
Operating systems
Linux
GGML backends
CUDA
Hardware
NVIDIA GeForce RTX 3050
Models
microsoft_Phi-4-mini-instruct-Q8_0.gguf
Problem description & steps to reproduce
When I run the llama server with the phi4 mini gguf I get the error
First Bad Commit
No response
Relevant log output
The text was updated successfully, but these errors were encountered: