
Bug: Model isn't loading #9563

Open
iladshyan opened this issue Sep 20, 2024 · 7 comments
Labels
bug-unconfirmed, high severity (used to report high-severity bugs in llama.cpp; malfunctioning hinders an important workflow)

Comments

@iladshyan

What happened?

llama3.1 isn't loading at all. I get the following in the terminal and then the program just quits:

./llama-cli -m "C:\<path>\llama3.1.gguf" -p "The world is a place where"
build: 3787 (6026da52) with MSVC 19.29.30154.0 for x64
main: llama backend init
main: load the model and apply lora adapter, if any

Name and Version

./llama-cli --version
version: 3787 (6026da5)
built with MSVC 19.29.30154.0 for x64

What operating system are you seeing the problem on?

Windows

Relevant log output

build: 3787 (6026da52) with MSVC 19.29.30154.0 for x64
main: llama backend init
main: load the model and apply lora adapter, if any
@ninadakolekar

I'm also facing the same issue. Updating to 3803 didn't help.

@iladshyan
Author

> I'm also facing the same issue. Updating to 3803 didn't help.

By any chance, are you migrating from Ollama? If not, where did you get your model files?

@ninadakolekar

ninadakolekar commented Sep 23, 2024

I downloaded GGUF from huggingface: https://huggingface.co/bartowski/Phi-3.5-mini-instruct-GGUF

@salocinrevenge

I tried to run exactly this model: https://huggingface.co/bartowski/Phi-3.5-mini-instruct-GGUF/blob/main/Phi-3.5-mini-instruct-IQ2_M.gguf on Ubuntu and hit a memory problem:

ggml/src/ggml.c:438: fatal error
ggml_aligned_malloc: insufficient memory (attempted to allocate 49152,00 MB)

The memory allocator is probably over-allocating.

I tried a model with more parameters, https://huggingface.co/TheBloke/Llama-2-13B-GGUF/blob/main/llama-2-13b.Q2_K.gguf, and it ran normally with almost no extra memory.

@slaren
Collaborator

slaren commented Nov 1, 2024

@salocinrevenge add -c 1024 to the command line to use a smaller context size.

@ggerganov we keep getting bug reports because people don't realize that they cannot use the full context of current models. Should we revert this change?

@iladshyan
Author

Did anyone find a fix for the original issue?

@iladshyan
Author

I found out what was causing the issue: I was using the avx2 build while my CPU only supports avx. Switching builds fixed it. It would be great if the program could tell the user when the build requires instruction sets the CPU doesn't support.
