Conversation

@f291400 (Contributor) commented Dec 30, 2025

Make sure to read the contributing guidelines before submitting a PR

@ngxson (Collaborator) left a comment

Also wait for reviews from @CISC and @ggerganov for the libllama changes.

@f291400 requested reviews from CISC and ngxson December 31, 2025 01:16
@github-actions bot added the model (Model specific) label Dec 31, 2025
@CISC (Collaborator) left a comment

LGTM after whitespace fixes

@f291400 (Contributor, Author) commented Dec 31, 2025

So should I revert to the previous state, or leave it as it is?

@f291400 closed this Dec 31, 2025
@f291400 reopened this Dec 31, 2025
@CISC (Collaborator) commented Dec 31, 2025

> So should I revert to the previous state, or leave it as it is?

Revert, but don't touch `ggml/src/ggml-sycl/CMakeLists.txt`.

@CISC (Collaborator) commented Dec 31, 2025

Perfect.

@caohaoyuabcd

This comment was marked as off-topic.

@CISC

This comment was marked as off-topic.

@f291400 (Contributor, Author) commented Dec 31, 2025

>> I'd like to inquire when this model will be supported. I see that it has been open-sourced, and I want to try it out as soon as possible. https://huggingface.co/tencent/Youtu-LLM-2B
>
> When it is ready and merged. However, it also requires support in minja for its chat template: ochafik/minja#24

Thanks. I will change `rsplit` to `split` in `chat_template.json`.

@CISC (Collaborator) commented Dec 31, 2025

> Thanks. I will change `rsplit` to `split` in `chat_template.json`.

Don't; just use it properly instead, as I suggested here: https://huggingface.co/tencent/Youtu-LLM-2B/discussions/1
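
For context: `split` and `rsplit` are not interchangeable. Here is a minimal Python sketch (an illustration, not code from the PR or the template) of where they diverge; the two only agree when no `maxsplit` limit is given, and a template that calls `rsplit` with a limit relies on splitting from the right:

```python
# split vs. rsplit: identical without a maxsplit limit, different with one.
text = "a.b.c"

print(text.split("."))      # ['a', 'b', 'c']
print(text.rsplit("."))     # ['a', 'b', 'c']   (same result)

print(text.split(".", 1))   # ['a', 'b.c']      (split at the leftmost '.')
print(text.rsplit(".", 1))  # ['a.b', 'c']      (split at the rightmost '.')
```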

@f291400 (Contributor, Author) commented Dec 31, 2025

@ngxson @ggerganov
Could you please review the code and check if there are any remaining bugs that need to be addressed? I would like to merge these changes into the main branch as soon as possible.

@ngxson (Collaborator) commented Jan 1, 2026

The lint CI fails; please fix it before we can merge.

@CISC (Collaborator) commented Jan 1, 2026

> The lint CI fails; please fix it before we can merge.

LOL, I think it somehow just picks up the previous error from master...

@f291400 Try rebasing; that should fix the CI.

@ngxson (Collaborator) commented Jan 1, 2026

If it's fixed on master, then I think it's OK to merge as-is, @CISC?

Besides, @f291400, if you want reviews to be fast and efficient, read the contribution guidelines and validate your changes carefully.

This PR is created from your master branch, so maintainers cannot push fixes directly here; the PR has also been moved 2-3 times, which makes our work extremely inefficient.

@CISC (Collaborator) commented Jan 1, 2026

> If it's fixed on master, then I think it's OK to merge as-is, @CISC?

I ran `flake8` locally and it doesn't report any errors, and the line number given by the CI is bogus, so I think it should be fine.

@svlys commented Jan 1, 2026

Thank you for acknowledging this submission, which will greatly advance the use of youtu-llm on llama.cpp.

@ngxson (Collaborator) commented Jan 1, 2026

I attempted to merge via the GH web UI but failed, so unfortunately you need to fix the merge conflict yourself, @f291400.

@CISC (Collaborator) commented Jan 1, 2026

> This PR is created from your master branch, so maintainers cannot push fixes directly here; the PR has also been moved 2-3 times, which makes our work extremely inefficient.

So, it seems GitHub started allowing this now?

@ngxson (Collaborator) commented Jan 1, 2026

> So, it seems GitHub started allowing this now?

No idea, probably allowed via the web UI only?

I never have problems applying patches via the web UI. But if I do `gh pr checkout` locally and the PR was created from the fork's master branch, I always get a permission error on `git push`.

@CISC (Collaborator) commented Jan 1, 2026

>> So, it seems GitHub started allowing this now?
>
> No idea, probably allowed via the web UI only?
>
> I never have problems applying patches via the web UI. But if I do `gh pr checkout` locally and the PR was created from the fork's master branch, I always get a permission error on `git push`.

I'm pretty sure merging from master used to fail.

@ngxson merged commit ced765b into ggml-org:master Jan 1, 2026
72 checks passed
srogmann pushed a commit to srogmann/llama.cpp that referenced this pull request Jan 1, 2026
* Support Youtu-VL Model

* merge code

* fix bug

* revert qwen2 code & support rsplit in minja.hpp

* update warm info

* fix annotation

* u

* revert minja.hpp

* fix

* Do not write routed_scaling_factor to gguf when routed_scaling_factor is None (see the sketch after this list)

* fix expert_weights_scale

* LGTM after whitespace fixes

* fix

* fix

* fix

* layers to layer_index

* enum fix

---------

Co-authored-by: Xuan-Son Nguyen <[email protected]>
Co-authored-by: Sigbjørn Skjæret <[email protected]>
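
The commit "Do not write routed_scaling_factor to gguf when routed_scaling_factor is None" above describes a common conversion-script guard: only emit a metadata key when the HF config actually provides a value. A minimal, self-contained Python sketch of that pattern follows; the function and key names are illustrative assumptions, not code from this PR:

```python
# Hypothetical sketch (names are illustrative, not from the PR): emit a
# GGUF-style metadata key only when the HF config provides a value, instead
# of writing None into the metadata.
def write_moe_metadata(hparams: dict, metadata: dict) -> None:
    scale = hparams.get("routed_scaling_factor")
    if scale is not None:
        # Assumption: routed_scaling_factor maps to the expert-weights scale,
        # as in similar MoE converters.
        metadata["expert_weights_scale"] = float(scale)

meta_present: dict = {}
meta_absent: dict = {}
write_moe_metadata({"routed_scaling_factor": 2.5}, meta_present)
write_moe_metadata({}, meta_absent)
print(meta_present)  # {'expert_weights_scale': 2.5}
print(meta_absent)   # {}  (key omitted rather than set to None)
```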
ronaldmannak pushed a commit to PicoMLX/llama.cpp that referenced this pull request Jan 2, 2026
ronaldmannak pushed a commit to PicoMLX/llama.cpp that referenced this pull request Jan 9, 2026