Granite Docling stopping #16438
Conversation
Commits:
- Branch: GraniteDoclingStopping. Signed-off-by: Gabe Goodhart <[email protected]>
- Branch: GraniteDoclingStopping. Signed-off-by: Gabe Goodhart <[email protected]>
- … prompt. There should not be one, even for the language models. Branch: GraniteDoclingStopping. Signed-off-by: Gabe Goodhart <[email protected]>
You need to update the test as well (where, interestingly, no one caught the additional `\n` at the end):
llama.cpp/tests/test-chat-template.cpp
Lines 214 to 219 in b1afcab
{
    /* .name= */ "ibm-granite/granite-3.0-8b-instruct",
    /* .template_str= */ "{%- if tools %}\n    {{- '<|start_of_role|>available_tools<|end_of_role|>\n' }}\n    {%- for tool in tools %}\n    {{- tool | tojson(indent=4) }}\n    {%- if not loop.last %}\n        {{- '\n\n' }}\n    {%- endif %}\n    {%- endfor %}\n    {{- '<|end_of_text|>\n' }}\n{%- endif %}\n{%- for message in messages %}\n    {%- if message['role'] == 'system' %}\n    {{- '<|start_of_role|>system<|end_of_role|>' + message['content'] + '<|end_of_text|>\n' }}\n    {%- elif message['role'] == 'user' %}\n    {{- '<|start_of_role|>user<|end_of_role|>' + message['content'] + '<|end_of_text|>\n' }}\n    {%- elif message['role'] == 'assistant' %}\n    {{- '<|start_of_role|>assistant<|end_of_role|>' + message['content'] + '<|end_of_text|>\n' }}\n    {%- elif message['role'] == 'assistant_tool_call' %}\n    {{- '<|start_of_role|>assistant<|end_of_role|><|tool_call|>' + message['content'] + '<|end_of_text|>\n' }}\n    {%- elif message['role'] == 'tool_response' %}\n    {{- '<|start_of_role|>tool_response<|end_of_role|>' + message['content'] + '<|end_of_text|>\n' }}\n    {%- endif %}\n    {%- if loop.last and add_generation_prompt %}\n    {{- '<|start_of_role|>assistant<|end_of_role|>' }}\n    {%- endif %}\n{%- endfor %}",
    /* .expected_output= */ "<|start_of_role|>system<|end_of_role|>You are a helpful assistant<|end_of_text|>\n<|start_of_role|>user<|end_of_role|>Hello<|end_of_text|>\n<|start_of_role|>assistant<|end_of_role|>Hi there<|end_of_text|>\n<|start_of_role|>user<|end_of_role|>Who are you<|end_of_text|>\n<|start_of_role|>assistant<|end_of_role|> I am an assistant <|end_of_text|>\n<|start_of_role|>user<|end_of_role|>Another question<|end_of_text|>\n<|start_of_role|>assistant<|end_of_role|>\n",
    /* .expected_output_jinja= */ "<|start_of_role|>system<|end_of_role|>You are a helpful assistant<|end_of_text|>\n<|start_of_role|>user<|end_of_role|>Hello<|end_of_text|>\n<|start_of_role|>assistant<|end_of_role|>Hi there<|end_of_text|>\n<|start_of_role|>user<|end_of_role|>Who are you<|end_of_text|>\n<|start_of_role|>assistant<|end_of_role|> I am an assistant <|end_of_text|>\n<|start_of_role|>user<|end_of_role|>Another question<|end_of_text|>\n<|start_of_role|>assistant<|end_of_role|>",
},
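As an aside (not part of the PR), the difference between the two expected strings above boils down to the trailing `\n` after the generation prompt. A minimal sketch of what the corrected `expected_output_jinja` encodes, with an illustrative helper name and a simplified message shape:

```python
# Illustrative sketch only: this helper is not llama.cpp's code, it just
# mirrors the granite chat-template layout shown in the test above.
def format_granite(messages, add_generation_prompt=True):
    out = []
    for msg in messages:
        # Each turn: <|start_of_role|>ROLE<|end_of_role|>CONTENT<|end_of_text|>\n
        out.append(
            f"<|start_of_role|>{msg['role']}<|end_of_role|>"
            f"{msg['content']}<|end_of_text|>\n"
        )
    if add_generation_prompt:
        # The fix: the assistant generation prompt is NOT followed by "\n",
        # matching .expected_output_jinja above.
        out.append("<|start_of_role|>assistant<|end_of_role|>")
    return "".join(out)

prompt = format_granite([
    {"role": "system", "content": "You are a helpful assistant"},
    {"role": "user", "content": "Hello"},
])
```

With the errant `\n` appended (as the legacy `.expected_output` had it), generation begins on a line the model never saw during training, which is one way a model can fail to terminate cleanly.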
The failed CI seems to be unrelated to the current PR.
They are definitely related, see my comment. :)
Ah yeah ok I misread the CI output.
Yikes, thanks! I'll get that fixed asap.
Branch: GraniteDoclingStopping Signed-off-by: Gabe Goodhart <[email protected]>
Description
This is a follow-up to #16110 to fix the issue of the model not terminating correctly. It turned out to be a few lingering bugs in the tokenization:

1. `<fake_token_around_image>` before the first image slice
2. `\n` before the start of the global image instead of a double newline
3. `llama-chat.cpp` had an errant `\n` at the end when adding the assistant generation prompt

I'm pretty sure (3) was the main cause of the problem, but all three are fixed here. For reference, here are the chat templates for the various models that use the granite chat template:
- granite-docling-258M: https://huggingface.co/ibm-granite/granite-docling-258M?chat_template=default#L20
- granite-3.3-8b-instruct: https://huggingface.co/ibm-granite/granite-3.3-8b-instruct?chat_template=default#L60
- granite-3.2-8b-instruct: https://huggingface.co/ibm-granite/granite-3.2-8b-instruct?chat_template=default#L65
- granite-3.1-8b-instruct: https://huggingface.co/ibm-granite/granite-3.1-8b-instruct?chat_template=default#L62
- granite-3.0-8b-instruct: https://huggingface.co/ibm-granite/granite-3.0-8b-instruct?chat_template=default#L33
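The image-marker fixes (1) and (2) above can be sketched in the same spirit. This is an illustrative reconstruction only, not the PR's actual tokenization code; the slice and global token names are placeholders:

```python
# Illustrative sketch only: placeholder token names, not llama.cpp's
# actual multimodal tokenization code.
def image_prefix(slice_tokens, global_token):
    parts = []
    for tok in slice_tokens:
        # Fix (1): <fake_token_around_image> precedes EVERY slice,
        # including the first one.
        parts.append("<fake_token_around_image>")
        parts.append(tok)
    # Fix (2): a double newline, not a single "\n", before the global image.
    parts.append("\n\n")
    parts.append("<fake_token_around_image>")
    parts.append(global_token)
    return "".join(parts)

prefix = image_prefix(["<slice_1>", "<slice_2>"], "<global-img>")
```

Under these assumptions, the buggy behavior would have been a missing marker before `<slice_1>` and a single `\n` before the global image, both of which shift the prompt away from the format the model was trained on.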