
Show reasoner model's thinking in a <<< thinking section #151

Merged

Conversation

@juodumas juodumas commented Mar 7, 2025

Parse reasoning_content in response chunks and put it in a separate <<< thinking section. Do not send the contents of this section back to the model.

Tested with the DeepSeek API and their deepseek-reasoner model.

Also tested locally with sglang --model-path Qwen/QwQ-32B-AWQ --reasoning-parser deepseek-r1:

>>> user

hi

<<< thinking

Okay, the user said "hi". I should respond in a friendly way. Maybe say "Hello! How can I assist you today?" That's a common greeting and opens the conversation for them to ask something. I need to keep it simple and welcoming. Let me check if there's anything else I should consider. No, that should be good. Alright, I'll go with that.


<<< assistant



Hello! How can I assist you today?

>>> user
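
For illustration, the "do not send thinking back" behaviour could be sketched like this (a minimal sketch, not the PR's actual code; strip_thinking_sections is a hypothetical helper applied when the chat buffer is parsed back into messages):

def strip_thinking_sections(lines):
    # drop everything inside a '<<< thinking' section so the reasoning
    # text is never included in the messages sent to the model
    kept, in_thinking = [], False
    for line in lines:
        if line.strip() == '<<< thinking':
            in_thinking = True
            continue
        if line.startswith('>>>') or line.startswith('<<<'):
            in_thinking = False
        if not in_thinking:
            kept.append(line)
    return kept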

py/chat.py Outdated

text_chunks = make_chat_text_chunks(messages, options)
render_text_chunks(text_chunks)
for chunk in text_chunks:
madox2 (Owner):
Here it would be nice to re-use the render_text_chunks helper function; it covers more functionality like AIRedo. For example, filter the thinking chunks first, then, if not empty, render them with render_text_chunks, and then handle the content.
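
One way the suggested flow could be sketched (illustrative only: render_text_chunks' real signature may differ, and with a streaming generator the chunks need buffering, e.g. via itertools.tee):

import itertools

# tee buffers the stream internally so it can be scanned twice
pass1, pass2 = itertools.tee(text_chunks)
thinking = [chunk for chunk in pass1 if chunk.get('thinking')]
if thinking:
    render_text_chunks(iter(thinking))  # re-uses AIRedo handling etc.
render_text_chunks(chunk for chunk in pass2 if chunk.get('content'))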

juodumas (Contributor, Author):

For me it's confusing how render_text_chunks handles insertion to support in-place edits vs. chat.
If you want this in the render_text_chunks function, could you please first check if you agree with change #148?

py/utils.py Outdated
@@ -156,10 +156,12 @@ def parse_chat_messages(chat_content):
     for line in lines:
         match line:
             case '>>> system':
-                messages.append({'role': 'system', 'content': [{ 'type': 'text', 'text': '' }]})
+                messages.append({'role': 'system', 'content': ''})
madox2 (Owner):

Why is this change needed? It also breaks the unit tests.
Btw, it would be nice to have a unit test checking that the thinking section is omitted (tests/chat_test.py).
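
Such a test could look roughly like this (a sketch only; the expected message shape depends on parse_chat_messages' output format at the time):

def test_chat_omits_thinking_section():
    chat_content = (
        '>>> user\n'
        'hi\n'
        '<<< thinking\n'
        'Some reasoning text.\n'
        '<<< assistant\n'
        'Hello!\n'
    )
    messages = parse_chat_messages(chat_content)
    assert [m['role'] for m in messages] == ['user', 'assistant']
    assert 'Some reasoning text' not in str(messages)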

juodumas (Contributor, Author):

It's a regression introduced by a recent change. Perhaps it should have been a separate PR.

A content array is not supported by some providers, namely:

  1. groq:
{"error":{"message":"'messages.0' : for 'role:system' the following must be satisfied[('messages.0.content' : value must be a string)]","type":"invalid_request_error"}}
  2. deepseek:
Failed to deserialize the JSON body into the target type: messages[0]: data did not match any variant of untagged enum ChatCompletionRequestContent at line 1 column 109

I haven't looked at the tests yet because I wanted to know if you will accept this PR.

madox2 (Owner):
Okay, sounds good

madox2 (Owner):
If I may ask you to keep the previous structured format here and move this logic to make_chat_text_chunks, reformatting the text messages before the request. That would make things easier when I'm merging custom providers support.
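
For illustration, the reformatting in make_chat_text_chunks could be sketched as follows (flatten_content is a hypothetical helper, and only text parts are assumed):

def flatten_content(message):
    # collapse the structured content array into a plain string for
    # providers (groq, deepseek) that require string content
    content = message['content']
    if isinstance(content, list):
        content = ''.join(p['text'] for p in content if p.get('type') == 'text')
    return {**message, 'content': content}

messages = [flatten_content(m) for m in messages]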

py/utils.py Outdated
if reasoning_content := delta.get('reasoning_content'):
    return {"thinking": reasoning_content}
if content := delta.get('content'):
    return {"content": content}
madox2 (Owner):

If there is no special reason, handling the chunk the same way as in map_chunk_no_stream would probably make it simpler, e.g.:

delta = _choices(resp)[0].get('message', {})
reasoning_content = delta.get('reasoning_content', '')
content = delta.get('content', '')
return {"content": content, "thinking": reasoning_content}

juodumas (Contributor, Author):

The reason is that a chunk can have {"content": None}, so delta.get('content', '') would return None, not an empty string. But I can simplify it, taking inspiration from #150.
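
A None-safe variant of the suggested shape might be (a sketch only; this uses the streaming 'delta' field rather than the no-stream 'message'):

delta = _choices(resp)[0].get('delta', {})
return {
    "content": delta.get('content') or '',  # guards against {"content": None}
    "thinking": delta.get('reasoning_content') or '',
}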

madox2 (Owner) commented Mar 7, 2025

Hi, thank you for the contribution! Even though reasoning_content is not part of the OpenAI API, I generally like the idea of the thinking sections. I would not be surprised if other providers offered a similar API. When we are done with the custom provider extensions, this feature might also be useful for better model integrations.

I currently can't test R1, but made a quick code review.

juodumas (Contributor, Author) commented Mar 10, 2025

Thanks for the review.
Let me know if the code is acceptable after my changes and if you would like to see unit test updates.

madox2 (Owner) commented Mar 10, 2025

It works great! I will write some unit tests myself while merging it into provider-extensions.

Thanks a lot @juodumas and @runningmaverick

madox2 merged commit f03db32 into madox2:main on Mar 10, 2025
1 check passed