Implement CoT no-op for reasoning models #8375
base: main
Conversation
This is AMAZING. Thank you @FireMasterK!! I think we'll need to think (no pun intended) about reasoning models like Qwen3 that expose their reasoning in the string... What does LiteLLM do for that? I guess it says … Also I guess we should still return a …
Force-pushed from c8b1c4e to b4e570b (Compare)
I don't think LiteLLM can tell whether a model exposes its CoT in the output string, which is unfortunate. It also seems like LiteLLM doesn't have enough mappings for reasoning models (e.g., the Qwen3 models are missing), which means we can't rely solely on LiteLLM to tell whether a model supports reasoning.
Yes, but some model providers summarize the CoT; what should we do about those? I have added the parsing of …
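The parsing mentioned above could look roughly like the sketch below: splitting a completion into its chain-of-thought and the final answer. This is a minimal illustration, assuming Qwen3-style `<think>...</think>` delimiters; the helper name and the exact delimiter format are assumptions, not the PR's actual implementation.

```python
import re

# Assumed delimiter format: some open reasoning models (e.g. Qwen3) emit
# their chain-of-thought inline, wrapped in <think>...</think> tags.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

def split_reasoning(completion: str) -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is "" if no <think> block exists."""
    match = THINK_RE.search(completion)
    if match is None:
        return "", completion.strip()
    reasoning = match.group(1).strip()
    # Remove only the first <think> block and keep the rest as the answer.
    answer = THINK_RE.sub("", completion, count=1).strip()
    return reasoning, answer

reasoning, answer = split_reasoning("<think>2+2 is 4</think>The answer is 4.")
```

Providers that only return a CoT *summary* (or no CoT at all) would produce no `<think>` block, in which case the helper simply passes the text through as the answer.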
Force-pushed from b4e570b to 2181050 (Compare)
Force-pushed from 2181050 to a6fd573 (Compare)
Force-pushed from a6fd573 to 903e7e4 (Compare)
I fixed the serialization issues, so all tests now pass! Let me know what I should do about #8375 (comment). I'm happy to create another PR or add them in this PR, whichever you prefer.
`litellm.supports_reasoning`
Additional context: https://x.com/DSPyOSS/status/1931886536221724848
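Since the thread notes that LiteLLM's mappings are incomplete (e.g. Qwen3 is missing), one option is to ask `litellm.supports_reasoning` first and fall back to a hand-maintained allowlist. The sketch below is an assumption about how such a fallback might be wired up; the allowlist contents and the prefix-matching heuristic are illustrative only.

```python
# Illustrative, hand-maintained allowlist for models that LiteLLM's
# mappings may not cover yet (contents are an assumption, not exhaustive).
KNOWN_REASONING_PREFIXES = ("qwen3", "deepseek-r1")

def model_supports_reasoning(model: str) -> bool:
    """Check LiteLLM first; fall back to a local allowlist on any failure."""
    try:
        import litellm
        if litellm.supports_reasoning(model=model):
            return True
    except Exception:
        pass  # litellm unavailable, or the model is unknown to its mappings
    # Strip any provider prefix like "ollama/" before matching.
    name = model.split("/")[-1].lower()
    return name.startswith(KNOWN_REASONING_PREFIXES)
```

This keeps LiteLLM as the source of truth where it has data, while letting the allowlist catch models it has not mapped yet.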