Allow DSPy to use the native reasoning from models #8822
base: main
Conversation
Force-pushed from 35a60a7 to 8de0a65
if not litellm.supports_reasoning(lm.model):
    return False

if "reasoning_effort" in lm_kwargs:
We don't need to check `lm_kwargs`; instead we should probably overwrite `reasoning_effort` if native reasoning is on.
`reasoning_effort` has multiple valid values: low, medium, and high. I am setting the default value to low, but if users specify one of the other values we want to keep it unchanged.
The default `reasoning_effort` on OpenAI reasoning models is `medium`, so I'd be careful setting it differently here.
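For illustration, a minimal sketch of the precedence being discussed (the function and argument names are hypothetical, not the actual DSPy code): only apply a default when the user hasn't set `reasoning_effort` anywhere.

```python
# Illustrative sketch, not the actual implementation: pick reasoning_effort with
# per-call kwargs first, then LM-level kwargs, and a default only as a last resort.
def resolve_reasoning_effort(lm_kwargs: dict, lm_default_kwargs: dict, default: str = "medium") -> str:
    if "reasoning_effort" in lm_kwargs:
        return lm_kwargs["reasoning_effort"]          # per-call value wins
    if "reasoning_effort" in lm_default_kwargs:
        return lm_default_kwargs["reasoning_effort"]  # LM-level value next
    return default                                    # otherwise fall back to a default
```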
def __init__(
    self,
    signature: str | type[Signature],
    rationale_field: FieldInfo | None = None,
should we show deprecation warnings if these arguments are used?
sg, let me add that
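For reference, a minimal sketch of what such a deprecation warning could look like (illustrative only; the class name and message are placeholders, not the actual change):

```python
import warnings
from typing import Any

class ChainOfThought:  # simplified stand-in for the real class
    def __init__(self, signature: Any, rationale_field: Any = None, **kwargs: Any):
        # Illustrative: warn when the legacy argument is passed.
        if rationale_field is not None:
            warnings.warn(
                "`rationale_field` is deprecated and may be removed in a future release.",
                DeprecationWarning,
                stacklevel=2,
            )
```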
def _process_completion(self, response, merged_kwargs):
    """Process the response of OpenAI chat completion API and extract outputs.
Is it auto-formatting? Shall we add a lint rule so that linting does not change lines unrelated to a PR's scope?
That's my plugin for trailing whitespace. I didn't expect this PR to be this big in the first place...
tool_calls = []
reasoning_contents = []

for output_item in response.output:
Sorry, I'm confused. What's the meaning of multiple items in the `output` field? I thought it'd be the same as `choices` in chat completion.
This is definitely a weird point of the Responses API: all these items in `output` belong to one answer. Basically the Responses API doesn't have the `n` arg that controls the number of answers to be generated.
Talked offline. It seems all output items represent one choice.
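A sketch of that structure (assuming a Responses API response object; field names follow the OpenAI Responses API object shape and this is not the exact DSPy code): every item in `response.output` belongs to the same single answer, with no `n`-style list of choices.

```python
# Reasoning blocks, message text, and tool calls are separate output items,
# but together they make up one completion.
texts, reasoning_parts, tool_calls = [], [], []
for output_item in response.output:
    if output_item.type == "reasoning":
        reasoning_parts.append(output_item)                 # reasoning block
    elif output_item.type == "message":
        texts.extend(c.text for c in output_item.content)   # the answer text
    elif output_item.type == "function_call":
        tool_calls.append(output_item)                      # tool calls from the same answer
```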
Force-pushed from e806ef5 to 41a0313
Force-pushed from 41a0313 to 56973f0
return formatted

@classmethod
def adapt_to_native_lm_feature(cls, lm, lm_kwargs) -> bool:
For the naming, how about `is_native_lm_feature_available`?
if "reasoning_effort" in lm_kwargs:
    # `lm_kwargs` overrides `lm.kwargs`
    reasoning_effort = lm_kwargs["reasoning_effort"]
nit: we can simplify with `reasoning_effort = lm_kwargs.get("reasoning_effort") or lm.kwargs.get("reasoning_effort")`.
Ohh good catch! The structure is right but the code is actually buggy: if users explicitly turn off native reasoning, we should respect that setting, so I am not using `get` to capture the value. Updated the code.
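A sketch of the membership-check approach being described, so an explicit user value (even a falsy one that turns native reasoning off) is respected rather than overridden by `or`-style fallbacks (variable names assumed from the surrounding diff):

```python
# Per-call kwargs take precedence over the LM-level kwargs; an explicitly set
# falsy/None value is kept instead of being replaced by a fallback.
if "reasoning_effort" in lm_kwargs:
    reasoning_effort = lm_kwargs["reasoning_effort"]
elif "reasoning_effort" in lm.kwargs:
    reasoning_effort = lm.kwargs["reasoning_effort"]
else:
    reasoning_effort = None  # nothing set anywhere
```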
from dspy.adapters.types.reasoning import Reasoning

extended_signature = signature.prepend(name="reasoning", field=OutputField(), type_=Reasoning)
Should we continue to return a string in the reasoning field for backward compatibility?
`dspy.Reasoning` is a str-like type. Basically:
- Printing a `dspy.Reasoning` prints out the string part, i.e., the content.
- Users can directly compare a `dspy.Reasoning` to a string.

We need this specific type to modify the signature inside the Adapter. I don't feel this is a perfectly clean solution, but it's the most robust way in my mind. Would like to hear your thoughts!
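A minimal illustration of that str-like behavior (a plain `str` subclass used here only for demonstration, not the actual `dspy.Reasoning` implementation):

```python
class Reasoning(str):
    """Behaves like a plain string for printing and comparison,
    but remains a distinct type the adapter can detect in the signature."""

r = Reasoning("Let's reason step by step...")
print(r)                                    # prints the string content
assert r == "Let's reason step by step..."  # compares equal to the plain string
assert isinstance(r, Reasoning)             # yet remains distinguishable by type
```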
dspy/clients/lm.py (outdated)
not self._warned_zero_temp_rollout
and rollout_id is not None
and (temperature is None or temperature == 0)

# Normalize reasoning_effort to get reasoning summaries (for OpenAI reasoning models which don't expose
Shall we move this logic to `_convert_chat_request_to_responses_request`?
nice catch, done
    for content_item in output_item.content:
        reasoning_contents.append(content_item.text)
elif getattr(output_item, "summary", None) and len(output_item.summary) > 0:
    for summary_item in output_item.summary:
When is this summary field used?
This is another weird spot of the Responses API: according to the API documentation, the reasoning content can be contained in either the `content` or the `summary` of the reasoning block (https://platform.openai.com/docs/api-reference/responses/object). While we treat `content` as the golden source, the API itself suggests the reasoning may instead be contained in `summary`.
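A condensed sketch of that content-first, summary-fallback extraction (attribute names follow the Responses API reasoning object; this is a simplification of the diff above, not the exact DSPy code):

```python
reasoning_contents = []
for output_item in response.output:
    if output_item.type != "reasoning":
        continue
    if getattr(output_item, "content", None):
        # Prefer the `content` field when the API populates it.
        reasoning_contents.extend(item.text for item in output_item.content)
    elif getattr(output_item, "summary", None):
        # Fall back to `summary` when only a summary is returned.
        reasoning_contents.extend(item.text for item in output_item.summary)
```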
Force-pushed from e91f7f7 to c65b774
This is a followup work of #8764. This problem is harder than it looks because:

- The model's native reasoning should go into the `reasoning` field.
- If users don't have a `reasoning` field in their output fields, `dspy.Predict` should not return `reasoning`. Basically we need to keep the signature faithful.

That basically means we need a way to identify that users are specifying "reasoning" in their signature, and that it fits into our definition of a predefined type. Users are not forbidden from using a reasoning field of string type, but it won't use the native reasoning.
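A hedged usage sketch of that distinction, based on the description above (signature and field names are illustrative; details of the final API may differ):

```python
import dspy

class QANative(dspy.Signature):
    """Answer the question."""
    question: str = dspy.InputField()
    reasoning: dspy.Reasoning = dspy.OutputField()  # predefined type: native reasoning is used
    answer: str = dspy.OutputField()

class QAPlain(dspy.Signature):
    """Answer the question."""
    question: str = dspy.InputField()
    reasoning: str = dspy.OutputField()             # plain str: prompted reasoning, not native
    answer: str = dspy.OutputField()

predict_native = dspy.Predict(QANative)
predict_plain = dspy.Predict(QAPlain)
```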