fix(condenser): When condensation is triggered by the unhandled_condensation_request condition, it will result in empty condensation. #1034
Conversation
… based on the view size Signed-off-by: CLFutureX <[email protected]>
@csmith49 hey, PTAL

@xingyaoww @enyst hey, PTAL, thanks

@OpenHands please make build, then run pre-commit on all files
I'm on it! enyst can track my progress at all-hands.dev |
```python
if view.unhandled_condensation_request:
    # Condensation triggered by a condensation request
    # should be calculated based on the view size.
    target_size = len(view) // 2
```
@CLFutureX can you please tell when this happens? Is it when the context limit may have been exceeded?
Sure.
path: openhands-sdk/openhands/sdk/agent/agent.py
```python
except LLMContextWindowExceedError:
    # If condenser is available and handles requests, trigger condensation
    if (
        self.condenser is not None
        and self.condenser.handles_condensation_requests()
    ):
        logger.warning(
            "LLM raised context window exceeded error, triggering condensation"
        )
        on_event(CondensationRequest())
        return
```
If the LLM call goes over the window limit, it raises an LLMContextWindowExceedError, which triggers a CondensationRequest. After that, in the view file (openhands-sdk/openhands/sdk/context/view.py), the presence of a CondensationRequest sets unhandled_condensation_request to True. So the next time should_condense is called it returns True, and by then the view usually holds far fewer events than max_size.
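A minimal sketch of that view-side logic (the field names and flag semantics here are assumptions based on this discussion, not the actual SDK code):

```python
# Minimal sketch, NOT the actual SDK code: field names and flag
# semantics are assumed from the discussion above.
from dataclasses import dataclass, field


@dataclass
class CondensationRequest:
    """Marker event emitted after an LLMContextWindowExceedError."""


@dataclass
class View:
    events: list = field(default_factory=list)
    max_size: int = 120

    @property
    def unhandled_condensation_request(self) -> bool:
        # Assumed: the flag is set when a CondensationRequest event is
        # present and no condensation has handled it yet.
        return any(isinstance(e, CondensationRequest) for e in self.events)

    def should_condense(self) -> bool:
        # An explicit request forces condensation even though the view
        # typically holds far fewer than max_size events at this point.
        return self.unhandled_condensation_request or len(self.events) > self.max_size
```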
@OpenHands See this discussion. I think @CLFutureX is correct: if the context limit was exceeded, we shouldn't assume the condenser will work as if max_size of events was reached; it almost certainly wasn't. What do you think about the solution, or what quick reasonable alternatives do we have here?
I'm on it! enyst can track my progress at all-hands.dev
Summary of my response to the PR comment
Answer to the question
- I agree with the premise: when LLMContextWindowExceedError occurs, event-count heuristics based on max_size are the wrong proxy. Basing target_size on the current view size for unhandled_condensation_request is directionally correct.
- However, as implemented, it can still yield an empty “forgotten” slice in small views (e.g., len(view) ≤ 2*(keep_first+1)), so condensation does nothing and doesn’t relieve token pressure.
Quick, reasonable improvements
- Minimal fix on top of the PR (recommended now; see the runnable sketch after this list):
- Keep the PR’s change: if view.unhandled_condensation_request, set target_size = len(view) // 2.
- Add a simple clamp to ensure we always forget something when structurally possible:
- head = view[:keep_first]
- events_from_tail = target_size - len(head) - 1
- available_tail = max(0, len(view) - keep_first - 1)
- tail_keep = min(max(events_from_tail, 1), available_tail)
- forgotten = view[keep_first : len(view) - tail_keep]
- If available_tail == 0, there’s no tail space; optionally force-drop the first middle event if present to guarantee progress.
- Quick alternatives (follow-ups):
- Token-aware condensation: compute tokens with llm.get_token_count(messages) and select how many tail events to keep (e.g., via binary search) so head + summary + tail fits a safe fraction of max_input_tokens. This directly addresses cases where a few long messages blow the window.
- Adaptive keep_first: if events_from_tail ≤ 0, temporarily reduce keep_first (bounded to 0) until at least one tail event can be kept and at least one middle event can be forgotten.
- Safety fallback: if the computed “forgotten” slice is empty, force forgetting at least the first middle event (index keep_first) and keep one tail event to ensure progress.
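A runnable sketch of the clamping guard from the minimal fix above. The names view, keep_first, and target_size follow the discussion; the function name and return shape are hypothetical, not the condenser's actual API:

```python
# Sketch of the clamping guard; everything beyond the names discussed
# above (view, keep_first, target_size) is a hypothetical illustration.
def select_forgotten(view: list, keep_first: int, target_size: int) -> list:
    head = view[:keep_first]
    # Reserve one slot for the summary event that replaces the forgotten span.
    events_from_tail = target_size - len(head) - 1
    available_tail = max(0, len(view) - keep_first - 1)
    # Clamp: keep at least one tail event when possible, never more than exist.
    tail_keep = min(max(events_from_tail, 1), available_tail)
    forgotten = view[keep_first : len(view) - tail_keep]
    if not forgotten and len(view) > keep_first:
        # Safety fallback: force progress by forgetting the first middle event.
        forgotten = view[keep_first : keep_first + 1]
    return forgotten


# Example: a 10-event view with keep_first=2 and target_size=len(view)//2=5
# forgets events 2..7, keeping 2 head + 2 tail events plus the summary slot.
print(select_forgotten(list(range(10)), keep_first=2, target_size=5))
```

With this guard, condensation on request can only be a no-op when the view is structurally too small to forget anything (len(view) <= keep_first).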
Recommendation
- Short-term: accept the PR’s intent and add the clamping guard so condensation on request cannot be a no-op.
- Medium-term: consider the token-aware approach so condensation responds to real token pressure rather than event counts.
enyst left a comment:
Thank you! I think PR 912 should also help here…
…enser.py to satisfy CI Co-authored-by: openhands <[email protected]>
That's right, this is a good way to optimize.
Background: When condensation is triggered by the unhandled_condensation_request condition, if the number of events in the current view is less than half of max_size, the condensation will be empty.
For example (illustrative numbers): with max_size = 120, the old target_size is 60, but after a context-window error the view may hold only 40 events; keeping 60 of 40 events forgets nothing, so the condensation is a no-op.
Optimization: When condensation is triggered by the unhandled_condensation_request condition, adjust target_size to len(view) // 2.
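A tiny arithmetic demonstration of why the view-based target makes progress where the max_size-based one does not (all numbers are illustrative assumptions, not values from the SDK):

```python
# Illustrative numbers only; max_size and the post-error view length
# are assumptions, not values from the SDK.
max_size = 120
view_len = 40  # typical view size right after a context-window error

old_target = max_size // 2  # 60: asked to keep 60 of 40 events -> nothing forgotten
new_target = view_len // 2  # 20: asked to keep 20 of 40 events -> 20 events condensed

assert old_target >= view_len  # the max_size-based heuristic is a no-op here
assert new_target < view_len   # the view-based target guarantees progress
```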