fix: filter orphaned tool messages in _sanitize_assistant_messages by EmilyCheoh · Pull Request #8350 · AstrBotDevs/AstrBot

EmilyCheoh · 2026-05-26T08:57:08Z

After context truncation or compression removes an assistant message containing tool_calls, the corresponding role: "tool" response messages may remain in the conversation history. The API then rejects the request with:

400: unexpected tool_use_id found in tool_result blocks

Modifications / 改动点

Added a second pass in _sanitize_assistant_messages() (openai_source.py) that removes any role: "tool" message whose tool_call_id does not match a tool_calls entry in a preceding assistant message
Acts as a last-line-of-defense before API dispatch, complementing the existing fix_messages() in ContextTruncator
This is NOT a breaking change. / 这不是一个破坏性变更。

Screenshots or Test Results / 运行截图或测试结果

Error before fix:
[sources.openai_source]: Chat Model request error: Error code: 400 - {'error': {'message': 'unexpected tool_use_id found in tool_result blocks: toolu_01AGPDyN5PStuEuoumrdgC9o. Each tool_result block must have a corresponding tool_use block in the previous message.'}}

After fix, orphaned messages are silently filtered and the request succeeds:
[sources.openai_source]: Filtered 4 orphaned tool message(s)

Checklist / 检查清单

😊 If there are new features added in the PR, I have discussed it with the authors through issues/emails, etc.
/ 如果 PR 中有新加入的功能，已经通过 Issue / 邮件等方式和作者讨论过。
👀 My changes have been well-tested, and "Verification Steps" and "Screenshots" have been provided above.
/ 我的更改经过了良好的测试，并已在上方提供了“验证步骤”和“运行截图”。
🤓 I have ensured that no new dependencies are introduced, OR if new dependencies are introduced, they have been added to the appropriate locations in requirements.txt and pyproject.toml.
/ 我确保没有引入新依赖库，或者引入了新依赖库的同时将其添加到 requirements.txt 和 pyproject.toml 文件相应位置。
😮 My changes do not introduce malicious code.
/ 我的更改没有引入恶意代码。

Summary by Sourcery

Bug Fixes:

Remove tool messages whose tool_call_id no longer matches any preceding assistant tool_calls after context truncation, avoiding 400 errors from unexpected tool_use_ids.

Implement filtering of orphaned tool messages from cleaned messages to prevent API errors.

gemini-code-assist

Code Review

This pull request introduces logic to filter out orphaned tool messages from the OpenAI payload to prevent 400 API errors caused by context truncation. The review feedback suggests making the message and tool call parsing more robust by supporting both dictionary and object attribute access, as the messages might not always be dictionaries.

gemini-code-assist · 2026-05-26T08:58:07Z

+        valid_tc_ids = set()
+        final: list = []
+        _orphan_count = 0
+        for msg in cleaned:
+            if not isinstance(msg, dict):
+                final.append(msg)
+                continue
+            role = msg.get("role")
+            if role == "assistant" and msg.get("tool_calls"):
+                valid_tc_ids = {tc["id"] for tc in msg["tool_calls"] if isinstance(tc, dict) and "id" in tc}
+                final.append(msg)
+            elif role == "tool":
+                if msg.get("tool_call_id") in valid_tc_ids:
+                    final.append(msg)
+                    valid_tc_ids.discard(msg.get("tool_call_id"))
+                else:
+                    _orphan_count += 1
+            else:
+                valid_tc_ids = set()
+                final.append(msg)


The current implementation assumes that all messages in cleaned are dictionaries and that tool_calls contains only dictionary elements. If cleaned contains Message objects (or if tool_calls contains ToolCall objects), the code will either skip processing them or raise a TypeError / KeyError.

To make this sanitization robust and adhere to defensive programming practices, we should support both dictionary and object attribute access for role, tool_calls, and tool_call_id.

valid_tc_ids = set() final: list = [] _orphan_count = 0 for msg in cleaned: role = msg.get("role") if isinstance(msg, dict) else getattr(msg, "role", None) tool_calls = msg.get("tool_calls") if isinstance(msg, dict) else getattr(msg, "tool_calls", None) if role == "assistant" and isinstance(tool_calls, list) and tool_calls: valid_tc_ids = { tc["id"] if isinstance(tc, dict) else getattr(tc, "id", None) for tc in tool_calls } valid_tc_ids.discard(None) final.append(msg) elif role == "tool": tool_call_id = msg.get("tool_call_id") if isinstance(msg, dict) else getattr(msg, "tool_call_id", None) if tool_call_id in valid_tc_ids: final.append(msg) valid_tc_ids.discard(tool_call_id) else: _orphan_count += 1 else: valid_tc_ids = set() final.append(msg)

sourcery-ai

Hey - I've left some high level feedback:

The logic around valid_tc_ids would be clearer and less error-prone if you mutated a single set (valid_tc_ids.clear() / .update(...)) instead of reassigning it in different branches, which also makes the intended lifetime of the tracked IDs more obvious.
Consider explicitly typing final to reflect the expected message structure (e.g., list[dict[str, Any]]) to make the intent and constraints of the sanitization pass clearer to future maintainers.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The logic around `valid_tc_ids` would be clearer and less error-prone if you mutated a single set (`valid_tc_ids.clear()` / `.update(...)`) instead of reassigning it in different branches, which also makes the intended lifetime of the tracked IDs more obvious.
- Consider explicitly typing `final` to reflect the expected message structure (e.g., `list[dict[str, Any]]`) to make the intent and constraints of the sanitization pass clearer to future maintainers.

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

fix: guard against empty tool_call arguments and orphaned tool messages

be4a4f0

Implement filtering of orphaned tool messages from cleaned messages to prevent API errors.

dosubot Bot added size:S This PR changes 10-29 lines, ignoring generated files. area:provider The bug / feature is about AI Provider, Models, LLM Agent, LLM Agent Runner. labels May 26, 2026

gemini-code-assist Bot reviewed May 26, 2026

View reviewed changes

sourcery-ai Bot reviewed May 26, 2026

View reviewed changes

style: format to pass ruff check

6bfc0c9

dosubot Bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:S This PR changes 10-29 lines, ignoring generated files. labels May 26, 2026

github-actions Bot mentioned this pull request May 27, 2026

🦞 OpenClaw 生态日报 2026-05-27 ivanweng2077/big_model_radar#97

Open

Soulter force-pushed the master branch 3 times, most recently from a4c4a7d to 9bd38ca Compare May 28, 2026 16:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: filter orphaned tool messages in _sanitize_assistant_messages#8350

fix: filter orphaned tool messages in _sanitize_assistant_messages#8350
EmilyCheoh wants to merge 2 commits into
AstrBotDevs:masterfrom
EmilyCheoh:fix/sanitize-tool-messages

EmilyCheoh commented May 26, 2026 •

edited by sourcery-ai Bot

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 26, 2026

Uh oh!

sourcery-ai Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

EmilyCheoh commented May 26, 2026 • edited by sourcery-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Modifications / 改动点

Screenshots or Test Results / 运行截图或测试结果

Checklist / 检查清单

Summary by Sourcery

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

sourcery-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

EmilyCheoh commented May 26, 2026 •

edited by sourcery-ai Bot

Loading