fix: DeepSeek tool call parsing - nested objects & JSON repair by valkryhx · Pull Request #94 · CJackHwang/ds2api

valkryhx · 2026-03-16T10:39:35Z

Summary

修复 DeepSeek 工具调用解析问题，支持嵌套 JSON 对象和缺失数组括号的自动修复。

Problem

DeepSeek 在返回 tool calls 时有时会输出不规范的 JSON：

缺失数组方括号：{"todos": {"content": "task1"}, {"content": "task2"}}
嵌套对象中的方括号丢失：{"input": {"q": "value"}}, {"input": {"path": "file"}}
未加引号的键名：{tool_calls: [...]}

这些情况导致工具调用被当作普通文本返回，客户端无法识别和执行。

Solution

1. 升级正则表达式支持单层嵌套

// 修复前：无法处理嵌套 {}
var missingArrayBracketsPattern = regexp.MustCompile(`(:\s*)(\{[^{}]*\}(?:\s*,\s*\{[^{}]*\})+)`)

// 修复后：支持单层嵌套
var missingArrayBracketsPattern = regexp.MustCompile(`(:\s*)(\{(?:[^{}]|\{[^{}]*\})*\}(?:\s*,\s*\{(?:[^{}]|\{[^{}]*\})*\})+)`)

2. 添加 RepairLooseJSON 函数

修复未加引号的键名：{key: -> {"key":
修复缺失的数组括号：{"a":1}, {"b":2} -> [{"a":1}, {"b":2}]

3. 增强关键词检测

支持多种 tool call 语法：

tool_calls
function.name:
[tool_call_history]

4. 添加 OOM 保护

限制回溯搜索范围
限制 JSON 对象扫描长度

Files Changed

File	Change
internal/util/toolcalls_parse.go	新增 RepairLooseJSON 函数和正则修复
internal/util/toolcalls_test.go	新增 10+ 个测试用例
internal/util/toolcalls_candidates.go	增强关键词检测和 OOM 保护
internal/adapter/openai/tool_sieve_core.go	支持多关键词模式匹配
internal/js/helpers/stream-tool-sieve/sieve.js	JS 版本同步修复
TESTING.md	新增调试命令文档
README.md	新增测试快速参考

Test Cases Added

单层嵌套对象 - 2个元素
3个单层嵌套对象
混合嵌套 - 对象和原始值混合
4个嵌套对象
无空格逗号分隔
对象内包含数组
DeepSeek 8皇后真实输出
简单无嵌套对象
复杂单层嵌套
5个嵌套对象

Verification

go test -v -run 'TestParseToolCalls|TestRepair' ./internal/util/
# All 33 tests pass

- Fix: Expand stream sieve keywords to support function.name: and [TOOL_CALL_HISTORY] - Fix: Add repairInvalidJSONBackslashes to handle unescaped backslashes in Windows paths - Sync: Update JS stream sieve to match Go implementation - Test: Add unit tests for backslash repair and deepseek format parsing - Tool: Move repair json test tool to tests/repair_json_tool.go

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7318d1f4a8

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-16T10:44:44Z

internal/adapter/openai/tool_sieve_core.go

 		return -1
 	}
 	lower := strings.ToLower(s)
+	keywords := []string{"tool_calls", "function.name:", "[tool_call_history]"}


Restrict capture start to payloads consumeToolCapture can parse

Including function.name: and [tool_call_history] in findToolSegmentStart makes the sieve enter capture mode for text-kv style output, but consumeToolCapture still only looks for tool_calls, so these captures never become ready until stream finalization; in practice, any chunk containing function.name: can cause the rest of the response to be buffered and emitted late as plain text instead of structured tool calls.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-16T10:44:44Z

internal/util/toolcalls_parse.go

+			if i+1 < len(runes) {
+				next := runes[i+1]
+				switch next {
+				case '"', '\\', '/', 'b', 'f', 'n', 'r', 't':


Escape \n/\t when repairing Windows-style path backslashes

The repair routine treats \n, \t, \r, etc. as already-valid escapes and leaves them unchanged, so inputs like C:\new\tools (model intended literal backslashes) are decoded as newline/tab characters after JSON unmarshal; this silently mutates tool arguments and can execute the wrong path/command instead of preserving the original Windows path text.

Useful? React with 👍 / 👎.

- Upgrade missingArrayBracketsPattern regex to support single-level nested {} objects - This fixes DeepSeek's list hallucination where tool call JSON objects contain nested fields like {"input": {"q": "value"}} - Add comprehensive test cases covering 2-5 nested objects, mixed nested/primitive fields, and real DeepSeek 8-queen output patterns - Add RepairLooseJSON function to repair unquoted keys and missing array brackets Fixes: DeepSeek tool call parsing with nested JSON objects

…ds and safety limits - Add support for multiple keywords: tool_calls, function.name:, [tool_call_history] - Add OOM protection with search limits in extractToolCallObjects - Add max scan length limit in extractJSONObject to prevent OOM on unclosed objects - Update tool_sieve to handle more tool call patterns - Add loose JSON repair in parseToolCallPayload for better error recovery This improves DeepSeek tool call parsing robustness.

vercel · 2026-03-17T08:28:39Z

@valkryhx is attempting to deploy a commit to the cjack's projects Team on Vercel.

A member of the Team first needs to authorize it.

- Add targeted test commands to TESTING.md for debugging tool call issues - Add quick test commands reference in README.md - Document specific test cases for DeepSeek tool call parsing

chatgpt-codex-connector bot reviewed Mar 16, 2026

View reviewed changes

huangxun added 2 commits March 17, 2026 16:24

docs: add testing documentation for tool call debugging

cf569f4

- Add targeted test commands to TESTING.md for debugging tool call issues - Add quick test commands reference in README.md - Document specific test cases for DeepSeek tool call parsing

valkryhx changed the title ~~fix: correctly parse and emit tool calls from DeepSeek responses~~ fix: DeepSeek tool call parsing - nested objects & JSON repair Mar 17, 2026

CJackHwang closed this Mar 17, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: DeepSeek tool call parsing - nested objects & JSON repair#94

fix: DeepSeek tool call parsing - nested objects & JSON repair#94
valkryhx wants to merge 4 commits intoCJackHwang:mainfrom
valkryhx:main

valkryhx commented Mar 16, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 16, 2026

Uh oh!

chatgpt-codex-connector bot Mar 16, 2026

Uh oh!

vercel bot commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

valkryhx commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Solution

1. 升级正则表达式支持单层嵌套

2. 添加 RepairLooseJSON 函数

3. 增强关键词检测

4. 添加 OOM 保护

Files Changed

Test Cases Added

Verification

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

vercel bot commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

valkryhx commented Mar 16, 2026 •

edited

Loading