agent/tools: fix shorthand-schema bug that aborts agent turns mid-task by alex-fedotyev · Pull Request #63 · ClickHouse/nerve

alex-fedotyev · 2026-05-04T23:55:42Z

Summary

The Claude Agent SDK's _build_schema (claude_agent_sdk/__init__.py:393-413) converts shorthand input_schema dicts by forcing every property into required: list(properties.keys()) and silently dropping descriptions and defaults via _python_type_to_json_schema. Most of nerve/agent/tools.py uses the shorthand form, so 23 tools advertise optional-with-default fields as required to the model.

This PR adds a thin wrapper around claude_agent_sdk.tool that pre-promotes shorthand dicts to the explicit JSON Schema form the SDK passes through unchanged. properties are kept intact (so descriptions and defaults survive) and required only lists fields that have no default. The four explicit schemas already in the file (_NOTIFY_SCHEMA, _ASK_USER_SCHEMA, _REACT_SCHEMA, _SEND_STICKER_SCHEMA) are untouched.

The fix is one place; every existing and future tool that uses the shorthand decorator picks it up automatically.

Symptoms (when this has occurred)

When the model trusted a documented default and omitted the field on a call, the bundled CLI threw McpToolCallError instead of returning a normal is_error: true tool result. The exception propagates up the streaming connection and ends the agent's turn early. From the user's POV the assistant just stopped mid-task. Recurring log signature:

[WARN] Streaming stall detected: NN.Ns gap between events (stall #1)
[ERROR] MCP server "nerve" Input validation error: '<field>' is a required property
[WARN] [Stall] tool_dispatch_end tool=mcp__nerve__<tool> outcome=error durationMs=...
[ERROR] McpToolCallError: McpToolCallError: Input validation error: '<field>' is a required property
[WARN] Streaming completed with 1 stall(s), total stall time: NN.Ns

After the stream completes, Nerve's engine registers a background-task tracker, so the session looks "done" even though the model intended to continue. Tools that have repro'd this in the wild (recent logs) include mcp__nerve__plan_propose and mcp__nerve__memorize. Any tool with at least one default-annotated field is exposed:

task_search, task_create, task_list, task_update, task_done, memory_recall, conversation_history, memory_records_by_date, memorize, memory_update, category_update, sync_status, list_sources, poll_source, poll_all_sources, read_source, plan_propose, plan_list, plan_decline, skill_create, skill_run_script, hoa_execute.

Side benefit

The shorthand path also routes property values through _python_type_to_json_schema, which has no dict branch and falls back to {"type": "string"} — so descriptions and defaults are stripped before the model ever sees them. Promoting shorthand to explicit schema fixes that as well; the model now receives the parameter descriptions Nerve has been writing for months.

Test plan

pytest tests/test_tools_schemas.py -v — 78 passed (4 wrapper behaviour cases + parametrised audit of every tool in ALL_TOOLS for type/properties/required correctness and "no defaulted field is required").
pytest tests/test_session_mcp.py tests/test_engine.py — 26 passed, no regressions.
pytest -q (full suite, excluding network-bound CLI tests) — 509 passed; 2 pre-existing failures in test_bootstrap.py and test_services.py related to the docker-mcp spike, unrelated to this change.

Out of scope

Bug 2 from the original analysis — that McpToolCallError is raised at all on validation failures instead of being returned as is_error: true — lives in the bundled Claude Code CLI (/$bunfs/root/src/entrypoints/cli.js:4894), not Nerve. This PR removes the most common trigger; the upstream fix would prevent any future schema mismatch from terminating a turn.

The Claude Agent SDK's _build_schema (claude_agent_sdk/__init__.py:393-413) converts shorthand input_schema dicts by forcing every property into required: list(properties.keys()) and silently dropping descriptions and defaults via _python_type_to_json_schema. Almost every tool in this file uses the shorthand form, so 23 tools advertised optional- with-default fields as required to the model. When the model trusted a documented default and omitted the field, the SDK threw McpToolCallError, which propagates up the stream and ends the agent's turn early; Nerve's engine then logs "Streaming completed with N stall(s)" and registers a background-task tracker, leaving users with the impression that the assistant stopped mid-task. The wrapper pre-promotes shorthand dicts to the explicit JSON Schema form the SDK passes through unchanged. The four explicit schemas already in the file (_NOTIFY_SCHEMA, _ASK_USER_SCHEMA, _REACT_SCHEMA, _SEND_STICKER_SCHEMA) are untouched. Side benefit: parameter descriptions now reach the model for every tool that previously relied on the shorthand. Adds tests/test_tools_schemas.py with 78 assertions covering wrapper behaviour and a parametrised audit of every tool in ALL_TOOLS.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

agent/tools: fix shorthand-schema bug that aborts agent turns mid-task#63

agent/tools: fix shorthand-schema bug that aborts agent turns mid-task#63
alex-fedotyev wants to merge 1 commit into
ClickHouse:mainfrom
alex-fedotyev:claude/fix-tool-schema-shorthand

alex-fedotyev commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

alex-fedotyev commented May 4, 2026

Summary

Symptoms (when this has occurred)

Side benefit

Test plan

Out of scope

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant