feat(agent-interface): reconcile per-harness reasoning-effort sets (gtm#398 Part A) #12

Open
vutuanlinh2k2 wants to merge 1 commit into main from feat/reconcile-harness-effort-sets

@vutuanlinh2k2

Contributor

Part A of gtm-agent#398 — make the chat reasoning-effort picker show only what each harness can really run.

What

reasoningEffortsFor/harnessReasoningEfforts previously sliced the canonical ladder by a per-harness
ceiling, which offered levels the cli-bridge adapters don't accept. Add an explicit
harnessReasoningEffortsOverride (preferred over the ceiling slice) keyed to the real adapter sets:

| harness | was | now (real adapter set) |
| --- | --- | --- |
| codex | none·minimal·low·medium·high | minimal·low·medium·high |
| claude-code | none…ultracode | low·medium·high·xhigh·ultracode (--effort ladder; ultracode ≈ max) |
| pi / openclaw | none…ultracode | minimal·low·medium·high·xhigh |
| kimi-code | none·minimal·low·medium·high | minimal·high (binary --thinking toggle) |

Also: acp added to harnessIgnoresEffort (its runner reads no reasoningEffort).

none is dropped from the clamp-based harnesses where it's inert (≡ auto, which the picker always
shows). The ceiling table keeps only the no-thinking runners (cli-base/nanoclaw); router/model-driven
harnesses (opencode, gemini) keep the full ladder and are narrowed by the model (gtm side, Part C).
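
For reviewers who want the lookup order at a glance, here is a minimal sketch of "override first, then ceiling slice, then full ladder". It is not the package's code: the ladder order is inferred from the ranges quoted above, the `"none"` ceiling for cli-base/nanoclaw is an assumption, and `effortsForSketch` / `honorsEffortSketch` are hypothetical names standing in for the real helpers.

```typescript
// Assumed canonical ladder, inferred from the ranges quoted in this PR.
const LADDER = ["none", "minimal", "low", "medium", "high", "xhigh", "ultracode"] as const;
type Effort = (typeof LADDER)[number];

// Explicit per-harness sets, copied from the "now" column of the table.
const OVERRIDES = new Map<string, readonly Effort[]>([
  ["codex", ["minimal", "low", "medium", "high"]],
  ["claude-code", ["low", "medium", "high", "xhigh", "ultracode"]],
  ["pi", ["minimal", "low", "medium", "high", "xhigh"]],
  ["openclaw", ["minimal", "low", "medium", "high", "xhigh"]],
  ["kimi-code", ["minimal", "high"]],
]);

// Assumption: the no-thinking runners are capped at "none".
const CEILINGS = new Map<string, Effort>([
  ["cli-base", "none"],
  ["nanoclaw", "none"],
]);

// Harnesses whose runner reads no reasoningEffort at all.
const IGNORES_EFFORT = new Set<string>(["acp"]);

// Hypothetical stand-in for harnessHonorsEffort.
function honorsEffortSketch(harness: string): boolean {
  return !IGNORES_EFFORT.has(harness);
}

// Hypothetical stand-in for harnessReasoningEfforts.
function effortsForSketch(harness: string): readonly Effort[] {
  // 1. An explicit override wins over any ceiling.
  const override = OVERRIDES.get(harness);
  if (override !== undefined) return override;

  // 2. Otherwise slice the ladder from "none" up to and including the ceiling.
  const ceiling = CEILINGS.get(harness);
  if (ceiling !== undefined) return LADDER.slice(0, LADDER.indexOf(ceiling) + 1);

  // 3. Router/model-driven harnesses keep the full ladder (narrowed elsewhere by model).
  return LADDER;
}

console.log(effortsForSketch("kimi-code")); // the binary pair from the table
console.log(honorsEffortSketch("acp"));
```

Keeping the override as a separate table, rather than encoding it as a ceiling plus a floor, lets non-contiguous sets such as kimi-code's minimal/high pair be expressed directly.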

Why

Audit found the picker's offered effort levels diverge from what each CLI adapter actually accepts
(codex degrades xhigh→high, claude has no none/minimal, kimi is binary, etc.). Full write-up in
gtm-agent#398.
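
The divergence the audit describes can be illustrated as a set difference between what the picker offered and what an adapter accepts. This is a sketch only: `unsupportedLevels` is a hypothetical helper, and the "offered" lists are the "was" column of the table above, with the `none…ultracode` range expanded under the assumption that the ladder is none, minimal, low, medium, high, xhigh, ultracode.

```typescript
// Hypothetical helper: levels the picker offered that the adapter does not accept.
function unsupportedLevels(
  offered: readonly string[],
  accepted: readonly string[],
): string[] {
  const acceptedSet = new Set(accepted);
  return offered.filter((level) => !acceptedSet.has(level));
}

// Assumed expansion of "none…ultracode" (the previous claude-code offering).
const previouslyOfferedClaude = ["none", "minimal", "low", "medium", "high", "xhigh", "ultracode"];
const claudeAccepted = ["low", "medium", "high", "xhigh", "ultracode"];

// Previous kimi-code offering versus its binary toggle.
const previouslyOfferedKimi = ["none", "minimal", "low", "medium", "high"];
const kimiAccepted = ["minimal", "high"];

console.log(unsupportedLevels(previouslyOfferedClaude, claudeAccepted)); // ["none", "minimal"]
console.log(unsupportedLevels(previouslyOfferedKimi, kimiAccepted)); // ["none", "low", "medium"]
```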

Verification

  • tsc --noEmit clean; build clean; package suite 21/21 (updated per-harness assertions +
    harnessHonorsEffort("acp") === false).
  • Changeset: @tangle-network/agent-interface minor.

Next in the cascade

sandbox-ui consumes this (kimi labels + neutral copy) → gtm bumps both + populates per-model
supportsReasoning. Tracked in gtm-agent#398 (Parts B, C).

…th real adapters

The chat effort picker offered levels harnesses can't actually run. Add an explicit
`harnessReasoningEffortsOverride` keyed to the real cli-bridge adapter sets, preferred by
`harnessReasoningEfforts` over the `none…ceiling` slice:

- codex        → minimal/low/medium/high   (drop the inert `none`)
- claude-code  → low/medium/high/xhigh/ultracode  (its `--effort` ladder; ultracode ≈ max)
- pi / openclaw → cap at xhigh
- kimi-code    → binary minimal/high        (its `--thinking` on/off toggle)
- acp          → added to the ignore-effort set (reads no reasoningEffort)

The ceiling table now keeps only the no-thinking runners (cli-base/nanoclaw); router/model-driven
harnesses (opencode, gemini) still default to the full ladder, narrowed by the model.

gtm-agent#398 Part A.