
feat(ai-proxy): pure helpers for the post-override effective request#13370

Open
janiussyafiq wants to merge 7 commits into apache:master from janiussyafiq:refactor/ai-proxy-apply-instance-overrides
Conversation

@janiussyafiq (Contributor) commented May 13, 2026

Description

Setup for the upcoming ai-cache plugin. ai-cache needs the request body as it would be sent upstream — post-converter, post-override — to compute a cache key. Today that view only exists inside build_request, which isn't callable in isolation (HTTP, signing, auth). This PR extracts it as a pure helper.

effective_request_for_cache(ctx) in apisix/plugins/ai-proxy/base.lua returns the post-converter, post-override body. The inline override block in build_request is extracted to a pure apply_instance_overrides and called at the same call site — zero behavior change.

Two wrinkles worth noting:

  • The helper applies the converter first, then the overrides — same order as build_request. Without the converter step, the helper would return a pre-converter body and ai-cache would hash a different shape than what hits upstream.
  • resolve_target_protocol mirrors the routing in before_proxy so ai-cache (priority 1035, access phase) can compute the view before before_proxy populates ctx.ai_target_protocol.

The ai-request-rewrite change is a regression fix surfaced in review: the refactor changed build_request's extra_opts contract but missed the sidecar caller, which kept passing a now-dead field. Pass ai_instance = conf instead.

Which issue(s) this PR fixes:

N/A — new internal API surface for the upcoming ai-cache plugin.

Behavior change

None for ai-proxy / ai-proxy-multi. The ai-request-rewrite change restores conf.options propagation that the refactor inadvertently broke.

Checklist

  • I have explained the need for this PR and the problem it solves
  • I have explained the changes or the new features added to this PR
  • I have added tests corresponding to this change
  • I have updated the documentation to reflect this change — N/A, internal helper surface
  • I have verified that this change is backward compatible

Move the three-step instance-override application (options flat overwrite,
override.llm_options capability hook, override.request_body deep merge) out
of the inline block in ai-providers/base.lua build_request and into a new
pure helper in apisix/plugins/ai-proxy/base.lua. build_request calls the
helper at the same point the inline code lived (post-converter), so the
body sent upstream is unchanged.
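The three-step application can be sketched as a pure function. This is a hypothetical reconstruction from the description above, not the actual helper in apisix/plugins/ai-proxy/base.lua; deep_merge and the capability-hook name apply_llm_options are stand-ins.

```lua
-- Hypothetical sketch of the three-step override application; names are
-- stand-ins, not the real code in apisix/plugins/ai-proxy/base.lua.
local function deep_merge(dst, src)
  for k, v in pairs(src) do
    if type(v) == "table" and type(dst[k]) == "table" then
      deep_merge(dst[k], v)
    else
      dst[k] = v
    end
  end
  return dst
end

local function apply_instance_overrides(body, ai_instance, ai_provider)
  -- step 1: flat overwrite of top-level options (model, temperature, ...)
  for k, v in pairs(ai_instance.options or {}) do
    body[k] = v
  end
  local override = ai_instance.override or {}
  -- step 2: provider capability hook for override.llm_options
  if override.llm_options and ai_provider.apply_llm_options then
    ai_provider.apply_llm_options(body, override.llm_options)
  end
  -- step 3: deep merge of override.request_body onto the body
  if override.request_body then
    deep_merge(body, override.request_body)
  end
  return body
end

-- placeholder values, mirroring the precedence: options first, then override
local body = apply_instance_overrides(
  { model = "client-model", max_tokens = 10 },
  { options = { model = "forced-model", temperature = 0.42 },
    override = { request_body = { metadata = { source = "edge" } } } },
  {})
```

Mutating the body in place and returning it matches the "mutates in place" contract noted later in this PR; a caller that needs the original body must copy it first.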

extra_opts no longer carries the four override-derived fields; it passes
the picked ai_instance through and the helper reads from it directly.

Zero behavior change. Motivation: ai-cache (planned follow-up plugin)
needs to compute its cache key from the post-override effective body
without going through build_request, which performs the upstream HTTP
call, signing, and keepalive.
@dosubot dosubot Bot added size:M This PR changes 30-99 lines, ignoring generated files. tech debt labels May 13, 2026
…elpers

Two pure helpers on top of apply_instance_overrides (introduced in the
preceding refactor), both in apisix/plugins/ai-proxy/base.lua:

- effective_model(ctx) returns ai_instance.options.model when the operator
  forces a model on the instance, falling back to ctx.var.request_llm_model
  (the client-supplied model that detect_request_type mirrors).

- effective_request_for_cache(ctx) returns the request body as it would be
  sent upstream: reads the parsed body, resolves the target protocol from
  ctx.ai_client_protocol against the provider's capabilities (so peer
  plugins running in access phase before before_proxy can still get the
  post-override view), and applies apply_instance_overrides.

A small internal resolve_target_protocol helper mirrors the routing logic
in before_proxy so callers don't have to wait for ctx.ai_target_protocol
to be populated.

These helpers exist for ai-cache (planned follow-up) to compute a cache
key over the effective body without invoking build_request (which would
make the upstream HTTP call). The signatures are pure and ctx-driven.
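The routing that resolve_target_protocol mirrors can be sketched as follows. This is an illustration of the behavior described above, not the real function; the provider fields (capabilities, native_protocol, converters) are assumed names.

```lua
-- Hypothetical sketch of resolve_target_protocol; provider field names
-- (capabilities, native_protocol, converters) are assumptions.
local function resolve_target_protocol(ctx, ai_provider)
  if ctx.ai_target_protocol then
    -- fast path: before_proxy already ran and cached the routing decision
    return ctx.ai_target_protocol, ctx.ai_converter
  end
  local client = ctx.ai_client_protocol
  if ai_provider.capabilities and ai_provider.capabilities[client] then
    -- the provider speaks the client's protocol natively: no converter
    return client, nil
  end
  -- fall back to the provider's native protocol plus a request converter
  local converter = ai_provider.converters and ai_provider.converters[client]
  return ai_provider.native_protocol, converter
end

-- an anthropic-messages client routed to an openai-chat-only provider
local provider = {
  capabilities = { ["openai-chat"] = true },
  native_protocol = "openai-chat",
  converters = { ["anthropic-messages"] = function(b) return b end },
}
local proto, conv = resolve_target_protocol(
  { ai_client_protocol = "anthropic-messages" }, provider)
```

The fast path is what keeps the helper cheap once before_proxy has run; the slow path is what lets ai-cache (priority 1035) call it earlier in the access phase.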

Test: t/plugin/ai-proxy-request-body-override.t TEST 17 drives a real
request through ai-proxy with options + override.request_body, then uses
serverless-post-function (priority -2000, runs after ai-proxy access at
1040) to invoke both helpers and log their output. Asserts both the
upstream-received body AND the helper outputs reflect the same
post-override view.
@dosubot dosubot Bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:M This PR changes 30-99 lines, ignoring generated files. labels May 13, 2026
@janiussyafiq janiussyafiq changed the title refactor(ai-proxy): extract apply_instance_overrides into a pure helper feat(ai-proxy): pure helpers for the post-override effective request May 13, 2026
effective_model duplicates information already present on the body that
effective_request_for_cache returns (ai_instance.options.model is written
onto the body during apply_instance_overrides step 1). Callers that need
the model can read it off the effective body. A cheap ctx-only model
lookup can be added later if a concrete consumer needs it without parsing
the body.

Updates TEST 17 to drop the EFFECTIVE_MODEL assertion; the
EFFECTIVE_BODY assertions still prove the helper produces the same body
the upstream receives.
The cache key produced via effective_request_for_cache should reflect
what would actually be sent upstream. Previously the helper only applied
apply_instance_overrides, so if a converter was in the chain (e.g.
anthropic-messages client routed to an openai-chat provider) the helper
returned the pre-converter body while build_request sent the converted
body - the cache key would diverge from the upstream request shape.

Now the helper:
  1. Reads the request body
  2. Resolves (target_protocol, converter) via resolve_target_protocol
  3. Applies the converter when present
  4. Applies apply_instance_overrides

resolve_target_protocol's return signature widens from `target_protocol`
to `(target_protocol, converter)`; the fast-path (ctx.ai_target_protocol
already set) returns ctx.ai_converter alongside.
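The four steps can be sketched end-to-end with stubbed dependencies, so the converter-before-overrides ordering is visible. Everything here is illustration-only: the deps table is not the real helper's signature, and the stubs are modelled on the TEST 18 scenario (max_tokens renamed to max_completion_tokens by the converter, temperature then overridden to 0.42).

```lua
-- Hypothetical sketch of the four-step pipeline; the deps table and all
-- stub bodies are illustration-only, not the real helper's signature.
local function effective_request_for_cache(ctx, deps)
  local body = deps.get_parsed_request_body(ctx)                    -- 1
  local target_protocol, converter =
      deps.resolve_target_protocol(ctx, deps.ai_provider)           -- 2
  if converter then
    body = converter(body)                                          -- 3
  end
  return deps.apply_instance_overrides(                             -- 4
      body, deps.ai_instance, deps.ai_provider, target_protocol)
end

local deps = {
  ai_provider = {},
  ai_instance = { options = { temperature = 0.42 } },
  get_parsed_request_body = function()
    return { max_tokens = 10, temperature = 0.9 }
  end,
  resolve_target_protocol = function()
    return "openai-chat", function(b)
      -- converter renames anthropic's max_tokens for openai-chat
      b.max_completion_tokens, b.max_tokens = b.max_tokens, nil
      return b
    end
  end,
  apply_instance_overrides = function(body, inst)
    for k, v in pairs(inst.options or {}) do body[k] = v end
    return body
  end,
}
local body = effective_request_for_cache({}, deps)
```

Swapping steps 3 and 4 would hash a body with max_tokens still present, which is exactly the cache-key divergence this commit closes.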

Tests:
- TEST 17 (no-converter path) reformatted - the inline serverless-post-
  function was a single 297-char line; broken into a readable multi-line
  body to match the style used elsewhere in the file.
- TEST 18 added covering the converter path: anthropic-messages client
  to an openai provider. Asserts EFFECTIVE_BODY contains
  max_completion_tokens (post-converter rename of max_tokens) and
  temperature 0.42 (post-override), but NOT the original max_tokens
  field - proving the converter ran inside the helper.

Drive-by: comment on apply_instance_overrides shortened from 11 lines
to 5 (precedence rules + "mutates in place"). Other two helpers keep
their longer docs.
The b796b9d refactor changed build_request to read overrides from
opts.ai_instance, but the ai-request-rewrite sidecar caller was missed
and kept passing the now-dead opts.model_options. Result: conf.options
silently stopped propagating to the LLM sidecar request body.

Fix: pass ai_instance = conf. conf has the same .options / .override
shape apply_instance_overrides reads; the override.llm_options /
request_body branches are no-ops since the rewrite schema only defines
override.endpoint.
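The contract change can be shown as a before/after of the sidecar's extra_opts table. The conf values below are placeholders; only the .options / .override / .auth shape matters.

```lua
-- Placeholder conf; only the .options/.override/.auth shape is meaningful.
local conf = {
  auth = { header = "Authorization" },
  options = { model = "gpt-4o-mini" },
  override = { endpoint = "http://sidecar.example/v1" },
}

-- before the fix: model_options became a dead field after the refactor,
-- so conf.options silently stopped reaching the sidecar request body
local extra_opts_broken = { auth = conf.auth, model_options = conf.options }

-- after the fix: build_request reads overrides from opts.ai_instance,
-- and conf already has the .options / .override shape it expects
local extra_opts_fixed = { auth = conf.auth, ai_instance = conf }
```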

t/plugin/ai-request-rewrite2.t TEST 1, which validates extra_option in the LLM-stub request body, now passes; it previously failed with status 400 ("LLM service returned error status: 400") once httpbin was reachable.
Previously TEST 18 only checked status=200 plus the EFFECTIVE_BODY error_log regex. The cache-key correctness contract requires the helper's output to match what build_request actually sends upstream, but with no upstream-side assertion the test would have passed even if the helper diverged from build_request, as long as the helper's own log contained the expected fields.

Decode body.content[1].text (the openai-chat body echoed by the
/v1/chat/completions stub, surfaced through the converter's response
transform) and assert max_completion_tokens=10, temperature=0.42,
max_tokens=nil. Combined with the existing EFFECTIVE_BODY regex on the
same fields, this pins down helper == upstream for the converter's
distinctive markers. Mirrors TEST 17's structure.
@janiussyafiq janiussyafiq force-pushed the refactor/ai-proxy-apply-instance-overrides branch from 2cd96b3 to 1d8a9a2 Compare May 14, 2026 00:21

Copilot AI left a comment


Pull request overview

This PR extracts AI proxy request-body override handling into reusable helpers so future plugins can compute the effective upstream request body before proxying.

Changes:

  • Adds apply_instance_overrides and effective_request_for_cache helpers in ai-proxy base logic.
  • Updates provider request building to delegate override application to the new helper.
  • Fixes ai-request-rewrite to pass the AI instance/config through the updated provider request contract.
  • Adds regression tests for effective post-conversion/post-override request bodies.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

Files changed:

  • apisix/plugins/ai-proxy/base.lua: Adds effective request-body helper and routes override application through shared logic.
  • apisix/plugins/ai-providers/base.lua: Delegates request override application to ai-proxy.base.
  • apisix/plugins/ai-request-rewrite.lua: Passes plugin config as ai_instance for provider request construction.
  • t/plugin/ai-proxy-request-body-override.t: Adds tests covering effective request body after overrides and conversion.


Copilot's comments were anchored on the following diff hunks.

Comment on lines +199 to +200:

    return _M.apply_instance_overrides(
        request_body, ai_instance, ai_provider, target_protocol)

Comment on lines +185 to +191:

    end
    local ok, ai_provider = pcall(require,
        "apisix.plugins.ai-providers." .. ai_instance.provider)
    if not ok then
        return nil, "failed to load provider: " .. tostring(ai_instance.provider)
    end
    local target_protocol, converter = resolve_target_protocol(ctx, ai_provider)

Further diff context from the extra_opts construction (the old model_options field and the new ai_instance field are both visible):

    core.table.try_read_attr(ai_instance, "override", "request_body"),
    request_body_force_override =
        core.table.try_read_attr(ai_instance, "override", "request_body_force_override"),
    ai_instance = ai_instance,

    endpoint = core.table.try_read_attr(conf, "override", "endpoint"),
    auth = conf.auth,
    model_options = conf.options,
    ai_instance = conf,