feat: Codex OAuth (ChatGPT subscription) as LLM provider #453

Open

TatsuKo-Tsukimi wants to merge 5 commits into dataelement:main from TatsuKo-Tsukimi:feat/codex-oauth-provider

Conversation

@TatsuKo-Tsukimi

Summary

Adds a new LLM provider codex-oauth so users can authenticate Clawith's LLM calls with their ChatGPT Plus/Pro subscription instead of a static OpenAI API key. Follows the same OAuth 2.1 + PKCE flow used by OpenClaw, Hermes, and numman-ali/opencode-openai-codex-auth:

  • Public client app_EMoamEEZ73f0CkXaXp7hrann (shared across OSS Codex integrations)
  • PKCE against auth.openai.com/oauth/{authorize,token}
  • Inference hits chatgpt.com/backend-api/responses — same Responses API protocol Clawith already speaks via OpenAIResponsesClient, so the new client reuses all payload / parsing / streaming logic from the existing class and only overrides base URL + required headers (OpenAI-Beta: responses=experimental, originator: codex_cli_rs, chatgpt-account-id decoded from the access-token JWT)
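
For illustration, a minimal sketch of the PKCE and authorize-URL primitives this flow relies on. The client id, endpoints, redirect URI, and query params are taken from this PR; the function names are illustrative, not the PR's actual code in services/llm/codex_oauth.py:

```python
import base64, hashlib, secrets
from urllib.parse import urlencode

CLIENT_ID = "app_EMoamEEZ73f0CkXaXp7hrann"             # shared public client (from this PR)
AUTHORIZE_URL = "https://auth.openai.com/oauth/authorize"
REDIRECT_URI = "http://localhost:1455/auth/callback"    # fixed loopback redirect

def make_pkce_pair() -> tuple[str, str]:
    """Return (code_verifier, S256 code_challenge) per RFC 7636."""
    verifier = base64.urlsafe_b64encode(secrets.token_bytes(64)).rstrip(b"=").decode()
    challenge = base64.urlsafe_b64encode(
        hashlib.sha256(verifier.encode()).digest()
    ).rstrip(b"=").decode()
    return verifier, challenge

def build_authorize_url(state: str, code_challenge: str) -> str:
    # Only the query params named in this PR's test plan are shown here.
    params = {
        "response_type": "code",
        "client_id": CLIENT_ID,
        "redirect_uri": REDIRECT_URI,
        "state": state,
        "code_challenge": code_challenge,
        "code_challenge_method": "S256",
        "codex_cli_simplified_flow": "true",
        "id_token_add_organizations": "true",
        "originator": "codex_cli_rs",
    }
    return f"{AUTHORIZE_URL}?{urlencode(params)}"
```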

Why

Codex subscribers currently can't reuse their flat-rate ChatGPT Plus/Pro coverage inside Clawith — they have to provision a separate OpenAI API key with per-token billing. This PR adds parity with other open-source agent platforms that integrated Codex OAuth.

How it works

Backend

  • New CodexOAuthClient extends OpenAIResponsesClient. Per-call, _ensure_fresh_token() opens a session, SELECT ... FOR UPDATEs the llm_models row, refreshes if expires_at < now + 60s, and persists the new token pair (AES-256-CBC via the existing encrypt_data / decrypt_data utilities). Concurrency-safe against multiple workers / sessions touching the same row (see the sketch after this list).
  • ProviderSpec.protocol Literal extended with "codex_oauth"; registry entry added; create_llm_client dispatches to the new client.
  • get_llm_client_for_model(model, ...) factory in services/llm/utils.py reads the new auth_type column and routes static vs OAuth. All call sites in caller.py, agent_tools.py, heartbeat.py, supervision_reminder.py switched to the factory.
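
A rough sketch of that per-call refresh path, assuming SQLAlchemy 2.x async sessions. LLMModel and encrypt_data / decrypt_data are the PR's own names (import paths partly assumed); refresh_access_token is a hypothetical stand-in for the token-endpoint call:

```python
from datetime import datetime, timedelta, timezone

from sqlalchemy import select

from app.models.llm import LLMModel                       # this PR's model (see Files list)
from app.utils.crypto import encrypt_data, decrypt_data   # existing utilities; module path is an assumption

REFRESH_MARGIN = timedelta(seconds=60)

async def _ensure_fresh_token(session_factory, model_id) -> str:
    """Refresh the stored token pair if it expires within 60 seconds.

    SELECT ... FOR UPDATE serialises concurrent workers on the same llm_models
    row, so only one of them performs the refresh; the others see the new pair.
    """
    async with session_factory() as session, session.begin():
        row = (
            await session.execute(
                select(LLMModel).where(LLMModel.id == model_id).with_for_update()
            )
        ).scalar_one()

        now = datetime.now(timezone.utc)
        if row.oauth_expires_at and row.oauth_expires_at - now > REFRESH_MARGIN:
            return decrypt_data(row.oauth_access_token_encrypted)  # still valid

        # Hypothetical helper wrapping POST auth.openai.com/oauth/token
        # with grant_type=refresh_token.
        tokens = await refresh_access_token(decrypt_data(row.oauth_refresh_token_encrypted))
        row.oauth_access_token_encrypted = encrypt_data(tokens["access_token"])
        row.oauth_refresh_token_encrypted = encrypt_data(tokens["refresh_token"])
        row.oauth_expires_at = now + timedelta(seconds=tokens["expires_in"])
        return tokens["access_token"]
```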

Schema

New Alembic migration (add_codex_oauth_to_llm_models, chained on increase_api_key_length) extends llm_models:

  • auth_type VARCHAR(20) NOT NULL DEFAULT 'static' — 'static' keeps the existing API-key path, 'codex_oauth' marks OAuth rows.
  • oauth_access_token_encrypted VARCHAR(4096), oauth_refresh_token_encrypted VARCHAR(1024), oauth_expires_at TIMESTAMPTZ, oauth_account_id VARCHAR(255) — all nullable.
  • api_key_encrypted relaxed to nullable (OAuth rows don't carry a static key). The existing PUT /enterprise/llm-models/{id} handler already guards against empty / masked-value overwrites, so editing an OAuth model through the existing form won't clobber its tokens.
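
The migration body would look roughly like this (column names and sizes from the bullets above; the PR's actual migration may differ in structure):

```python
from alembic import op
import sqlalchemy as sa

revision = "add_codex_oauth_to_llm_models"
down_revision = "increase_api_key_length"

def upgrade() -> None:
    op.add_column("llm_models", sa.Column("auth_type", sa.String(20), nullable=False, server_default="static"))
    op.add_column("llm_models", sa.Column("oauth_access_token_encrypted", sa.String(4096), nullable=True))
    op.add_column("llm_models", sa.Column("oauth_refresh_token_encrypted", sa.String(1024), nullable=True))
    op.add_column("llm_models", sa.Column("oauth_expires_at", sa.DateTime(timezone=True), nullable=True))
    op.add_column("llm_models", sa.Column("oauth_account_id", sa.String(255), nullable=True))
    op.alter_column("llm_models", "api_key_encrypted", nullable=True)

def downgrade() -> None:
    op.alter_column("llm_models", "api_key_encrypted", nullable=False)
    for col in ("oauth_account_id", "oauth_expires_at",
                "oauth_refresh_token_encrypted", "oauth_access_token_encrypted", "auth_type"):
        op.drop_column("llm_models", col)
```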

Frontend

  • ConnectChatGPTModal component with two tabs:
    • Browser login — drives /start → opens the authorize URL in a new tab → polls /poll every 1.5s up to 5 minutes → captures the code from the loopback listener; manual redirect-URL paste as fallback if the browser's redirect can't reach localhost:1455.
    • Paste tokens — imports access_token / refresh_token / expires_in_seconds directly (e.g. from a local codex login's ~/.codex/auth.json). Clawith takes over refresh from there.
  • "Connect ChatGPT" button placed next to "+ Add Model" in Company Settings → Models tab; invalidates the llm-models query on success so the new row appears immediately.
  • Full en / zh i18n under enterprise.llm.codex.*.

Loopback listener

  • Stdlib http.server bound to CODEX_OAUTH_LOOPBACK_HOST:1455 (defaults 127.0.0.1; Docker Compose sets 0.0.0.0 with a 1455:1455 host mapping so the browser's http://localhost:1455/auth/callback reaches the backend).
  • Single-bind per process. If the port is busy, /start returns loopback_ready=false and the UI surfaces the manual-paste fallback.
  • In-memory state → {verifier, code} cache with 10-minute TTL and thread-safe access.
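
A condensed sketch of how such a listener can capture the callback, assuming the cache entries carry an expires_at timestamp; the PR's real handler in backend/app/api/codex_oauth.py may be structured differently:

```python
import threading, time
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlparse, parse_qs

_sessions: dict[str, dict] = {}   # state -> {"verifier": ..., "code": ..., "expires_at": ...}
_sessions_lock = threading.Lock()
SESSION_TTL = 600                 # 10 minutes

class _CallbackHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        parsed = urlparse(self.path)
        if parsed.path != "/auth/callback":
            self.send_error(404)
            return
        qs = parse_qs(parsed.query)
        state, code = qs.get("state", [""])[0], qs.get("code", [""])[0]
        with _sessions_lock:
            entry = _sessions.get(state)
            if entry and entry["expires_at"] > time.time():
                entry["code"] = code           # /poll picks this up later
        self.send_response(200)
        self.end_headers()
        self.wfile.write(b"You can close this tab and return to Clawith.")

def start_loopback(host: str = "127.0.0.1", port: int = 1455) -> HTTPServer | None:
    try:
        server = HTTPServer((host, port), _CallbackHandler)
    except OSError:
        return None   # port busy -> /start reports loopback_ready=false
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```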

Multi-tenant

llm_models.tenant_id already scopes LLM configs per tenant. Each OAuth grant is a separate row keyed by tenant, so one Clawith instance can hold multiple ChatGPT accounts.

Files

  • backend/app/services/llm/codex_oauth.py — PKCE / authorize URL / token exchange / refresh / JWT decode (pure, unit-testable)
  • backend/app/services/llm/client.py — CodexOAuthClient, registry entry, dispatch
  • backend/app/services/llm/utils.py — get_llm_client_for_model factory
  • backend/app/services/llm/caller.py, agent_tools.py, heartbeat.py, supervision_reminder.py — swap in factory
  • backend/app/models/llm.py — new columns
  • backend/alembic/versions/add_codex_oauth_to_llm_models.py — idempotent migration
  • backend/app/api/codex_oauth.py — 4 REST endpoints + loopback listener
  • backend/app/main.py — register router
  • backend/tests/test_codex_oauth.py — PKCE, URL params, exchange / refresh (httpx mocked), JWT decode
  • frontend/src/components/ConnectChatGPTModal.tsx
  • frontend/src/pages/EnterpriseSettings.tsx — wire the button
  • frontend/src/services/api.ts — typed codexOauthApi client + CODEX_OAUTH_MODELS whitelist
  • frontend/src/i18n/{en,zh}.json — enterprise.llm.codex.* keys
  • docker-compose.yml — 1455:1455 port + CODEX_OAUTH_LOOPBACK_HOST=0.0.0.0
  • docker-compose.test.yml — isolated test harness (port 3009, separate volumes)
  • test_codex_oauth.sh — paste-creds smoke test driver

Test plan

  • pytest backend/tests/test_codex_oauth.py — PKCE S256 consistency, authorize URL params, exchange / refresh over mocked httpx, JWT claim decode
  • Alembic: upgrade head applies cleanly from increase_api_key_length; downgrade drops the added columns and restores NOT NULL on api_key_encrypted
  • Manual: POST /start → authorize URL renders with all Codex-CLI params (codex_cli_simplified_flow=true, id_token_add_organizations=true, originator=codex_cli_rs); /poll on unknown state → {expired:true}; /complete on expired session → 400
  • Manual E2E (paste-creds flow): feed ~/.codex/auth.json contents → model row created with auth_type='codex_oauth', oauth_account_id decoded from JWT, tokens encrypted with SECRET_KEY
  • Manual E2E (browser flow): /start → ChatGPT login → browser redirect to localhost:1455/auth/callback → loopback listener captures code → /poll returns it → /complete persists model
  • Token refresh: backdate oauth_expires_at in DB → next inference call triggers refresh_token, updates the row under SELECT FOR UPDATE, retries the request
  • Inference: agent configured with the codex-oauth model → WebSocket chat fires POST https://chatgpt.com/backend-api/responses with Authorization: Bearer <jwt>, OpenAI-Beta: responses=experimental, originator: codex_cli_rs, and chatgpt-account-id headers
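
For reference, the headers exercised by that last check, written as a small helper. The function name is illustrative; the header names and values are the ones listed in this PR:

```python
CHATGPT_RESPONSES_URL = "https://chatgpt.com/backend-api/responses"

def codex_headers(access_token: str, account_id: str) -> dict[str, str]:
    """Headers the Codex Responses endpoint expects, per this PR's description."""
    return {
        "Authorization": f"Bearer {access_token}",
        "OpenAI-Beta": "responses=experimental",
        "originator": "codex_cli_rs",
        "chatgpt-account-id": account_id,   # decoded from the access-token JWT
    }
```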

Known limitations

  • Loopback redirect URI is hardcoded by OpenAI to http://localhost:1455/auth/callback. For Clawith deployments where the user's browser can't reach the backend's port 1455 (e.g. browser on laptop, backend on remote cloud host), the paste-tokens mode covers the gap.
  • The in-memory OAuth-session cache is per-backend-process. Multi-worker deployments should switch to Redis if /start and /poll might hit different workers.
  • Codex API availability depends on region — Clawith itself is region-agnostic, but chatgpt.com/backend-api/responses is subject to OpenAI's own geo policies.

References

🤖 Generated with Claude Code

TatsuKo-Tsukimi and others added 3 commits April 22, 2026 15:37
Adds a new provider 'codex-oauth' so users can authenticate with their
ChatGPT Plus/Pro subscription instead of a static OpenAI API key. Follows
the community OAuth 2.1 + PKCE flow used by OpenClaw / Hermes /
numman-ali/opencode-openai-codex-auth.

- Pure primitives in services/llm/codex_oauth.py (PKCE, authorize URL,
  exchange/refresh, JWT decode)
- New CodexOAuthClient subclassing OpenAIResponsesClient
- PROVIDER_REGISTRY entry + create_llm_client dispatch
- llm_models schema: auth_type + oauth_* columns, api_key_encrypted nullable (+ Alembic migration)
- get_llm_client_for_model() dispatcher; call sites switched
- Loopback listener on :1455 + 4 REST endpoints (/start /poll /complete /paste-creds)
- docker-compose: 1455 port + CODEX_OAUTH_LOOPBACK_HOST env
- Unit tests for OAuth primitives

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Adds the end-to-end UX surface for the backend codex-oauth provider
introduced in the previous commit.

- ConnectChatGPTModal component: two tabs (browser OAuth + paste tokens).
  OAuth tab drives /start -> opens authorize URL in a new tab -> polls
  /poll every 1.5s up to 5 min; falls back to manual redirect-URL paste
  when loopback is unavailable. Paste tab lets users import tokens
  directly from a local codex login (~/.codex/auth.json).
- EnterpriseSettings LLM tab: secondary 'Connect ChatGPT' button next to
  Add Model; invalidates the llm-models query on success so the new row
  appears immediately.
- services/api.ts: typed codexOauthApi client (start / poll / complete /
  pasteCreds) plus CODEX_OAUTH_MODELS whitelist mirroring the backend.
- i18n: enterprise.llm.codex.* keys in en.json and zh.json.
- test_codex_oauth.sh: helper smoke test that drives the paste-creds
  flow end-to-end from a local ~/.codex/auth.json.

The backend PUT /llm-models/{id} handler already guards api_key against
empty/masked overwrites, so editing an auth_type='codex_oauth' model
through the existing form is safe without a dedicated edit surface.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Spins up an independent Clawith instance on port 3009 with its own
postgres/redis volumes and a dedicated bridge network
(clawith_codex_network), so the Codex OAuth feature can be exercised
end-to-end without touching a user's primary Clawith deployment.

Usage:
  docker compose -p clawith-codex -f docker-compose.test.yml up -d --build

Binds host:1455 for the loopback OAuth callback and sets
CODEX_OAUTH_LOOPBACK_HOST=0.0.0.0 inside the container so the browser's
http://localhost:1455/auth/callback reaches the backend listener.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
TatsuKo-Tsukimi force-pushed the feat/codex-oauth-provider branch from 96136ed to f2c1fa3 on April 22, 2026 07:38

@chatgpt-codex-connector (bot) left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 96136ed4b3


Comment thread: backend/app/api/codex_oauth.py (Outdated)
) -> LLMModel:
    settings = get_settings()
    row = LLMModel(
        tenant_id=getattr(current_user, "tenant_id", None),

P1: Preserve tenant scoping when creating OAuth models

This path always writes tenant_id from current_user and never accepts a tenant override, unlike the existing /enterprise/llm-models flow. In a platform-admin session managing a different selected tenant, the new Codex OAuth model is inserted under the admin's own tenant (often null) instead of the target tenant, so it won't show up in that tenant's model list or be usable by its agents.


)

return create_llm_client(
provider=model.provider,

P1: Prevent static models from routing to codex_oauth client

The static-auth branch forwards model.provider directly to create_llm_client; once codex-oauth is exposed via provider discovery, admins can create/edit models through the normal static form with provider='codex-oauth' but auth_type='static'. Those models hit this branch and fail at runtime because create_llm_client requires OAuth context (model_id/session_factory) for codex_oauth, causing chat calls to error instead of executing.


…outing

Two issues raised on PR dataelement#453 by the Codex auto-reviewer, both boundary
gaps that would silently route callers to the wrong place.

1. Tenant scoping on model creation

   /complete and /paste-creds hardcoded tenant_id=current_user.tenant_id,
   unlike POST /enterprise/llm-models which accepts an optional tenant_id
   query override. A platform admin managing another tenant would write
   the new Codex OAuth row to their own tenant (often NULL), where the
   target tenant's agents can't see it.

   Adds tenant_id query param to /complete and /paste-creds with the same
   'override or fall back to current_user.tenant_id' semantics as the
   existing endpoint. Refactored to _resolve_tenant_id() so both flows
   share validation (rejects malformed UUIDs with 400).

2. Static form routing static-auth rows through the OAuth client

   Once codex-oauth is registered in PROVIDER_REGISTRY it automatically
   shows up in /enterprise/llm-providers, so admins can create it via the
   normal Add Model form. The row lands as
   provider='codex-oauth' + auth_type='static' (DB default), and chat
   calls then hit create_llm_client's codex_oauth dispatch branch without
   the required model_id / session_factory context, raising ValueError
   instead of serving inference.

   Three-layer defense:
   - get_provider_manifest() skips providers whose protocol is marked
     hidden (codex_oauth for now), so the UI can't offer it.
   - POST /enterprise/llm-models rejects provider='codex-oauth' with a
     400 that points at the dedicated endpoints.
   - PUT /enterprise/llm-models/{id}, for rows already auth_type=
     codex_oauth, locks down the identity fields (provider / base_url /
     api_key) while still allowing label / enabled / timeouts / etc. to
     be edited.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

@chatgpt-codex-connector (bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 30b58232cf


from app.services.llm import create_llm_client, get_max_tokens, LLMMessage, LLMError, get_model_api_key
from app.services.llm import (
    create_llm_client,
    get_llm_client_for_model,

P1: Re-export OAuth client factory before package-level import

This import pulls get_llm_client_for_model from app.services.llm, but backend/app/services/llm/__init__.py does not export that symbol. When heartbeat/A2A/supervision paths hit this block, Python raises ImportError before any LLM call, so those workflows fail even for non-OAuth models. Please either re-export get_llm_client_for_model from __init__.py or import it from app.services.llm.utils at call sites.
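
The suggested fix is essentially a one-line re-export, e.g.:

```python
# backend/app/services/llm/__init__.py
from .utils import get_llm_client_for_model  # noqa: F401  (re-export for package-level imports)
```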


Comment thread: frontend/src/services/api.ts (Outdated)
),

complete: (body: { state: string; code: string; label: string; model: string }) =>
request<CodexOauthModelResponse>('/llm-models/codex-oauth/complete', {

P1: Include tenant_id when completing Codex OAuth provisioning

Fresh evidence relative to the earlier tenant-scope concern: although the backend now accepts tenant_id on Codex OAuth endpoints, this new client call never appends the selected tenant, so platform-admin sessions provisioning for another tenant still default to the caller's own tenant context. As a result, created OAuth models can land under the wrong tenant and not appear in the target tenant’s model list.


Comment thread: backend/app/api/enterprise.py (Outdated)
raise HTTPException(status_code=404, detail="Model not found")

# Block turning a static row INTO an OAuth-only provider via this form.
_reject_oauth_provider_via_static_form(data.provider)

P2: Skip OAuth-provider rejection for existing OAuth rows

This guard runs before checking model.auth_type, so updates that include an unchanged provider='codex-oauth' are rejected outright. The edit form submits provider with every save, which means normal edits (label/model settings) on already-provisioned Codex OAuth models fail with 400, despite the subsequent logic intending to only block identity-field mutation.


Three follow-ups on PR dataelement#453 after the first fix round.

1. Re-export get_llm_client_for_model from app.services.llm (P1)

   The utility was defined in services/llm/utils.py but the package's
   __init__.py only re-exported the older create_llm_client / get_* set.
   agent_tools.py, heartbeat.py, and supervision_reminder.py all import
   via 'from app.services.llm import ... get_llm_client_for_model', so
   any A2A / heartbeat / supervision path — with ANY model, OAuth or
   not — raised ImportError at the deferred-import line before the LLM
   call could start. WebSocket chat worked because caller.py imports
   directly from .utils, masking the gap in earlier testing.

2. Thread tenant_id through the frontend Codex OAuth calls (P1)

   Previous fix added a tenant_id query parameter to /complete and
   /paste-creds, but api.ts and ConnectChatGPTModal never forwarded
   one. Platform admins provisioning for another tenant still defaulted
   to the caller's own tenant. ConnectChatGPTModal now takes a tenantId
   prop; EnterpriseSettings passes selectedTenantId; the API client
   appends ?tenant_id=... when present.

3. Reorder the update_llm_model OAuth guards (P2)

   _reject_oauth_provider_via_static_form ran before the
   model.auth_type check, so editing an already-OAuth row with its own
   unchanged provider='codex-oauth' in the request body hit the reject
   path. Edit forms re-submit the current provider on every save, so
   routine label/enabled updates on Codex OAuth rows returned 400.
   Moved the reject into the else branch of the auth_type check; the
   OAuth-row path still locks identity fields (provider / base_url /
   api_key) but lets passive fields through.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

@chatgpt-codex-connector (bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ea8348e304


Comment on lines +101 to +104
def _consume_session(state: str) -> dict[str, Any] | None:
    """Read-and-remove a session on successful code exchange."""
    with _sessions_lock:
        return _sessions.pop(state, None)


P2: Enforce OAuth session expiry before code exchange

_consume_session removes and returns any cached state without checking expires_at, so /llm-models/codex-oauth/complete can still accept sessions well past the documented 10-minute TTL until another /start happens to garbage-collect them. This extends the validity window of stale authorization states and makes the “expired session” guard unreliable for manual-complete flows.
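
A minimal version of the suggested guard, reusing the module's _sessions / _sessions_lock and assuming each entry stores an expires_at epoch timestamp:

```python
import time
from typing import Any

def _consume_session(state: str) -> dict[str, Any] | None:
    """Read-and-remove a session, rejecting entries past their TTL."""
    with _sessions_lock:
        entry = _sessions.pop(state, None)
    if entry is None or entry["expires_at"] < time.time():
        return None  # unknown or expired state -> caller returns the expired/400 response
    return entry
```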

