fix(openai-shim): strip store for local providers (vLLM, custom) #1048
Merged
kevincodex1 merged 1 commit into Gitlawb:main on May 8, 2026
Conversation
Local OpenAI-compatible servers (vLLM, llama.cpp, custom self-hosted gateways) often validate request bodies against a strict JSON schema and reject unknown fields with `400 Bad Request`. The shim sends `store: false` (an OpenAI-only field for cloud conversation persistence) and already strips it for known cloud hosts that share the same intolerance (Gemini, Cerebras). Local servers have no notion of remote conversation storage and fall into the same bucket.

This PR adds `isLocal` to `shouldStripResponsesStore` so any baseUrl resolved by `isLocalProviderUrl` (localhost / 127.0.0.1 / ::1 / 0.0.0.0) gets the field removed. Lenient locals (Ollama) already ignored it; this unblocks strict ones (vLLM Qwen) without any behavior change for the former.

Closes Gitlawb#672 (the `store: false` symptom; the separate `max_tokens` default vs. vLLM `max_model_len` collision is a different concern).
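For reference, a minimal sketch of the shape of the change. The helper names `isLocalProviderUrl` and `shouldStripResponsesStore` come from this PR; the signatures, the cloud-host stub, and its host suffixes are assumptions, not the repo's code.

```ts
function hostnameOf(baseUrl: string): string {
  try {
    // URL.hostname keeps the brackets on IPv6 literals, so strip them for comparison
    return new URL(baseUrl).hostname.replace(/^\[|\]$/g, "");
  } catch {
    return ""; // unparseable baseUrl: treat as non-local
  }
}

// Loopback / unspecified hosts: localhost, 127.0.0.1, ::1, 0.0.0.0
export function isLocalProviderUrl(baseUrl: string): boolean {
  return ["localhost", "127.0.0.1", "::1", "0.0.0.0"].includes(hostnameOf(baseUrl));
}

// Stand-in for the pre-existing strict cloud checks (Gemini #959, Cerebras #1040)
function isStrictCloudHost(baseUrl: string): boolean {
  const host = hostnameOf(baseUrl);
  return host.endsWith("googleapis.com") || host.endsWith("cerebras.ai");
}

// The change in this PR: local base URLs join the existing strip path
export function shouldStripResponsesStore(baseUrl: string): boolean {
  const isLocal = isLocalProviderUrl(baseUrl);
  return isStrictCloudHost(baseUrl) || isLocal;
}
```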
kevincodex1
approved these changes
May 7, 2026
techbrewboss
approved these changes
May 7, 2026
Collaborator
techbrewboss
left a comment
Reviewed the shim change and targeted test. The implementation is narrowly scoped: local OpenAI-compatible URLs now use the same store stripping path as the existing strict providers, and the added regression test covers the localhost chat-completions request body.
Verification: `bun test src/services/api/openaiShim.test.ts` passes, 92 tests / 0 failures.
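A hedged sketch of what that regression test might look like; the test title is taken from the PR's Testing notes, while the import path and assertions are assumptions (the real test inspects the outgoing chat-completions request body).

```ts
import { describe, expect, it } from "bun:test";
import { shouldStripResponsesStore } from "./openaiShim"; // assumed export and path

describe("openaiShim store stripping", () => {
  it("Local provider (vLLM/Ollama/etc.): strips unsupported store on chat_completions (#672)", () => {
    // The real test builds a chat-completions request against this base URL and
    // asserts that the serialized body contains no `store` field.
    expect(shouldStripResponsesStore("http://localhost:8000/v1")).toBe(true);
    expect(shouldStripResponsesStore("https://api.openai.com/v1")).toBe(false);
  });
});
```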
Summary
Local OpenAI-compatible servers (vLLM, llama.cpp, custom self-hosted gateways) frequently validate request bodies against a strict JSON schema and reject unknown fields with 400. The shim sends `store: false` (an OpenAI-only flag for cloud conversation persistence) and already strips it for cloud hosts that share the same intolerance (Gemini #959, Cerebras #1040). Local servers have no remote-storage concept and belong in the same bucket.

This PR adds `isLocal` to `shouldStripResponsesStore`, so any baseUrl resolved by `isLocalProviderUrl()` (localhost / 127.0.0.1 / ::1 / 0.0.0.0) gets the field removed.

Impact

Lenient locals (Ollama) already ignored the field; this unblocks strict ones (vLLM) without any behavior change for the former.
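For concreteness, the resulting wire shape might look like this; the body builder and import path are hypothetical, and only `shouldStripResponsesStore` is named in the PR.

```ts
import { shouldStripResponsesStore } from "./openaiShim"; // assumed export and path

type ChatCompletionsBody = { model: string; messages: unknown[]; store?: boolean };

function buildChatCompletionsBody(
  baseUrl: string,
  model: string,
  messages: unknown[],
): ChatCompletionsBody {
  const body: ChatCompletionsBody = { model, messages, store: false };
  if (shouldStripResponsesStore(baseUrl)) {
    delete body.store; // strict local servers (vLLM) reject unknown fields with 400
  }
  return body;
}

// http://localhost:8000/v1   -> { model, messages }                  (vLLM accepts it)
// https://api.openai.com/v1  -> { model, messages, store: false }    (unchanged)
```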
Testing
- The new regression test `Local provider (vLLM/Ollama/etc.): strips unsupported store on chat_completions (#672)` covers `http://localhost:8000/v1`.
- `bun test src/services/api/openaiShim.test.ts`: 92 pass / 0 fail.
- `bun run build`: bundle clean.
- `bun run smoke`: 0.9.2 OK.

Notes
This PR is `store`-only. Issue "API Error: 400 : OpenClaude + vLLM em Setup Multi-GPU (RTX 3090)" #672 also lists a separate `max_tokens=32000` default colliding with vLLM's `max_model_len=32768`; that is a model-context concern, not a wire-shape one, and would need to land on the catalog/context side. Out of scope here.