Fix none approach streaming passthrough for tool-calling clients by joby-brentsmith · Pull Request #313 · algorithmicsuperintelligence/optillm

joby-brentsmith · 2026-06-15T21:35:28Z

Fixes #312

Problem

The none approach is documented as a direct pass-through to the upstream OpenAI-compatible endpoint. In practice, when stream: true and tools are present, OptiLLM:

Calls upstream non-streaming
Extracts only choices[0].message.content (text)
Synthesizes a fake SSE response with finish_reason: "stop" and no tool_calls

Agent clients that depend on streamed tool_calls receive the assistant's announcement text but never receive the tool call metadata they need to execute the tool.
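
To make the failure concrete, here is a minimal sketch of how a tool-calling client typically reassembles tool calls from streamed chunks in the OpenAI chat completion chunk format. The function name accumulate_tool_calls and both chunk lists are hand-written for illustration and are not taken from OptiLLM or from any real upstream response.

```python
import json

def accumulate_tool_calls(chunks):
    """Merge streamed tool_call deltas by index and track the last finish_reason."""
    calls = {}
    finish_reason = None
    for chunk in chunks:
        for choice in chunk.get("choices", []):
            delta = choice.get("delta") or {}
            for tc in delta.get("tool_calls") or []:
                slot = calls.setdefault(tc["index"], {"id": None, "name": "", "arguments": ""})
                if tc.get("id"):
                    slot["id"] = tc["id"]
                fn = tc.get("function") or {}
                # Name and arguments arrive as string fragments to concatenate.
                slot["name"] += fn.get("name") or ""
                slot["arguments"] += fn.get("arguments") or ""
            if choice.get("finish_reason"):
                finish_reason = choice["finish_reason"]
    return [calls[i] for i in sorted(calls)], finish_reason

# Illustrative shape of a genuine upstream stream that includes a tool call.
upstream = [
    {"choices": [{"index": 0, "delta": {"role": "assistant", "content": "Running it now."}, "finish_reason": None}]},
    {"choices": [{"index": 0, "delta": {"tool_calls": [{"index": 0, "id": "call_1", "type": "function",
                                                        "function": {"name": "shell", "arguments": ""}}]}, "finish_reason": None}]},
    {"choices": [{"index": 0, "delta": {"tool_calls": [{"index": 0, "function": {"arguments": "{\"command\": "}}]}, "finish_reason": None}]},
    {"choices": [{"index": 0, "delta": {"tool_calls": [{"index": 0, "function": {"arguments": "\"echo hello\"}"}}]}, "finish_reason": None}]},
    {"choices": [{"index": 0, "delta": {}, "finish_reason": "tool_calls"}]},
]

# Illustrative shape of the synthesized stream described above: text only, finish_reason "stop".
synthesized = [
    {"choices": [{"index": 0, "delta": {"content": "Running it now."}, "finish_reason": None}]},
    {"choices": [{"index": 0, "delta": {}, "finish_reason": "stop"}]},
]

real_calls, real_finish = accumulate_tool_calls(upstream)
fake_calls, fake_finish = accumulate_tool_calls(synthesized)
```

With the synthesized stream the accumulator ends up with no tool calls and a finish_reason of "stop", so an agent loop built this way treats the turn as finished and runs nothing.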

Solution

The change is scoped to requests where operation == 'SINGLE' and approaches[0] == 'none'. Optimization paths (rto, cot_reflection, moa, etc.) are unchanged.

generate_stream_passthrough(): when stream: true, call upstream with stream=True and yield each chunk as SSE without modification.
Original request messages: call none_approach(original_messages=messages, ...) directly instead of reconstructing the messages from parse_conversation().
promote_tool_calls_to_first_choice(): for non-streaming responses, merge tool_calls from a later choice into choices[0] (provider-agnostic; same pattern as goose#6369).
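
The streaming item above can be sketched roughly as follows. This is not the code from this PR: the name sse_passthrough_sketch and its signature are hypothetical, and it takes an already-opened iterable of chunks so that it can run without a network connection.

```python
import json

def sse_passthrough_sketch(upstream_chunks):
    """Yield each upstream chunk as one SSE event, unmodified, then a [DONE] sentinel."""
    for chunk in upstream_chunks:
        # OpenAI SDK chunks are pydantic models; plain dicts are accepted so the
        # sketch can be exercised offline.
        if hasattr(chunk, "model_dump_json"):
            payload = chunk.model_dump_json()
        else:
            payload = json.dumps(chunk)
        yield f"data: {payload}\n\n"
    # The SDK consumes the upstream [DONE] marker, so the proxy emits its own.
    yield "data: [DONE]\n\n"

# Hand-written example chunk carrying a tool_calls delta.
example_chunk = {
    "choices": [{
        "index": 0,
        "delta": {"tool_calls": [{"index": 0, "id": "call_1", "type": "function",
                                  "function": {"name": "shell", "arguments": "{}"}}]},
        "finish_reason": None,
    }]
}
events = list(sse_passthrough_sketch([example_chunk]))
```

In a server, upstream_chunks would presumably be the iterator returned by client.chat.completions.create(..., stream=True), and the generator would back a response served as text/event-stream. Because no field is read or rewritten, tool_calls deltas and the upstream finish_reason reach the client as sent.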

Testing

Streaming (should show tool_calls chunks):

curl -s -N http://127.0.0.1:8000/v1/chat/completions \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Run echo hello with the shell tool"}],
    "tools": [{"type": "function", "function": {
      "name": "shell",
      "parameters": {"type": "object", "properties": {"command": {"type": "string"}}, "required": ["command"]}
    }}],
    "tool_choice": "auto",
    "stream": true
  }' | rg "tool_calls|finish_reason"

Non-streaming split choices (should have tool_calls in choices[0]):

curl -s http://127.0.0.1:8000/v1/chat/completions \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Run echo hello with the shell tool"}],
    "tools": [{"type": "function", "function": {
      "name": "shell",
      "parameters": {"type": "object", "properties": {"command": {"type": "string"}}, "required": ["command"]}
    }}],
    "stream": false
  }' | python3 -c "
import sys, json
d = json.load(sys.stdin)
c = d['choices'][0]
assert len(d['choices']) == 1
assert c['message'].get('tool_calls')
print('ok')
"
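
The same assertions can be exercised offline against a hand-built split-choice response. The sketch below is an assumption about what a merge of this kind might look like, not the PR's promote_tool_calls_to_first_choice(): the function name, the decision to collapse to a single choice only when a promotion happens, and the sample response are all illustrative.

```python
import copy

def promote_tool_calls_sketch(response):
    """Return a copy of response with tool_calls from a later choice merged into choices[0]."""
    merged = copy.deepcopy(response)
    choices = merged.get("choices") or []
    # Nothing to do for single-choice responses or when choices[0] already has tool calls.
    if len(choices) < 2 or (choices[0].get("message") or {}).get("tool_calls"):
        return merged
    for other in choices[1:]:
        calls = (other.get("message") or {}).get("tool_calls")
        if calls:
            first = choices[0]
            message = first.get("message") or {}
            message["tool_calls"] = calls
            first["message"] = message
            first["finish_reason"] = other.get("finish_reason") or "tool_calls"
            # Assumption: once promoted, the response collapses to one choice.
            merged["choices"] = [first]
            break
    return merged

# Hand-written response where text and tool call arrive in separate choices.
split = {
    "choices": [
        {"index": 0, "finish_reason": "stop",
         "message": {"role": "assistant", "content": "Running echo hello."}},
        {"index": 1, "finish_reason": "tool_calls",
         "message": {"role": "assistant", "content": None,
                     "tool_calls": [{"id": "call_1", "type": "function",
                                     "function": {"name": "shell",
                                                  "arguments": "{\"command\": \"echo hello\"}"}}]}},
    ]
}
fixed = promote_tool_calls_sketch(split)
```

The input is deep-copied so the upstream response object is left untouched, and the text content of choices[0] is kept alongside the promoted tool_calls.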

Files changed

optillm/server.py: adds 2 helpers and rewires the none branch in proxy()

Made with Cursor

When approach is none and stream=true, forward upstream SSE chunks verbatim instead of synthesizing text-only responses. Preserve original request messages for multi-turn agent tool loops, and merge split tool_calls choices for non-streaming responses.

Co-authored-by: Cursor <cursoragent@cursor.com>

CLAassistant · 2026-06-15T21:35:35Z

All committers have signed the CLA.

Preserve multimodal message content in none passthrough

def8864

normalize_message_content() was flattening list content to text-only strings, dropping image_url parts. Keep list content intact when messages include non-text multimodal parts.

Co-authored-by: Cursor <cursoragent@cursor.com>

joby-brentsmith wants to merge 2 commits into algorithmicsuperintelligence:main from joby-brentsmith:fix/none-streaming-tool-calls
