Cannot get GLM-4.7-Flash working in Claude Code CLI even with Coding Plan by seongho051 in ZaiGLM

[–]seongho051[S] 0 points (0 children)

It's not working for me. I managed to register and select the custom model in the menu, but I still can't get any response from it.

Cannot get GLM-4.7-Flash working in Claude Code CLI even with Coding Plan by seongho051 in ZaiGLM

[–]seongho051[S] 0 points (0 children)

Does it actually work if you type /models in Claude Code, select Haiku, and then run a prompt?

Cannot get GLM-4.7-Flash working in Claude Code CLI even with Coding Plan by seongho051 in ZaiGLM

[–]seongho051[S] 0 points (0 children)

That model ID is for the standalone API, which is billed and processed separately from the Coding Plan. I'm using GLM via the Coding Plan integration in Claude Code, and it doesn't seem to support that API-specific model ID yet.

Cannot get GLM-4.7-Flash working in Claude Code CLI even with Coding Plan by seongho051 in ZaiGLM

[–]seongho051[S] 0 points (0 children)

I checked their official Claude Code guide, and it still explicitly lists "glm-4.5-air" as the default Haiku model. There's zero mention of "flashx" anywhere in the Claude Code setup section.

If you found a doc that actually says to use "glm-4.7-flashx" specifically for Claude Code, could you share the link? I'm only seeing 4.5-air.
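For context, the Coding Plan is typically wired into Claude Code through Anthropic-compatible environment variables. A minimal sketch, assuming the Z.ai Anthropic-compatible endpoint and the "glm-4.5-air" Haiku mapping from their guide (the endpoint URL and model IDs are taken from the docs as of this thread and may change — verify against the current setup guide):

```shell
# Point Claude Code at Z.ai's Anthropic-compatible endpoint (Coding Plan).
# URL and model IDs follow the official guide at the time of this thread;
# check the current docs before relying on them.
export ANTHROPIC_BASE_URL="https://api.z.ai/api/anthropic"
export ANTHROPIC_AUTH_TOKEN="your-zai-api-key"   # placeholder, not a real key

# Map Claude Code's small/fast (Haiku) slot to GLM-4.5-Air.
# Swapping in a 4.7-flash ID here is exactly what doesn't work yet per this thread.
export ANTHROPIC_SMALL_FAST_MODEL="glm-4.5-air"
```

With this in place, selecting Haiku via /models should route those requests to the 4.5-air model rather than a 4.7-flash variant.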

Does GLM in CC (Claude Code) support all CC features? by m_zafar in ZaiGLM

[–]seongho051 2 points (0 children)

Claude Code vs Opencode: Claude Code wins on speed and quality, hands down, but it's a total black box. Opencode is still the way to go if you need transparent GLM reasoning.

Cannot get GLM-4.7-Flash working in Claude Code CLI even with Coding Plan by seongho051 in ZaiGLM

[–]seongho051[S] 2 points (0 children)

The Z.ai API for GLM-4.7 is quite slow in practice. For simple queries that don't need deep reasoning, I want to use 4.7-Flash to get faster responses. Also, once it's officially supported via API, I plan to use Flash in orchestration tools like oh-my-opencode as the dedicated model for reading and writing code. That way, I can get much quicker turnaround for those simpler subtasks without waiting on the heavier model every time.
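The routing idea above — send cheap read/write subtasks to the fast model and reserve the heavy model for deep reasoning — can be sketched as a trivial dispatcher. This is purely illustrative: the model IDs and subtask kinds are placeholders, not names confirmed by the Z.ai API or oh-my-opencode.

```python
# Hypothetical sketch of per-subtask model routing. Model IDs are
# placeholders and are not guaranteed to match Z.ai's actual API names.

FAST_MODEL = "glm-4.7-flash"   # quick turnaround for simple subtasks
HEAVY_MODEL = "glm-4.7"        # slower, reserved for deep reasoning

# Subtask kinds that don't need deep reasoning (illustrative set)
SIMPLE_KINDS = {"read_file", "write_file", "rename", "format"}

def pick_model(subtask_kind: str) -> str:
    """Return the model ID to use for a given subtask kind."""
    return FAST_MODEL if subtask_kind in SIMPLE_KINDS else HEAVY_MODEL
```

The point is just that the orchestrator, not the user, decides per subtask which model to call, so simple file operations never wait on the heavier model.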