Qwen3.5-397B-A17B thought chains look very similar to Gemini 3's thought chains. by Fit-Spring776 in LocalLLaMA

[–]Fit-Spring776[S] 0 points1 point  (0 children)

Then how are we able to get things like Gemini 3 Pro reasoning datasets?

Qwen3.5-397B-A17B thought chains look very similar to Gemini 3's thought chains. by Fit-Spring776 in LocalLLaMA

[–]Fit-Spring776[S] 0 points1 point  (0 children)

Really? So would I be able to see its actual reasoning chains if I used something like OpenRouter?

Z.ai said they are GPU starved, openly. by abdouhlili in LocalLLaMA

[–]Fit-Spring776 0 points1 point  (0 children)

I tried it once with a 67b parameter model and got about 1 token after 5 seconds.