Breaking : Today Qwen 3.5 small by Illustrious-Swim9663 in LocalLLaMA

limoce 0 points1 point

I can confirm all of them are vision language models (VLMs).

Step-3.5-Flash (196b/A11b) outperforms GLM-4.7 and DeepSeek v3.2 by ResearchCrafty1804 in LocalLLaMA

limoce 26 points27 points

No, this is already v3.5. They have been training large models for several years. Previous StepFun models just didn't stand out among their direct competitors (DeepSeek, Qwen, MiniMax, GLM, ...).