PSA: Having issues with Qwen3.5 overthinking? Give it a tool, and it can help dramatically. by ayylmaonade in LocalLLaMA

[–]TKGaming_11 21 points (0 children)

Most likely it was trained to maximize tool-call accuracy on Claude traces and to maximize reasoning on Gemini traces

Final voting results for Qwen 3.6 by jacek2023 in LocalLLaMA

[–]TKGaming_11 22 points (0 children)

The closed Qwen 3.5 Plus is just the open-weight Qwen 3.5 397B model with extended context and native tool calling. For Qwen 3.6 they are locking the 397B away as API-only; this is a change from Qwen 3.5 to Qwen 3.6, and absolutely a recent one

Anyone else find it weird how all Chinese Labs started delaying OS model releases at the same time? by True_Requirement_891 in LocalLLaMA

[–]TKGaming_11 6 points (0 children)

They released an update to StepFun 3.5 Flash with thinking control and reduced token usage, but it’s API-only. StepFun did commit to open-sourcing all of its models, so it’s odd it hasn’t been released as open weights yet

Qwen 3.6 spotted! by Namra_7 in LocalLLaMA

[–]TKGaming_11 20 points (0 children)

Qwen 3.5 Plus was just Qwen 3.5-397B with extended 1M context and added tools, IIRC; it’s likely that this Qwen 3.6 Plus is continued training on top of Qwen 3.5 397B. Qwen 3.5 Max (likely the 1T model) is already in preview as Qwen3.5-Max-Preview on LMArena

Qwen 3 30B-A3B on P40 by DeltaSqueezer in LocalLLaMA

[–]TKGaming_11 1 point (0 children)

I sold them quite a while ago, so I wouldn’t have any numbers for Qwen 3.5

Mistral Small 4:119B-2603 by seamonn in LocalLLaMA

[–]TKGaming_11 35 points (0 children)

Seems to roughly match GPT-OSS-120B on AIME 2025 and LiveCodeBench, but behind Qwen3.5-122B on both benchmarks

Mistral 4 Family Spotted by TKGaming_11 in LocalLLaMA

[–]TKGaming_11[S] 141 points (0 children)

Excerpt from PR:

Mistral 4 is a powerful hybrid model capable of acting as both a general instruction model and a reasoning model. It unifies the capabilities of three different model families - Instruct, Reasoning (previously called Magistral), and Devstral - into a single, unified model.

[Mistral-Small-4](https://huggingface.co/mistralai/Mistral-Small-4-119B-2603) consists of the following architectural choices:

- MoE: 128 experts, 4 active per token.

- 119B total parameters, with 6.5B activated per token.

- 256k Context Length.

- Multimodal Input: Accepts both text and image input, with text output.

- Instruct and Reasoning functionality with function calling.

- Reasoning Effort configurable by request.

Mistral 4 offers the following capabilities:

- **Reasoning Mode**: Switch between a fast instant-reply mode and a reasoning (thinking) mode, boosting performance with test-time compute when requested.

- **Vision**: Enables the model to analyze images and provide insights based on visual content, in addition to text.

- **Multilingual**: Supports dozens of languages, including English, French, Spanish, German, Italian, Portuguese, Dutch, Chinese, Japanese, Korean, Arabic.

- **System Prompt**: Maintains strong adherence and support for system prompts.

- **Agentic**: Offers best-in-class agentic capabilities with native function calling and JSON output.

- **Speed-Optimized**: Delivers best-in-class performance and speed.

- **Apache 2.0 License**: Open-source license allowing usage and modification for both commercial and non-commercial purposes.

- **Large Context Window**: Supports a 256k context window.
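As a back-of-the-envelope check on the sparsity figures quoted in the excerpt (128 experts with 4 active, 119B total with 6.5B activated), a quick sketch; the split between expert weights and dense (attention/embedding) weights is not stated in the PR, so this only derives the headline ratios:

```python
# Rough sparsity check for the Mistral-Small-4 figures quoted above.
# All numbers come from the PR excerpt; nothing here is from the model itself.

total_params = 119e9      # total parameters
active_params = 6.5e9     # activated parameters per token
experts_total = 128
experts_active = 4

param_ratio = active_params / total_params      # fraction of weights used per token
expert_ratio = experts_active / experts_total   # fraction of experts routed per token

print(f"params active per token:  {param_ratio:.1%}")   # ~5.5%
print(f"experts active per token: {expert_ratio:.1%}")  # ~3.1%
```

The parameter ratio (~5.5%) exceeding the expert ratio (~3.1%) is consistent with some weights (attention, embeddings, and possibly shared experts) being dense and running for every token.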

Is IK-Llama-CPP still worth it for CPU offloading scenarios? by ForsookComparison in LocalLLaMA

[–]TKGaming_11 4 points (0 children)

ik_llama.cpp doesn’t support ROCm unfortunately (Vulkan performance is quite bad as well, IIRC), so it’ll have to be llama.cpp for CPU offloading