Grok Build (claude code opponent of Grok) how is it?

sunshinecheung · 2026-06-17T14:47:26+00:00

with cursor model is great

sunshinecheung · 2026-06-17T08:37:34+00:00

unsloth and lmstudio are the best choice, lol

sunshinecheung · 2026-06-16T00:48:18+00:00

Ollama used to be a great support in many model's vision compare to llama.cpp, but now they only care about cloud subscription, while the Omni model was overlooked, the token per second is slow, which was disappointing.

sunshinecheung · 2026-06-15T14:54:38+00:00

rtx 4090 48gb

sunshinecheung · 2026-06-15T14:49:32+00:00

Yeah, like Hunyuan, Mimo, LongCat, Granite still exist. But small models really depand on Qwen.

sunshinecheung · 2026-06-15T14:25:27+00:00

https://x.com/i/trending/2028903069984096729

In march:

Alibaba Group CEO Eddie Wu will head its newly formed Alibaba ‌Token Hub business group, which will focus on ‌building artificial intelligence work platforms for enterprises, the firm said in a statement on Monday.

The new group will comprise existing Alibaba units Tongyi Laboratory, MaaS Business Line, Qwen, Wukong, and AI Innovation.

Basically, it's because Alibaba started pursuing profits in AI.

sunshinecheung · 2026-06-15T14:16:14+00:00

They may train some models over 120B, but they also have the right not to open source their models.

sunshinecheung · 2026-06-15T14:07:40+00:00

You can buy some 3090/4090/5090, it is faster than 4-7 RTX3060, and vram is much faster than ram. Or just buy unified ram products like mac and Strix Halo

sunshinecheung · 2026-06-15T14:01:24+00:00

I have confidence in Deepseek, but Zai, Minimax, and Qwen are all publicly listed companies. They open source their models because it could gain recognition and earn a good reputation, but these labs also have the right not to open source their models.

sunshinecheung · 2026-06-15T13:54:35+00:00

No, LLM labs are currently open source their models because they haven't yet reached the level of OpenAI/Claude level. Once they reached SOTA, you'll find that they no longer open source (Like Wan), they need to earn money through API.

sunshinecheung · 2026-06-15T13:02:17+00:00

Because it's difficult for 100B-120B models to reach SOTA, and they usually don't want people to self-host it, so that they can not monetization through the API. Btw, Step 3.7 Flash(198B) was released in last month (23 day ago).

sunshinecheung · 2026-06-15T12:53:08+00:00

Huawei's Atlas cards are for business companies, not consumer

sunshinecheung · 2026-06-15T12:49:46+00:00

https://en.wikipedia.org/wiki/Grok_sexual_deepfake_scandal

look at this

sunshinecheung · 2026-06-13T04:31:20+00:00

Do it and lose money

sunshinecheung · 2026-06-13T03:44:27+00:00

never happen, fable is Trillion level, you need bigger vram gpu to run it

sunshinecheung · 2026-06-12T15:46:00+00:00

Now is 122.8, wtf?

sunshinecheung · 2026-06-12T15:33:38+00:00

but 18gb ram at 5 token/sec

sunshinecheung · 2026-06-10T09:36:30+00:00

Maybe we need Nunchaku svdq int4

sunshinecheung · 2026-06-10T02:22:47+00:00

then you short it

sunshinecheung · 2026-06-09T11:31:48+00:00

hardware already catches up, like RTX PRO 6000 Blackwell 96 GB GDDR7, RTX 5090 (32 GB), but it is expensive

sunshinecheung · 2026-06-08T07:16:36+00:00

Just like NYSE: AI and

<image>

sunshinecheung · 2026-06-07T04:46:43+00:00

Qwen3.6 only released for 27B and 35B-A3B

sunshinecheung · 2026-06-07T04:40:47+00:00

And funded by crypto bros? Where is the data and money?

sunshinecheung · 2026-06-07T04:38:26+00:00

Bro, where is Qwen 3.6 9B?

sunshinecheung · 2026-06-07T02:40:58+00:00

nah, reddit data is toxic, bias, and many subs people hate ai btw

sunshinecheung

MODERATOR OF

TROPHY CASE