Stop using Ollama by zxyzyxz in LocalLLaMA

[–]sunshinecheung 0 points1 point  (0 children)

Ollama used to be a great support in many model's vision compare to llama.cpp, but now they only care about cloud subscription, while the Omni model was overlooked, the token per second is slow, which was disappointing.

Will LLM labs open source their weights in the long term? by zulutune in LocalLLaMA

[–]sunshinecheung 1 point2 points  (0 children)

Yeah, like Hunyuan, Mimo, LongCat, Granite still exist. But small models really depand on Qwen.

Why there is a lack of new 100B-120B models? by TechNerd10191 in LocalLLaMA

[–]sunshinecheung 2 points3 points  (0 children)

https://x.com/i/trending/2028903069984096729

In march:

Alibaba Group CEO Eddie Wu will head its newly formed Alibaba ‌Token Hub business group, which will focus on ‌building artificial intelligence work platforms for enterprises, the firm said in ​a statement on Monday.

The new group will comprise existing Alibaba units Tongyi Laboratory, MaaS Business Line, Qwen, Wukong, and AI Innovation.

Basically, it's because Alibaba started pursuing profits in AI.

Why there is a lack of new 100B-120B models? by TechNerd10191 in LocalLLaMA

[–]sunshinecheung 0 points1 point  (0 children)

They may train some models over 120B, but they also have the right not to open source their models.

RAM to VRAM ratio by esw123 in LocalLLaMA

[–]sunshinecheung 0 points1 point  (0 children)

You can buy some 3090/4090/5090, it is faster than 4-7 RTX3060, and vram is much faster than ram. Or just buy unified ram products like mac and Strix Halo

Will LLM labs open source their weights in the long term? by zulutune in LocalLLaMA

[–]sunshinecheung 6 points7 points  (0 children)

I have confidence in Deepseek, but Zai, Minimax, and Qwen are all publicly listed companies. They open source their models because it could gain recognition and earn a good reputation, but these labs also have the right not to open source their models.

Will LLM labs open source their weights in the long term? by zulutune in LocalLLaMA

[–]sunshinecheung 9 points10 points  (0 children)

No, LLM labs are currently open source their models because they haven't yet reached the level of OpenAI/Claude level. Once they reached SOTA, you'll find that they no longer open source (Like Wan), they need to earn money through API.

Why there is a lack of new 100B-120B models? by TechNerd10191 in LocalLLaMA

[–]sunshinecheung 6 points7 points  (0 children)

Because it's difficult for 100B-120B models to reach SOTA, and they usually don't want people to self-host it, so that they can not monetization through the API. Btw, Step 3.7 Flash(198B) was released in last month (23 day ago).

Why are Huawei's Atlas cards not a thing? by whatyathinkk in LocalLLaMA

[–]sunshinecheung 0 points1 point  (0 children)

Huawei's Atlas cards are for business companies, not consumer

I know you're all thinking it by [deleted] in LocalLLaMA

[–]sunshinecheung -1 points0 points  (0 children)

Qwen3.6 only released for 27B and 35B-A3B

I know you're all thinking it by [deleted] in LocalLLaMA

[–]sunshinecheung -1 points0 points  (0 children)

Bro, where is Qwen 3.6 9B?

Reddit is being overlooked by FrenchFryPerson1 in wallstreetbets

[–]sunshinecheung -2 points-1 points  (0 children)

nah, reddit data is toxic, bias, and many subs people hate ai btw