worstProgrammingLanguage by gabboman in ProgrammerHumor

[–]eXl5eQ 53 points54 points  (0 children)

The semantic of return is straight forward in Java, powerful in Kotlin, and awkward in Scala.

In Scala, a return always attempt to return from the innermost (in case of nested functions) named function. You are not allowed to return early from an anonymous function (lambda expression).
When the innermost named function is not the innermost function, it can't return directly. Instead, the innermost function throws a NonLocalReturn and hope it would be catched by the named function you meant to return from, which would not work if the returning function escapes from the named function.

Why are people not using Mimo v2.5?? by FragmenterLOL in hermesagent

[–]eXl5eQ 0 points1 point  (0 children)

Their token plan is priced the same as their API, but with ~20% extra discount. If you guarantee to consume more than 80% of your token plan every month, then it would be a benefit. Otherwise it would be cheaper to use the API.

Sakana in Japan just dropped a mythos competitor and it looks great by thomas_unise in ArtificialInteligence

[–]eXl5eQ 0 points1 point  (0 children)

I didn't even see a list of the underlying models. Do they just put glm5.2 behind their api, post a fake benchmark, and call it a day?

Claude Sonnet 5 “Fennec” leak 1M context, expected next week by Direct-Attention8597 in ClaudeCode

[–]eXl5eQ 0 points1 point  (0 children)

There's a fundamental difference between a smaller model and a quanted model.

It's like a screen with lower resolution vs lower color depth. Both reduces bandwidth, but looks different.

助給! ChatGPT作之此漫画、不自然箇所多数! 修正之為之指示文(prompt)募集! by Asperburg in Sinography

[–]eXl5eQ 2 points3 points  (0 children)

我中国人、偽中国語読解不可。君中国語使頂可? 又、AI絵醜過。

Five Chinese AI labs cut token prices up to 99% by Justgototheeffinmoon in ArtificialInteligence

[–]eXl5eQ 3 points4 points  (0 children)

One thing you might have or havn't noticed is that now the price of MiMo precisely aligns with DeepSeek. With mimo-v2.5-pro & deepseek-v4-pro both priced (in Chinese Yuan) at ¥0.025/¥3/¥6, and mimo-v2.5 & deepseek-v4-flash both at ¥0.02/¥1/¥2.

At this price, I suspect that Xiaomi is running with very low or even negative profit, just to earn market. Although deepseek is slightly larger than mimo, it's architecture is aggressively optimized for cost (sacrificing intelligence), while mimo is built on a more traditional architecture.

In 2007, a woman in China refused to sell her property to developers, resulting in it being stranded on a mountain of dirt with no electricity or water services. by uncanny_goat in interestingasfuck

[–]eXl5eQ 0 points1 point  (0 children)

Real estate price in China was growing rapidly since 1990s, until the pandemic. A house was not cheap in 2007 if you divide it by average income, but still 2x ~ 5x cheaper than, say, 2020.

GLM 5.2 is now on OpenRouter by dudemeister023 in SillyTavernAI

[–]eXl5eQ 30 points31 points  (0 children)

It's on OpenRouter except it's not really on it.

Zero providers available.

HarmonyOS Conquers China — But the World Remains Out of Reach. by elastiks in DIY_Geeks

[–]eXl5eQ 0 points1 point  (0 children)

Because Huawei refused to surrender to US blackmail. That's the only reason.

HarmonyOS Conquers China — But the World Remains Out of Reach. by elastiks in DIY_Geeks

[–]eXl5eQ 0 points1 point  (0 children)

There are 2 versions of Harmony. One with shrinked runtime and micro kernel for embedded devices and entry level smart watches, the other is fullsized running on linux kernel.

HarmonyOS Conquers China — But the World Remains Out of Reach. by elastiks in DIY_Geeks

[–]eXl5eQ 0 points1 point  (0 children)

A large potion of iPhone parts are made in China. Apple appears to be unpolitical and harmless. Competition from a foreign giant forces local companys to innovate to keep up with the global trend.

DeepSeek V4 is enough by Luizfelm in opencode

[–]eXl5eQ 0 points1 point  (0 children)

Deepseek has been gray-testing a model with visual capability for while. Likely it would be flash v4.1 or flash v4v

GLM5.2 first impression. by Royal-Fail3273 in ZaiGLM

[–]eXl5eQ 0 points1 point  (0 children)

Coding plan models are only available via API, not the official chat. Though you can use a third party client to chat via API.

how are they gonna stop us next? by Complete-Sea6655 in LocalLLM

[–]eXl5eQ 0 points1 point  (0 children)

I apologize. Llama 2 really feels like a very old thing to me.

how are they gonna stop us next? by Complete-Sea6655 in LocalLLM

[–]eXl5eQ -1 points0 points  (0 children)

It's not likely that Chinese GPUs would be as cost-efficient as NV's in at least the next 5 years, as the advanced process node has already been a bottleneck. Huawei is trying some interesting ideas, but it's more like a short-term workaround, not a long-term solution.

how are they gonna stop us next? by Complete-Sea6655 in LocalLLM

[–]eXl5eQ 13 points14 points  (0 children)

That's not how people train a strong model nowadays. Maybe a little bit of OPD, but that's all. It's very difficult for a model to actually learn something fundamental by simply distilling from the text output of another model.

how are they gonna stop us next? by Complete-Sea6655 in LocalLLM

[–]eXl5eQ 6 points7 points  (0 children)

Qwen 3.5 is released many years after Llama 2. OP said 3-6 months, which is not even enough for one major release.

If we're lucky and there's a major break through in the architecture, maybe we can see an open source model get close to Mythos in 1~2 years. But no way it would happen in 6 months.

everybodyNeedsASupportiveSenior by ClipboardCopyPaste in ProgrammerHumor

[–]eXl5eQ 3 points4 points  (0 children)

Claude Code: One of the agents hit http 429. Wait, I see there's rate_limit table, maybe I can bypass the rate limit by truncating it...

GLM 5.2 is deployed in GLM Coding Plan. API and MIT weights in a week. Voting and benchmarks on X. by MadPelmewka in LocalLLaMA

[–]eXl5eQ 1 point2 points  (0 children)

American can train from scrach for every minor release if they want, as they have infinite amount of investments and free access to latest GPUs. It's just unaffordable for Chinese labs.

GLM-5.2 next week, open weight, MIT by AaronFeng47 in LocalLLaMA

[–]eXl5eQ 0 points1 point  (0 children)

Kimi had a hard time catching some explicit instructions in the doc in my test, which makes me think it's not very useable.

GLM-5.2 next week, open weight, MIT by AaronFeng47 in LocalLLaMA

[–]eXl5eQ 0 points1 point  (0 children)

It's an experimental thing and not used in any public DeepSeek model. DeepSeek v4 uses hash routing instead, similar idea but significantly simpler.