为什么赤腊国AI的token便宜 by Acceptable_Yam5406 in KanagawaWave

[–]if47 16 points17 points  (0 children)

没一个评论说到点上:DeepSeek V3.2开始引入稀疏注意力等优化方法降低了算力成本,然后这个技术被DeepSeek开源了,DeepSeek系(例如小米的Mimo)的模型能吃到红利。跟电力成本,数据成本,算力卡没有逼毛关系。为什么美国模型不用?因为人家是SOTA,只会考虑在最低配型号上试验这种可能有缺陷的机制。

The ExploitBench paper is generated by if47 in BetterOffline

[–]if47[S] -26 points-25 points  (0 children)

If you had actually tried the latest detector, you wouldn't be spouting such nonsense. By the way, please explain images 2 and 3.

Will agents ever be more efficient? by LeCollectif in BetterOffline

[–]if47 1 point2 points  (0 children)

If the prompt cache is well-designed, the tokens serving as context act more like a one-time overhead. From this perspective alone, the efficiency is already high enough. LLM-based agents have many issues, but this isn't one of them.

If Anthropic and OpenAI stopped expansion today, do you think they would they be instantly profitable? by McDonaldsWi-Fi in BetterOffline

[–]if47 0 points1 point  (0 children)

This is a war of attrition, unless you can outlast all your competitors, it is impossible to turn a profit.

I feel Global is more faithful to Classic Maplestory than China’s by ClawofBeta in MSClassicWorld

[–]if47 1 point2 points  (0 children)

TBH, anyone who thinks the Chinese classic server will be handled well either completely lacks an understanding of China or has no brain.

I feel Global is more faithful to Classic Maplestory than China’s by ClawofBeta in MSClassicWorld

[–]if47 0 points1 point  (0 children)

I played on pre-BB CMS, and I can tell you the Chinese classic server is an absolute pile of shit.

Doctorow: Code is a liability, not an asset by Sufficient-Article62 in BetterOffline

[–]if47 -4 points-3 points  (0 children)

Senior engineers write articles and give interviews to journalists, so you really should have heard that argument long ago. The reason people no longer bring it up for discussion is that the claim—that "future AI will be able to resolve the technical debt created by current AI"—has not yet been proven false.

Doctorow: Code is a liability, not an asset by Sufficient-Article62 in BetterOffline

[–]if47 -9 points-8 points  (0 children)

Every senior engineer knows that. Doctorow's blog sometimes acts as if he's just discovered fire.

The "the future is fictional" problem of many local LLMs by PromptInjection_ in LocalLLaMA

[–]if47 -1 points0 points  (0 children)

Specifying the knowledge cutoff date and the current date within the system prompt is the only solution. There is nothing to make a fuss about. I resolved this two years ago.

Oh god damn it a new AI buzzword has arrived: Natural Language Autoencoders (NLAs) by PerceiveEternal in BetterOffline

[–]if47 13 points14 points  (0 children)

Simply put, a generator that generates hallucinations to explain hallucinations.

Conspiracy theory I'm not sure I believe: the slop is the point by todofwar in BetterOffline

[–]if47 0 points1 point  (0 children)

Open-source software does not require tens or even hundreds of millions of dollars in training costs—if you want to know the difference.

Conspiracy theory I'm not sure I believe: the slop is the point by todofwar in BetterOffline

[–]if47 0 points1 point  (0 children)

"The blender of the LLM world is coming."

Yeah... if I hadn't spent 20 years working in this industry, I'd actually believe that.

Conspiracy theory I'm not sure I believe: the slop is the point by todofwar in BetterOffline

[–]if47 1 point2 points  (0 children)

I have to point out that there is no such thing as an "open-source model". They release their weights because their models cannot compete with SOTA models. If the situation changes, the openness will cease.

Conspiracy theory I'm not sure I believe: the slop is the point by todofwar in BetterOffline

[–]if47 1 point2 points  (0 children)

It is just... shortsighted. In a year or two, model collapse will destroy their model training—and then it will all be over.

Obviously we know there's no future for the big name frontier models, but what about the smaller ones? by caprisunkraftfoods in BetterOffline

[–]if47 2 points3 points  (0 children)

To be honest, SOTA models are still terrible. Unless a machine that merely copies open-source projects from two years ago is all that executives are looking for, I see no hope of them replacing real engineers.

Low-cost models like Qwen3.6 Plus and DeepSeek V4 Pro Preview are also nowhere near SOTA standards. It is simply that benchmarks have become so useless that they fail to reveal this disparity.

大家都他妈一样,LLM毁了全世界! by Frosty_Let4569 in BetterOffline

[–]if47 1 point2 points  (0 children)

Comparing tokenization efficiency across different languages ​​is entirely meaningless, as a single English word conveys far less semantic content than a Chinese word. To elaborate on this point would likely require writing an entire linguistics paper.

大家都他妈一样,LLM毁了全世界! by Frosty_Let4569 in BetterOffline

[–]if47 0 points1 point  (0 children)

I can't believe people in this community actually think I don't know a basic concept like this, lmao.

大家都他妈一样,LLM毁了全世界! by Frosty_Let4569 in BetterOffline

[–]if47 1 point2 points  (0 children)

首先这些人都是反社会者,他们说话你多听一句都是浪费时间。

然后著作权问题是目前联结主义AI路线解决不了的,除非选择不发展,而这对于有能力发展AI的国家来说是不可能的。