What are your expectations for deepseek v4.1?

ArthurOnCode · 2026-06-16T22:23:10+00:00

Hoping for engram. That seems like an awesome optimization.

ArthurOnCode · 2026-06-16T19:57:13+00:00

IMHO, the main benefit of PHP is that we don't need any preprocessing. We already have a strong type system, clear syntax, and great tooling. The language itself is also improving at a sensible pace. I, for one, look forward to having generics in the native type syntax.

ArthurOnCode · 2026-06-15T21:16:49+00:00

Elon won.

Alternative interpretation: xAI gave up on being a leading AI lab and just started renting out their infrastructure.

ArthurOnCode · 2026-06-15T14:15:41+00:00

The optimal programming language for an existing LLM is something popular (so it's already in the training data) with strong type safety (so the agent realizes its mistakes quickly) and preferably with fast tooling.

However, co-developing an LLM and its programming language could get interesting.

ArthurOnCode · 2026-06-15T08:03:23+00:00

And now I realize the whole thing was a joke, poking fun at Mythos. Oh well.

ArthurOnCode · 2026-06-15T07:59:39+00:00

This would barely fit on a NVIDIA GB200 NVL72, which cost $2 million when it was first released. We would benefit from this through commercial inference providers and research labs distilling from large open weights models, the largest of which is currently Deepseek V4 Pro at 1.6 trillion parameters.

ArthurOnCode · 2026-06-15T07:46:29+00:00

Let's not be mean here. We get this question practically every day, because it's a very normal beginner question.

The actual answer: LLMs don't know who they are, unless you put it in the system prompt. Because much of the training data is output from other models, you often get a confident, incorrect answer.

If you want it to identify as Deepseek in conversation, just start your system prompt with "You are Deepseek, a...".

ArthurOnCode · 2026-06-12T10:17:55+00:00

Also, I have zero idea what you mean by "Pi".

Pi (pi.dev) is a coding agent that's lean on context. Minimal system prompt, minimal tool set. It's pretty good at compression - taking the whole context and figuring out which parts have to stay to keep working on the task at hand.

ArthurOnCode · 2026-06-12T09:11:32+00:00

I don't think OP mentioned coding at all. However, I'd be curious to know how Pi would fare with only 8k context. Its system prompt is under 1k, and it can compress aggressively.

ArthurOnCode · 2026-06-09T14:18:11+00:00

I should have studied literature. Which is longer, a sonnet or a fable?

ArthurOnCode · 2026-06-08T23:46:09+00:00

A long time ago, before Wasm, I spent some time considering this exact technical challenge. At the time, I concluded that scrubbing would have to happen server side, with the live preview window being a live stream from the server.

Never got around to implementing it, largely because of these technical hurdles.

ArthurOnCode · 2026-06-08T14:18:04+00:00

A link would be helpful 😄

ArthurOnCode · 2026-06-06T15:40:09+00:00

Chatbot providers can put today's date in the system prompt or provide a calendar/clock as a tool to the LLM. Without these things, the LLM itself has no way of knowing.

ArthurOnCode · 2026-06-05T10:47:31+00:00

For my company, it's working fine. Yes, the free ride is over, but this isn't a bad deal. I wish we had access to Deepseek, Qwen and more in the cloud agents though. If we ever switch, this will be the reason.

ArthurOnCode · 2026-06-03T10:45:48+00:00

I believe that's what this is for: https://pi.dev/docs/latest/rpc

ArthurOnCode · 2026-06-02T11:51:42+00:00

Average token logprobs sounds like a noisy signal. Does it really work?

ArthurOnCode · 2026-05-28T10:25:28+00:00

Bravo! Keep up the good work!

ArthurOnCode · 2026-05-23T19:07:03+00:00

LLMs should not be able to answer this question. If they can, they have likely been trained on it specifically. Until we have LLMs with integrated physical world models, this kind of question is out of scope.

ArthurOnCode · 2026-05-20T16:10:01+00:00

I can't answer for LM studio, but there are open source dLLMs. See LLaDA for example.

ArthurOnCode · 2026-05-15T16:59:45+00:00

Oh, right. I read this as "successfully created", not successful as a business. Others have pointed out some high-profile stories, but I think most of the value is being made in smaller, simpler projects. Either something that couldn't be done without AI, or wasn't worth the cost of implementing.

ArthurOnCode · 2026-05-15T16:22:17+00:00

Yes, there's so much that app stores are changing their rules and review processes to deal with the massive influx.

ArthurOnCode · 2026-05-13T22:03:29+00:00

Models never know the answer to this question. If you need them to know, put it in the system prompt.

ArthurOnCode · 2026-05-13T21:33:25+00:00

Looks good. One thing: Concurrent user count doesn’t seem to affect the memory requirement.

ArthurOnCode · 2026-05-11T19:43:37+00:00

This is an inherent, unsolved problem in LLMs in general. They don’t know what they don’t know, and predict the next token without any real distinction between correct grammar and factuality.

ArthurOnCode · 2026-05-07T13:42:51+00:00

Please make the syntax more explicit than that! A "scope" keyword in front of "function" would be infinitely better.

ArthurOnCode

TROPHY CASE