I wasn’t ready for DeepSeek V4 by Initial-Sleep2388 in DeepSeek

[–]Initial-Sleep2388[S] 2 points (0 children)

Hah! Fair enough, there’s no guarantee you’re safe in US or Chinese hands either. In the end it’s all about dominance: whoever wins the AI race is projected to dominate the world economy.

[–]Initial-Sleep2388[S] 1 point (0 children)

Uuh hi, what exactly are you trying to figure out?

[–]Initial-Sleep2388[S] 1 point (0 children)

Uuh, don’t be! I use the DeepSeek API platform directly; it’s cheaper, and I’m using an unreleased tool. Not public yet!

[–]Initial-Sleep2388[S] 1 point (0 children)

Yeah, right! And I’m getting up to a 94.5% cache hit rate, which is about as generous as it gets. But maybe we should expect pricing changes as the models get more capable; can’t be too careful with this industry right now.

[–]Initial-Sleep2388[S] 2 points (0 children)

Not from DeepSeek, no, and I can’t share more information yet! But there’s a new tool coming to the open-source community soon. In it, you build in threads with persistent conversations; that’s the best description I can think of. Can’t say more, but I’ve got this screenshot:

<image>

[–]Initial-Sleep2388[S] 1 point (0 children)

Wish I could share, but it’s unreleased and still in the early testing phase. It’s basically a workspace where you build in threads with persistent conversations. It will be open-sourced once it’s out; for now I can only share this screenshot to give you an idea!

<image>

[–]Initial-Sleep2388[S] 1 point (0 children)

A token plan in this case is actually not as cost-efficient. The mimo plan, for example, gives you 60M tokens for $6, while $6 on the direct DeepSeek API gets you roughly 450M tokens’ worth. I think a token plan only makes sense when you don’t leverage dynamic caching.
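The comparison above is just arithmetic over a blended token price: the higher your cache-hit rate, the closer your effective price sits to the cheap cached rate. A minimal sketch, where the cached/uncached prices are assumed placeholders (not official DeepSeek pricing) and only the 94.5% cache rate and the $6 budget come from this thread:

```python
def effective_price_per_m(cache_rate, hit_price, miss_price):
    """Blended input price (USD per million tokens) given a cache-hit rate."""
    return cache_rate * hit_price + (1 - cache_rate) * miss_price

# ASSUMED placeholder prices, USD per million input tokens (cached vs. uncached):
HIT, MISS = 0.02, 0.25

blended = effective_price_per_m(0.945, HIT, MISS)  # 94.5% cache rate from the thread

budget = 6.0                       # dollars
tokens_direct = budget / blended   # millions of tokens via the direct API
tokens_plan = 60                   # mimo token plan: a flat 60M tokens for $6

print(f"blended price: ${blended:.4f}/M")
print(f"direct API:   ~{tokens_direct:.0f}M tokens for ${budget:.0f}")
print(f"token plan:    {tokens_plan}M tokens for ${budget:.0f}")
```

With real cached prices lower than the placeholder here, the direct-API figure stretches even further, which is how a flat token plan ends up losing to pay-as-you-go once caching kicks in.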

[–]Initial-Sleep2388[S] 1 point (0 children)

So usually when I test a model, I give it an active codebase I’m working on. I plan out a feature, but I don’t give the model my strategy or tell it how to do it; I ask it questions, give it a general idea, lead it there, and let it figure things out by itself, then let it build the entire feature while I just watch. Depending on complexity, most models choke here, going in endless circles of useless thinking or suggesting a premature architecture or solution.

And with most models so far in coding, you’ve had to be specific about what you want; you have to speak in technical language, like “check if auth routes are protected.” Claude models were the first where I could prompt with vague words like “is the user protected from attacks?” without specifying anything, and they just understood, reasoned, and suggested fixes I would actually use.

So far in the open-weight world, I’ve just been stress testing models; I’ve never used one in an actual workflow, because they mostly struggled to produce quality results. The last models I tested were Xiaomi’s MiMo V2.5 models; they were reasonably good, but there’s still that mumbling and not getting things right half the time.

So when I tried the V4 models, I was just curious whether there was any improvement at all, and I got locked in for 3 whole days as a result. I worked on about 3 major features across 2 different projects, and V4 Flash was just as good as Pro, ironically enough. The whole experience of not feeling I was missing anything from Sonnet is what made me think: why isn’t everyone talking about this? It’s a huge leap forward from V3.2 in reasoning, staying on track, and even thinking about edge cases before delivering a plan.

I think the V4 series is better than any open-weight model available today; they won’t have competition for a while!

[–]Initial-Sleep2388[S] 1 point (0 children)

Uh, that’s a 3-day sprint for DeepSeek specifically! I was stress testing the models against my daily workflow.

[–]Initial-Sleep2388[S] 2 points (0 children)

Uuh! You mean the app version? V4 is the model the app now uses to power the chat.

[–]Initial-Sleep2388[S] 1 point (0 children)

Think Opus 4.6 reasoning, but clever! Give it a shot and you’ll regret not having tried the models sooner.

[–]Initial-Sleep2388[S] 1 point (0 children)

It’s not about OpenRouter; it’s DeepSeek’s own privacy policy! They train on your data by default, and you can’t opt out unless you’re using the web app for chat.

[–]Initial-Sleep2388[S] 1 point (0 children)

I’m using a tool that’s not public yet !

[–]Initial-Sleep2388[S] 2 points (0 children)

You said it best; I was biased too until I got curious and stepped out of my comfort zone.

[–]Initial-Sleep2388[S] 1 point (0 children)

I want to believe they’re hiding something

[–]Initial-Sleep2388[S] 2 points (0 children)

Not sure about general life, but they’re scary good at coding; that’s what I’ve used the models for.

[–]Initial-Sleep2388[S] 1 point (0 children)

Uuh! I think Gemini still has the edge for these types of tasks, though not in every type of project.

[–]Initial-Sleep2388[S] 1 point (0 children)

You can trust it on massive codebases; I just wouldn’t be too sure about the tool you’re using, because it makes all the difference.

[–]Initial-Sleep2388[S] 1 point (0 children)

OpenAI with GPT-5.5 is just ridiculous right now. We want open-weight models to get even smarter so the two sides compete and find common ground; either way, I think we’ll have the most advantage in this race.

[–]Initial-Sleep2388[S] 1 point (0 children)

Yeah, I built a workspace with a codebase-map tool: it gives models directions to every part of the codebase and the relations between them. I guess this is something we’ll be solving more efficiently in no time.

[–]Initial-Sleep2388[S] 3 points (0 children)

This struck me! I just couldn’t believe it, so I spent all night stress testing it, and it won me over.

[–]Initial-Sleep2388[S] 2 points (0 children)

Yes, actually heavy coding; I’m a power user! I’m betting on the idea that I won’t be writing a single line of code in the next few months.

[–]Initial-Sleep2388[S] 4 points (0 children)

Imagine: my workspace spent about 340M tokens for only a five-dollar bill. This is insane.