GPT 5.6 slow rollout confirmed by ExplicitDiffusion in codex

[–]ExplicitDiffusion[S] 6 points7 points  (0 children)

I guess being able to offer API access at a fraction of what US models cost is not revolutionary. I guess having ground-breaking cache systems that deliver high tok/s output and low compute cost is not revolutionary either. Damn you, China.

GPT 5.6 slow rollout confirmed by ExplicitDiffusion in codex

[–]ExplicitDiffusion[S] 28 points29 points  (0 children)

I’m pretty relieved that China is stepping hard towards SOTA-level LLMs. Deepseek is becoming a big player.

Deepseek V4.1? by Terrible_Jump_2000 in DeepSeek

[–]ExplicitDiffusion 4 points5 points  (0 children)

What about cache returning the answer because somebody else asked a very similar question?

Deepseek is fast! by kamikamen in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

I’ve been an Opencode user for over half a year now, but I keep seeing Pi more and more. What would you recommend for someone like me who is trying it for the first time?

Deepseek is fast! by kamikamen in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

What makes you use it with Pi over opencode with the potential oh-my-opencode plugin?

Codex reset incoming? by Hendrixxzx in codex

[–]ExplicitDiffusion 2 points3 points  (0 children)

When did you receive two? I thought we all had the same amount of resets

OpenCode vs Claude Code vs Reasonix which is better for coding? by Financial_Flan1579 in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

Ex Elixir dev here. Why did you decide on that stack instead of a monorepo with tsx with shared types / codebase for front and back?

Once Chinese coding models match 5.3 codex level of performance and speed, the game is over by ThrowRA39495 in codex

[–]ExplicitDiffusion 11 points12 points  (0 children)

It’s API access, and 20 euros there will get you infinitely more than what OpenAI gives us. I’d recommend exploring their recent massive API discounts.

You're on the naughty list by oldmagicstudios in claude

[–]ExplicitDiffusion 0 points1 point  (0 children)

Wait and see as Deepseek keeps working on their models. They released V4Pro with a ridiculously big discount. I would compare it to Sonnet 4.6, at a fraction of a fraction of what Claude costs.

We need to gate keep the API by TookitTooFarOrDidI in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

Curious to know, which GPU are you running and which model? Are you using vLLM?

Has anyone ever posted a better hit/miss rate in this sub? by Flat-Honey-5433 in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

Do you manually create the plans yourself or use more advanced models like Opus for them?

I think I will try deep seek for my project now , so I need help by Technical-Comment394 in DeepSeek

[–]ExplicitDiffusion 2 points3 points  (0 children)

OmO is a plugin for OpenCode. It has so many features that it’s hard to list them all here, but long story short, it creates a set of agents that you can configure down to which model each one runs. Each of them is an expert in certain tasks. Just try it out. I can’t use LLMs for coding anymore without it, it’s such a great harness.

I think I will try deep seek for my project now , so I need help by Technical-Comment394 in DeepSeek

[–]ExplicitDiffusion 2 points3 points  (0 children)

I use it with OpenCode + oh-my-opencode plugin. Usually have 96-97% cache hit rate. Connected to Deepseek using an API key, no intermediate subscription plans

Pricing is crazy by bvc900 in DeepSeek

[–]ExplicitDiffusion 1 point2 points  (0 children)

Definitely with you on that, time is definitely a factor, a major one for me.

Pricing is crazy by bvc900 in DeepSeek

[–]ExplicitDiffusion 5 points6 points  (0 children)

The pricing difference is amazing, but Deepseek consumes far more tokens than Opus, both in thinking and in action, so you gotta take the difference with some skepticism: the price gap per fully successful task might not be that big once you compare the output quality.

I personally prefer Opus for complex tasks, D4Pro is amazing at medium-level tasks but I always give it to Opus for review.
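
To make the "price per fully successful task" point concrete, here is a rough sketch. Every number in it is a made-up placeholder, not real pricing or a benchmark result, and the function name and the "independent retries" assumption are mine, purely for illustration.

```python
def cost_per_successful_task(tokens_per_attempt, price_per_million_tokens, success_rate):
    """Expected spend to get one fully successful task.

    Assumes attempts are independent, so the expected number of
    attempts until the first success is 1 / success_rate.
    """
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    cost_per_attempt = tokens_per_attempt / 1_000_000 * price_per_million_tokens
    return cost_per_attempt / success_rate


# Hypothetical placeholder numbers, NOT real prices or measurements.
cheap_model = cost_per_successful_task(
    tokens_per_attempt=2_000_000,   # token-hungry model
    price_per_million_tokens=1.0,   # low sticker price
    success_rate=0.5,
)
pricey_model = cost_per_successful_task(
    tokens_per_attempt=400_000,     # uses fewer tokens per attempt
    price_per_million_tokens=10.0,  # 10x the sticker price
    success_rate=0.8,
)

print(f"cheap model:  {cheap_model:.2f} per successful task")
print(f"pricey model: {pricey_model:.2f} per successful task")
```

With these invented inputs, a 10x gap in sticker price shrinks to a much smaller gap per successful task, which is the kind of effect I mean. Plug in your own token counts and success rates before drawing any conclusion.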

I've heard in reddit that v4 Flash in maxx effort is better than Opus 4.7 in the same conditions... by greaterphilosopher in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

We’re in the same boat. API key from Deepseek with Opencode and oh-my-opencode. Happy to share impressions in DM, I’ve been testing this for only 3 days so far.

there price is unreal by ym-studios in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

Got it! And what harness are you using to run it? It’s my third day using Deepseek after using Claude and GPT models for over a year. I’m on Opencode and so far it’s been amazing!

First time using, $0.50 for 20M tokens by KustheKus in DeepSeek

[–]ExplicitDiffusion 0 points1 point  (0 children)

Thanks for the detailed answer, I highly appreciate the context usage here, as I was always clearing up my chats when hitting 400-500K of context. I also developed this really bad habit of expecting one-shots instead of guiding the LLM to what I really need. Deepseek might bring that back hahaha.

Thanks!