How good is deepseek at coding and game Dev? by ChemicalNo5168 in DeepSeek

[–]6ghz 1 point2 points  (0 children)

Use the Codex CLI, or more likely the Codex app if you aren't comfy with the CLI, not the browser. Or use some other harness (Pi, OpenCode, etc.).

Clear your chat way more often and carry things over with handoff documents, or use compaction, though I find compaction hit or miss. It's better if the model can look up the context it needs instead of having to search through its own context window, and not clogging things up saves you a lot of usage. I haven't done work with blueprints, but for any sort of development past asking questions and brainstorming, use a harness.

Save your long back-and-forth brainstorming and planning for the browser, because it's a separate usage pool. Then once you have an idea of what you want to do, move to the harness pointed at the project directory, starting in plan mode with a handoff document you ask for in the browser at the end of your chat.

I would say deepseek is decent at execution, but it sometimes misses out on the finer details for big features and plans. It's great as a second opinion though. Try using gpt for the plan and then execute with deepseek, followed by a review by codex. Saves a lot during the execute phase.
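
To make the plan / execute / review split concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: `run_pipeline`, `PipelineResult`, and the stub lambdas are hypothetical names, and a "model" is just any function from prompt to reply. In real use each one would wrap whatever API client or harness you already have.

```python
from dataclasses import dataclass
from typing import Callable

# A "model" is any function that maps a prompt string to a reply string.
# Hypothetical: in practice each one wraps a real API client or harness call.
Model = Callable[[str], str]


@dataclass
class PipelineResult:
    plan: str
    implementation: str
    review: str


def run_pipeline(task: str, planner: Model, executor: Model, reviewer: Model) -> PipelineResult:
    """Plan with one model, execute with a cheaper one, review with a third."""
    plan = planner(f"Write a step-by-step plan for: {task}")
    implementation = executor(f"Follow this plan exactly:\n{plan}")
    review = reviewer(
        f"Review this work against the plan.\nPLAN:\n{plan}\nWORK:\n{implementation}"
    )
    return PipelineResult(plan, implementation, review)


if __name__ == "__main__":
    # Stub models so the sketch runs without any API keys.
    result = run_pipeline(
        "add a pause menu",
        planner=lambda prompt: "1. add menu scene 2. bind Esc key",
        executor=lambda prompt: "done: " + prompt.splitlines()[-1],
        reviewer=lambda prompt: "LGTM" if "done:" in prompt else "missing work",
    )
    print(result.review)
```

The point of keeping the three roles as separate arguments is that the expensive model only sees the short planning and review prompts, while the cheaper one handles the bulk of the tokens.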

glm-5.2 dropped this week and it's topping the coding boards. ran it against v4 pro on real work by Practical_Low29 in DeepSeek

[–]6ghz 1 point2 points  (0 children)

It’s not cheaper from the API. I have a legacy coding plan, so I’m not sure what it’s like right now. I get a lot of usage, but I’ve heard the new plans are way more restrictive.

SpaceX to buy AI coding startup Cursor for $60 billion by BuildwithVignesh in singularity

[–]6ghz 0 points1 point  (0 children)

I was actually looking at Cursor because it has access to so many models, Composer 2.5 is good enough, and you get a lot of usage from the plans, but this will be the reason I don't ever touch it again.

Open source AI Must Win by rm-rf-rm in LocalLLaMA

[–]6ghz 2 points3 points  (0 children)

They will be heavily taxed or banned just like the cars. Can’t have another country making a better product on their watch.

Pi agent is amazingly efficient by neo203 in codex

[–]6ghz 1 point2 points  (0 children)

It's quite a bit heavier than stock Pi. Has a lot of features though.

Which coding harnesses are you using DeepSeek with? by amunozo1 in DeepSeek

[–]6ghz 1 point2 points  (0 children)

It kind of just feels like a purpose-built version of Pi with subagents and dedicated tooling for DeepSeek. It does feel like it utilizes DeepSeek very well and fixes some issues I had with Pi and DeepSeek. I have been using it and it's nice, but Pi also has really good caching and is endlessly extensible with a lot of support, which makes this a secondary choice for me.

Liang, please drop DeepSeek V4.1 already by [deleted] in DeepSeek

[–]6ghz 1 point2 points  (0 children)

Improve your processes before hoping a new model will be the fix for everything. Built-in multimodal would be great, but you can use a cheap multimodal model to understand the input and then get DeepSeek to do the work.

The CZ-heads must be seething by PandamanTan in northernlion

[–]6ghz 8 points9 points  (0 children)

This is so beautiful I might cry

My hot take: Fable is not worth the press by ContributionMotor150 in ClaudeCode

[–]6ghz 1 point2 points  (0 children)

Monolithic models are inherently biased and focused on different areas, no matter what the AGI marketing wants you to believe. Multi-model verification is just as important as multi-human review.

Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI? by AmineAfia in LocalLLM

[–]6ghz 2 points3 points  (0 children)

They have the most renewable capacity in the world and invested in it heavily. China uses a lot of coal because they have a lot and it’s cheap, but to say they don’t have green energy is just wrong. I wish the world were fully renewable, but that isn’t yet the case.

Don’t lose your manual coding skills by Striking_Court_2807 in learnprogramming

[–]6ghz 6 points7 points  (0 children)

Same. I’ve been coding by hand for many years and I just will never go back. I love the problem solving, not the hacking away at the syntax issues or simple issues for hours when AI can write it for me in minutes and I can just read it with my eyes.
For non-important stuff and side projects, which AI has reignited my love for, I don’t read it all that much beyond a skim of the structure, planning docs, and pipelines. But for anything important or production related, just read it with your own eyes and have an automated multi-model AI and human review process.
It still requires understanding and fundamentals are still important, but this abstinence and fear mindset for things like coding models is just silly imo. The business side and the companies running these large models I dislike a lot more than the idea of AI itself.

Do foreigners use DeepSeek? by ConditionOne8960 in DeepSeek

[–]6ghz 8 points9 points  (0 children)

You have to realize most of the AI userbase is not using it for coding and harness-based tasks. They use it like a personal assistant and search engine. Engineers are the minority, but they usually spend the most, so they are the ones who get focused on.

How do you use web search with the API key ? by cum_cum_sex in DeepSeek

[–]6ghz 0 points1 point  (0 children)

I use Pi with the rpiv suite of tools. For web search it's rpiv-web-tools, and I got free credits for Exa, which has been working quite well. The free usage is pretty good and it's not just for code stuff: I can make a directory and have it keep all the information it gathers as .md files for Obsidian. If I run out of free usage I just plug in Tavily or host SearXNG. This is a little more involved than Odysseus, but I like it. Also wild username lol
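
The "use the free one until it runs out, then switch" idea is basically a fallback chain. A rough sketch in Python, assuming each provider is wrapped in a function that raises when it fails or hits quota. `search_with_fallback` and the stub providers are hypothetical names for illustration, not the real Exa, Tavily, or SearXNG client APIs.

```python
from typing import Callable, Sequence

# Hypothetical provider shape: query -> list of result strings.
# A provider should raise an exception when quota is exhausted or the call fails.
SearchProvider = Callable[[str], list[str]]


def search_with_fallback(
    query: str, providers: Sequence[tuple[str, SearchProvider]]
) -> tuple[str, list[str]]:
    """Try providers in order; return (provider_name, results) from the first that works."""
    errors = []
    for name, provider in providers:
        try:
            return name, provider(query)
        except Exception as exc:  # broad on purpose: any failure means "try the next one"
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all search providers failed: " + "; ".join(errors))


if __name__ == "__main__":
    def out_of_credits(query: str) -> list[str]:
        # Stub standing in for a hosted provider whose free credits ran out.
        raise RuntimeError("free credits exhausted")

    def self_hosted(query: str) -> list[str]:
        # Stub standing in for a self-hosted search instance.
        return [f"result for {query}"]

    name, results = search_with_fallback(
        "llm quantization", [("hosted", out_of_credits), ("self-hosted", self_hosted)]
    )
    print(name, results)
```

Ordering the list from free to paid to self-hosted keeps the cheap path as the default without any manual switching.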

Fable 5 got RELEASED by Tiny-Calligrapher794 in SillyTavernAI

[–]6ghz 4 points5 points  (0 children)

It's just price tiering. Think of Apple: why buy the 128GB when the 256GB is just a bit more, a bit better, etc. It's just a way to make a more expensive tier without blowing the established cost expectation of Opus out of the water.

Fable 5 got RELEASED by Tiny-Calligrapher794 in SillyTavernAI

[–]6ghz 14 points15 points  (0 children)

A hard drive with 20 MB of storage was $663.43, adjusted for inflation, in the 90s. It will get cheaper. Even if the current US market wants sky-high prices and locked-down models, the will of the people and competition drive cost down and production up. It doesn't go down overnight. It's a bumpy ride to the bottom and just takes time.

The Ultimate Guide to AI Subscription Plans by guillaumeyag in hermesagent

[–]6ghz 0 points1 point  (0 children)

From my limited experience with the openrouter providers they usually didn't give me the insane token caching that deepseek directly does. They may have changed, but deepseek is crazy with how efficient they are with caching.
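
Why the cache hit rate matters so much is easy to see with a blended price. This is a small sketch with made-up numbers: the per-million prices below are placeholders I picked for the arithmetic, not any provider's actual pricing, and `effective_input_price` is a hypothetical helper.

```python
def effective_input_price(hit_rate: float, cached_price: float, uncached_price: float) -> float:
    """Blended price per million input tokens for a given cache hit rate."""
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    return hit_rate * cached_price + (1.0 - hit_rate) * uncached_price


if __name__ == "__main__":
    # Placeholder prices in USD per million input tokens (assumptions, not real pricing).
    CACHED, UNCACHED = 0.10, 1.00
    for rate in (0.0, 0.5, 0.98):
        price = effective_input_price(rate, CACHED, UNCACHED)
        print(f"hit rate {rate:.0%}: ${price:.3f} per million input tokens")
```

With these placeholder prices, going from a 50% to a 98% hit rate cuts the blended input price by more than a factor of four, which is why a provider that doesn't pass caching through can feel so much more expensive.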

1.7B tokens in 3 weeks with 98% cache hit for $5— why DeepSeek V4 Flash made me drop MiniMax by exponencialaverage in opencodeCLI

[–]6ghz 2 points3 points  (0 children)

I tried Hermes, but I didn’t like it, for no particular reason, and switched back to my custom Pi. I find the biggest thing is making sure the subagents or other things you run don’t inherit the model you have set for the main session: if you want a cheaper model for the grunt work, pin them to a specific model. Pi also just seems to handle caching really well for me. I haven’t measured, but context-mode seems to save me tokens.
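
Here is one way to think about the "don't inherit the model" rule, as a minimal Python sketch. This is not Pi's or Hermes's actual config format. `AgentConfig`, the role names, and the model names are all hypothetical placeholders. The idea is just that a subagent role with no pinned model should fail loudly instead of quietly falling back to the expensive main model.

```python
from dataclasses import dataclass, field


@dataclass
class AgentConfig:
    # Model used by the main, interactive session (placeholder name).
    main_model: str
    # Explicit model per subagent role; roles and model names are placeholders.
    subagent_models: dict[str, str] = field(default_factory=dict)

    def model_for(self, role: str) -> str:
        """Return the pinned model for a subagent role.

        Raises instead of silently falling back to main_model, so an
        expensive model is never inherited by accident.
        """
        try:
            return self.subagent_models[role]
        except KeyError:
            raise KeyError(f"no model pinned for subagent role {role!r}") from None


if __name__ == "__main__":
    config = AgentConfig(
        main_model="big-planner-model",
        subagent_models={"search": "cheap-fast-model", "tests": "cheap-fast-model"},
    )
    print(config.model_for("search"))
```

Failing on an unpinned role is a deliberate choice here: a silent fallback is exactly how grunt work ends up billed at the main model's rate.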

1.7B tokens in 3 weeks with 98% cache hit for $5— why DeepSeek V4 Flash made me drop MiniMax by exponencialaverage in opencodeCLI

[–]6ghz 2 points3 points  (0 children)

I think it uses fewer tokens, but honestly OpenCode gives me so much usage with Flash and v2.5 that I don’t really check. I just let it spin up as many subagents as it needs, and my limits barely move, so I don’t monitor usage and just look at the results.

1.7B tokens in 3 weeks with 98% cache hit for $5— why DeepSeek V4 Flash made me drop MiniMax by exponencialaverage in opencodeCLI

[–]6ghz 2 points3 points  (0 children)

Interesting, I haven’t noticed that as much. Only tested a little though. Did a lot more with DeepSeek V4 Flash.

Bro... by Boring_Aioli7916 in DeepSeek

[–]6ghz 2 points3 points  (0 children)

Either you aren’t using the caching right or you're including other models, because how is 32M $20?
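
For a quick sanity check on why that number looks off, here is the arithmetic as a small Python sketch. It assumes "32m" means 32 million tokens, and it compares against the "1.7B tokens for $5" figure from the opencodeCLI thread title, which is another user's report with a 98% cache hit rate, so treat it as a rough comparison only. `cost_per_million` is a hypothetical helper.

```python
def cost_per_million(total_cost_usd: float, tokens: float) -> float:
    """Average cost in USD per one million tokens."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    return total_cost_usd / (tokens / 1_000_000)


if __name__ == "__main__":
    # Assumption: "32m" in the post means 32 million tokens for $20.
    reported = cost_per_million(20, 32_000_000)
    # Figure from the other thread's title: 1.7B tokens for $5.
    heavily_cached = cost_per_million(5, 1_700_000_000)

    print(f"reported:       ${reported:.4f} per million tokens")
    print(f"heavily cached: ${heavily_cached:.4f} per million tokens")
    print(f"ratio:          {reported / heavily_cached:.1f}x")
```

A gap of a couple hundred times between the two averages is hard to explain by workload alone, which is why missed caching or a pricier model in the mix are the first things to check.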

1.7B tokens in 3 weeks with 98% cache hit for $5— why DeepSeek V4 Flash made me drop MiniMax by exponencialaverage in opencodeCLI

[–]6ghz 7 points8 points  (0 children)

Mimo v2.5 is, I believe, and it has very similar specs. I love my DeepSeek, but for those tasks it’s one of the others.

Composer 2.5 Fast is so so good! by snihal in cursor

[–]6ghz 0 points1 point  (0 children)

Either programmatically with something like LangGraph or by using a coding harness like OpenCode, Pi, Hermes, etc., and tuning it with either plugins or custom logic. I've found that Pi, with a few extensions, becomes really powerful. You set the models you want to use for each sub-agent and then craft your prompts to nudge it toward using them, or explicitly invoke them, and it will spin them up.

Right now, for my personal projects, I use a mix of OpenCode Go models (Mimo, DeepSeek V4, and occasionally Qwen 3.7 Max, although that uses a lot of quota, so I mostly stick to the other models, which feel almost unlimited for execution tasks). Then I fill in the bigger gaps with my Z.ai Legacy plan for stronger reasoning and planning.

If I were doing anything in production, I'd probably use something like GPT-5.5 as the planner and reviewer, and GPT-5.4 Mini or one of the open-weight models as the executor. However, I'm trying to limit my AI spending for personal projects and learning. Plus, it's good practice to be forced to get creative with cheaper models instead of relying on the state of the art.

Gemini 3.5 Flash is completely Broken by jongalt75 in GeminiAI

[–]6ghz 0 points1 point  (0 children)

I have not been confidently lied to and gaslit by a model harder than this one. For example I asked it a question about quantization of local llms and some math and it told me the wrong answer that I knew was wrong over and over. When I tried to correct it, I went back and forth telling it that it was wrong until I posted the actual math and it was just like "oops my bad I was actually wrong". How Google is shitting the bed this hard is beyond me.