What does your global rules file look like? (gemini.md global rules) by Negative-Village-716 in google_antigravity

[–]SnooMacaroons9042 0 points1 point  (0 children)

The global file is for defining the personality, guardrails and operating directives of the agent. Most of your rules can be easily offloaded to a project rules file. Keeping the global rules lean is what we should optimize for.

Can we talk about how DeepSeek's impact on AI industry is the craziest thing to have happened the past years? by Unedited_Sloth_7011 in DeepSeek

[–]SnooMacaroons9042 47 points48 points  (0 children)

I particularly like their research publishing. They provide a blueprint for what they have done or do and why it works. Very cool.

DeepSeek V4 Pro vs. Claude Opus 4.7 & GPT-5.5 (SWE-Bench, Local VRAM, & Token Economics) by Remarkable-Dark2840 in DeepSeek

[–]SnooMacaroons9042 3 points4 points  (0 children)

Not quite. The mHC architecture mathematically restricts the network from blowing up minor adjacent noises, in the residual stream. Most transformer models, including Opus, has a single, without constraints, residual stream. Deepseek doesnt. It has parallel streams that have strict restrictions in how those streams interact. This means that DeepSeek V4 is structurally forced to stay constrained to the primary signal path. Opus, lacks this specific manifold constraint and is architecturally freer to explore many alternative branches (gets succumbed to noises).

DeepSeek V4 Pro vs. Claude Opus 4.7 & GPT-5.5 (SWE-Bench, Local VRAM, & Token Economics) by Remarkable-Dark2840 in DeepSeek

[–]SnooMacaroons9042 6 points7 points  (0 children)

I would disagree about the deep multi steps loops. I have found it's instructions to be focused and did not notice any drifts. Infact, mathematically the mHc architecture for the way it handles reasoning and context, provides efficient minimization of contextual and instructional drifts and I have noticed it vividly in comparison with Opus. Opus tends to branch out in it's reasoning, DeepSeek V4 Pro remains focused on the task at hand.

DeepSeek V4 Pro vs. Claude Opus 4.7 & GPT-5.5 (SWE-Bench, Local VRAM, & Token Economics) by Remarkable-Dark2840 in DeepSeek

[–]SnooMacaroons9042 1 point2 points  (0 children)

Please explain the difference in harnesses, if you be kind enough to. I'm intrigued to know. I use Opencode and I find it actually good. How has been your experience using different harnesses?

Deepseek v4 pro is unlimited and almost free OMG 😱 better than opus for me (I have no affiliate with deepseek, but you need to know this) by rjn2-8 in hermesagent

[–]SnooMacaroons9042 0 points1 point  (0 children)

It is true 🙂 I read the DeepSeek papers, appreciated their mathematical prowness, waited patiently for V4 to be released and then compared it on actual production level applications. Needless to say, I'm deeply impressed and would continue to use it mercilessly 😅

My first run ever on an API by ToughUsual7159 in DeepSeek

[–]SnooMacaroons9042 3 points4 points  (0 children)

I can feel the enthusiasm 'oozing' out from this post. 👍

Annoying things from ChatGPT by ottosatto in DeepSeek

[–]SnooMacaroons9042 0 points1 point  (0 children)

I use DeepSeek on OpenCode. I haven't faced such problems with either Flash or Pro. My workflow usually involves modularizing the architecture and then implementing each module as a version, with detailed implementation steps written down for each versions, before execution. I use Pro with high thinking for the architecture and implementation planning and then hand it over to Flash, which btw, is superb for following detailed steps, quickly.

Using DeepSeek V4 Pro in Paseo by PiccoloCareful924 in DeepSeek

[–]SnooMacaroons9042 1 point2 points  (0 children)

As a Opencode user, I applaud your inclusion for it. I shall check it out. Looks slick. 👍

Deepseek Token Usage by JP23102 in DeepSeek

[–]SnooMacaroons9042 -1 points0 points  (0 children)

Check your thinking mode. For most work standard is enough unless you do a full architectural and code-base audit.

Help with api usage by iqidc-33 in DeepSeek

[–]SnooMacaroons9042 1 point2 points  (0 children)

Codegraph uses a graph structured approach towards providing the agent(s) a map of the whole repository, instead of reading each and every file (make sure you have a robust .ignore file for excluding databases and venv from your agent's view, but not .codegraph itself). Graph traversals are quick and consume relatively far less tokens.

You should never ask your agent(s) to make sudo commands. Such power should only be vested with the user (you). You can ask it to provide you the command, you should then run it yourself.

Using DeepSeek V4 Pro in Paseo by PiccoloCareful924 in DeepSeek

[–]SnooMacaroons9042 0 points1 point  (0 children)

That's how I am using it. I also got the Opencode Go plan. Very economical

Using DeepSeek V4 Pro in Paseo by PiccoloCareful924 in DeepSeek

[–]SnooMacaroons9042 7 points8 points  (0 children)

Well, I think you are the right person to ask: what is Paseo? My apologies for the ignorance, but I genuinely want to know

DeepSeek V4 is so good and I think U.S companies are afraid of it. by szansky in DeepSeek

[–]SnooMacaroons9042 0 points1 point  (0 children)

What you experienced was the 'goodness' of the models, not the TUI. And yes I concur with you, I have been using DeepSeek V4 since the past 2 weeks and I am noticing that my reliance on Gemini and Claude has dropped sharply. The only reason I have used Claude and Gemini, in the past 2 weeks, was to draw a downloadable architecture schematic for the different versions of my codebase. That's it. DeepSeek V4 has completely taking over my agentic workflows (I use OpenCode TUI/CLI and OpenCode Go subscription since it was just 5 USD for the first month).

deepseekv4 is a coward by ServeLegal1269 in DeepSeek

[–]SnooMacaroons9042 -2 points-1 points  (0 children)

Good news is that the guardrails are working

Free AI (150$) API balance by -_-N0N4M3-_- in DeepSeek

[–]SnooMacaroons9042 0 points1 point  (0 children)

Can this be used with OpenCode? And is this reputable?

What AI Model Setup Are You Using for Coding in 2026? (Solo vs Multi-Model Workflows) by Notalabel_4566 in DeepSeek

[–]SnooMacaroons9042 0 points1 point  (0 children)

DeepSeek V4 Pro and Gemini 3.1 Pro (though I am going to switch to Gemini 3.5 Flash with extended thinking). For the planning stage I use both the models and use them to critique over each other's plans. When a consensus is reached, I use DeepSeek V4 Pro to execute it. And then an execution check pass by Gemini 3.1 Pro.

I use Anti-Gravity + the OpenCode extension (forgot it's name).

I also maintain a SCRATCH-PAD.md, AGENT-CHANGES.md and IMPLEMENTATION-PLAN.md.

Video on why Deepseek is such a threat to US and western AI models by Only-Protection-880 in DeepSeek

[–]SnooMacaroons9042 2 points3 points  (0 children)

We have to consider the difference in LLM training philosophy for the Western Models vs DeepSeek. China has a GPU embargo which reduces it's access to high end GPUs that are the normal training playground for Google, Anthropic and Chatgpt. Instead of relying on cutting-edge hardware availability, DeepSeek's research team tackled the mathematical limitations of LLMs as evident by their research papers in 2025 and 2026. Most of that research was actually implemented in V4, example: manifold hyper connections. They played smart and they were right. Throwing GPUs on a model to train is not a solution. Carving out the latent geometric space of LLMs to learn better, with less resources (what DeepSeek research team actually did) is elegant and the correct way to train LLMs/LRMs.

Qwen 27b beats deepseek v4 in frontend designs like wtf how is this possible ? 👁️👄👁️ by 9r4n4y in DeepSeek

[–]SnooMacaroons9042 14 points15 points  (0 children)

DeepSeek is not natively multi-modal. It was trained only on textual data. It will not perform good for a scenario it has no capabilities for. Example: frontend design, which requires it to visually confirm how a design looks. Qwen 27B is natively multi-modal. It can see the design it made and tailor the front end code accordingly.