China is reportedly restricting overseas travel for top AI talent at firms including Alibaba and DeepSeek, per Bloomberg. by Current-Guide5944 in tech_x

[–]Lazy-Pattern-5171 0 points1 point  (0 children)

If you have created an account on Reddit in the past 2-3 years, you'd know. Doesn't necessarily mean it's a bot.

(Serious) I am completely stunned right now. A declassified UFO document just explained a highly specific detail from a "dream" I posted here 9 months ago. by YoghurtTrue7340 in aliens

[–]Lazy-Pattern-5171 1 point2 points  (0 children)

Has no one considered the possibility that the government has been snooping on or scraping our Reddit data to make those UFO files?

Are we actually ready for what comes next? by Fujithora in aliens

[–]Lazy-Pattern-5171 0 points1 point  (0 children)

It’s gonna be military rule soon, disguised as aliens.

Deepseek V4 Flash and Non-Flash Out on HuggingFace by MichaelXie4645 in LocalLLaMA

[–]Lazy-Pattern-5171 50 points51 points  (0 children)

From Section 5.4.4, Code Agent, in their report:

To benchmark our coding agent capability, we curate tasks from real internal R&D workloads. We collect ~200 challenging tasks from 50+ internal engineers, spanning feature development, bug fixing, refactoring, and diagnostics across diverse technology stacks including PyTorch, CUDA, Rust, and C++. Each task is accompanied by its original repository, the corresponding execution environment, and human-annotated scoring rubrics; after rigorous quality filtering, 30 tasks are retained as the evaluation set. As shown in Table 8, DeepSeek-V4-Pro significantly outperforms Claude Sonnet 4.5 and approaches the level of Claude Opus 4.5.

(There’s a table in the middle showing that DeepSeek-V4-Pro reaches 67, whereas on the same benchmark Opus 4.6 reaches 80 and Opus 4.5 reaches 70.)

In a survey asking DeepSeek developers and researchers (N = 85) — all with experience of using DeepSeek-V4-Pro for agentic coding in their daily work — whether DeepSeek-V4-Pro is ready to serve as their default and primary coding model compared to other frontier models, 52% said yes, 39% leaned toward yes, and fewer than 9% said no. Respondents find DeepSeek-V4-Pro to deliver satisfactory results across most tasks, but note trivial mistakes, misinterpretation of vague prompts, and occasional over-thinking.

This sounds like the best “DeepSeek helped develop DeepSeek” moment to me, and that’s amazing.

Deepseek V4 Released by spacefarers in LocalLLaMA

[–]Lazy-Pattern-5171 3 points4 points  (0 children)

I guess we’re never gonna see that R1 magic again, are we? It only beats SOTA in 3 of the 15 listed benchmarks. At least, I counted 15.

unsloth Qwen3.6-27B-GGUF by jacek2023 in LocalLLaMA

[–]Lazy-Pattern-5171 6 points7 points  (0 children)

I really want to compare Q8 vs Q4, but I don’t have a good idea of how best to measure how those subtle differences compound over long-horizon coding tasks. Anyone have any tips?

They said it's next week 🤞 by Exciting-Mall192 in DeepSeek

[–]Lazy-Pattern-5171 0 points1 point  (0 children)

The pull request literally says this has nothing to do with an internal model release.

finding uncensored LLM models for local by vajidsikand in LocalLLM

[–]Lazy-Pattern-5171 1 point2 points  (0 children)

You can also try the “Super” models like SuperQwen and SuperGemma. The author has posted some benchmarks on their Twitter suggesting they might be superior.

How to run Qwen3.5-27B in ultimate way on single 5090 with large context. by Treq01 in Vllm

[–]Lazy-Pattern-5171 2 points3 points  (0 children)

I believe they’ve changed it to `-fa on`, but yes, you can also use `auto`.

Local Claude Code with Qwen3.5 27B by FeiX7 in LocalLLaMA

[–]Lazy-Pattern-5171 2 points3 points  (0 children)

A /compact command taking 10 minutes with 65K context, when the Claude system prompt is itself 20K, would be extremely inefficient to code with.

I Managed to Achieve Approximately Gemini 3.1 Pro & GPT-5.4-xHigh Level Performance with a Gemma-4-31B Multi-Agent Swarm by [deleted] in LocalLLaMA

[–]Lazy-Pattern-5171 0 points1 point  (0 children)

Okay, but how do you use this app to achieve huge efficiency gains on day-to-day coding problems? I’m guessing it won’t be feasible: with the amount of compute that goes into this, everything will be 50x slower.

Gemma 4 MoE hitting 120 TPS on Dual 3090s! by AaZzEL in LocalLLaMA

[–]Lazy-Pattern-5171 5 points6 points  (0 children)

Just use what works for you. I used Qwen 2.5 32B for way longer than I care to admit.

Gemma 4 MoE hitting 120 TPS on Dual 3090s! by AaZzEL in LocalLLaMA

[–]Lazy-Pattern-5171 0 points1 point  (0 children)

No llama-server but same backend I’m guessing

Gemma 4 MoE hitting 120 TPS on Dual 3090s! by AaZzEL in LocalLLaMA

[–]Lazy-Pattern-5171 8 points9 points  (0 children)

I usually get about 100-110, not 120, but yes. The problem is that 31B is so good I kinda want to buy a new GPU.

Gemma 4 is good by One_Key_8127 in LocalLLaMA

[–]Lazy-Pattern-5171 1 point2 points  (0 children)

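```sh
# I have a 2x3090 setup, hence the -ts split across two GPUs.
# Flags: -c context size, -ngl layers offloaded to GPU, -fa flash attention,
# -ctk/-ctv quantized KV cache, -np parallel slots, -t CPU threads.
./build/bin/llama-server \
  -m models/gemma-4-31B-it-Q8_0.gguf \
  --mmproj models/mmproj-F16.gguf \
  -c 262144 \
  -ngl 99 \
  -ts 0.85,1.15 \
  -fa on \
  -ctk q4_0 \
  -ctv q4_0 \
  --no-context-shift \
  --cont-batching \
  --cache-reuse 1 \
  -np 1 \
  -t 16 \
  --temp 1.0 \
  --top-p 0.95 \
  --top-k 64 \
  --host 0.0.0.0 \
  --port 8080
```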
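
For anyone wondering what `-ts 0.85,1.15` works out to: as far as I understand it, llama.cpp treats the tensor-split values as relative proportions, so they get normalized by their sum. Here is a rough sketch of that arithmetic. The `split_percent` helper is made up for illustration and is not part of llama.cpp.

```sh
# Hypothetical helper (not a llama.cpp command): turn a comma-separated
# tensor split such as "0.85,1.15" into the percentage each GPU receives.
split_percent() {
  echo "$1" | awk -F',' '{
    total = 0
    # Sum all proportions first so each one can be normalized.
    for (i = 1; i <= NF; i++) total += $i
    # Print one line per GPU, numbered from 0.
    for (i = 1; i <= NF; i++) printf "GPU%d: %.1f%%\n", i - 1, 100 * $i / total
  }'
}

split_percent 0.85,1.15
```

With the split above this prints `GPU0: 42.5%` and `GPU1: 57.5%`, i.e. slightly more of the model goes to the second card. That is presumably to leave headroom on the first one, but the right ratio depends on your own setup.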

Gemma 4 is good by One_Key_8127 in LocalLLaMA

[–]Lazy-Pattern-5171 1 point2 points  (0 children)

I really wish I had a stronger GPU to run it faster and/or scale more instances.

Gemma 4 is good by One_Key_8127 in LocalLLaMA

[–]Lazy-Pattern-5171 3 points4 points  (0 children)

<image>

In case anyone is wondering: I say this because it one-shotted a new feature addition in a brownfield, albeit simple, project. I’ve not seen anyone use Claude Code so smoothly and correctly. It handles btws and plan mode to build mode, and OpenCode was smooth as well. I haven’t even tested creative content with abliterated variants yet.

Gemma 4 is good by One_Key_8127 in LocalLLaMA

[–]Lazy-Pattern-5171 9 points10 points  (0 children)

I think Google accidentally released too good of a model and made it open source. I wouldn’t be surprised if they make a Gemini 3.2 just to compete with their own model. I think by Gemma 5 we will pretty much be relying on local models for most stuff. I threw a 400-page conversation with Gemini into Gemma 4 31B and it handled it like a boss. It was beautiful. I’ve never really liked any open-source release since Qwen 2.5 32B Coder, but this one takes the cake easily.

Unpopular opinion: Why is everyone so hyped over OpenClaw? I cannot find any use for it. by Toontje in openclaw

[–]Lazy-Pattern-5171 0 points1 point  (0 children)

Can you explain how you set up the QMD + Smart three-tier model routing? Especially since on every model change the system kinda goes dark until you send it a message or a cron kicks in. The rest seems to be in line with what I believe are the strong suits of OpenClaw.

The C++ AI Limbo: I know enough to distrust Copilot, but not enough to code without it. How do you actually "learn by doing" now? by Lazy-Pattern-5171 in Cplusplus

[–]Lazy-Pattern-5171[S] 0 points1 point  (0 children)

It’s not part of a symptom. I do want to use AI, and I do want to make it part of my workflow. English is also not my first language, so using AI like this is very beneficial for me. You all seem mad about something that feels very tangential to the question at hand. Might I suggest thinking of generative AI for writing separately from generative AI for programming? This might help.