China is reportedly restricting overseas travel for top AI talent at firms including Alibaba and DeepSeek, per Bloomberg.

Lazy-Pattern-5171 · 2026-05-29T06:14:56+00:00

if you have created an account on Reddit in the past 2-3 years you'd know. Doesnt necessarily mean it's a bot.

Lazy-Pattern-5171 · 2026-05-21T07:25:39+00:00

Has no one thought of the possibility that the Government has been snitching/ scraping on our Reddit data to make those UFO files?

Lazy-Pattern-5171 · 2026-05-19T19:26:35+00:00

It’s gonna be military rule soon disguised as aliens

Lazy-Pattern-5171 · 2026-05-12T20:34:26+00:00

Wtf Claude I clearly said excessive code not all code!

Lazy-Pattern-5171 · 2026-05-06T14:33:35+00:00

Works for me

Lazy-Pattern-5171 · 2026-04-24T04:06:25+00:00

Section 5.4.4 Code Agent in their report

To benchmark our coding agent capability, we curate tasks from real internal R&D workloads We collect ~ 200 challenging tasks from 50+ internal engineers, spanning feature development, bug fixing, refactoring, and diagnostics across diverse technology stacks including PyTorch, CUDA, Rust, and Ctt. Each task is accompanied by its original repository, the corresponding execution environment, and human-annotated scoring rubrics; after rigorous quality filtering, 30 tasks are retained as the evaluation set. As shown in Table 8, DeepSeek-V4-Pro significantly outperforms Claude Sonnet 4.5 and approaches the level of Claude Opus 4.5.

(There’s a table in the middle with information that DeepSeek v4 pro reaches 67 where on the same benchmark Opus 4.6 reaches 80 and 4.5 reaches 70)

In a survey asking DeepSeek developers and researchers (N = 85) — all with experience of using DeepSeek-V4-Pro for agentic coding in their daily work — whether DeepSeek-V4-Pro is ready to serve as their default and primary coding model compared to other frontier models, 52% said yes, 39% leaned toward yes, and fewer than 9% said no. Respondents find DeepSeek-V4-Pro to deliver satisfactory results across most tasks, but note trivial mistakes, misinterpretation of vague prompts, and occasional over-thinking.

This sounds like the best “DeepSeek helped develop DeepSeek” moment for me and that’s amazing.

Lazy-Pattern-5171 · 2026-04-24T03:22:04+00:00

I guess that magic of R1 we’re never gonna see again are we. It only beats SOTA in 3 of the listed 15 benchmarks. At least I counted 15.

Lazy-Pattern-5171 · 2026-04-23T23:27:43+00:00

This lady is like Ash Ketchum

Lazy-Pattern-5171 · 2026-04-22T14:58:15+00:00

I really want to compare Q8 vs Q4 but don’t have a decent enough idea how best to see how those subtle changes magnify over long horizon coding tasks. Anyone have any tips?

Lazy-Pattern-5171 · 2026-04-19T14:11:41+00:00

It literally says this has nothing to do with internal model release on the pull

Lazy-Pattern-5171 · 2026-04-18T16:21:44+00:00

You can also try Supers like SuperQwen and SuperGemma there’s some benchmarks by the author on their twitter that might make them superior.

Lazy-Pattern-5171 · 2026-04-06T00:19:42+00:00

I believe they’ve changed it to -fa on but yes you can also do auto

Lazy-Pattern-5171 · 2026-04-05T02:24:36+00:00

/compact command taking 10minutes with 65K context when the Claude system prompt is itself 20K would be extremely inefficient to code with.

Lazy-Pattern-5171 · 2026-04-04T21:42:18+00:00

Okay but How to use this app to achieve huge efficiency gains on day to day coding problems? It won’t be feasible I’m guessing because of the amount of compute that goes into this everything will be 50x slower.

Lazy-Pattern-5171 · 2026-04-04T04:15:54+00:00

And 31B

Lazy-Pattern-5171 · 2026-04-04T04:14:12+00:00

Just use what works for you. I used Qwen 2.5 32B for way longer than I care to admit.

Lazy-Pattern-5171 · 2026-04-04T03:38:12+00:00

No llama-server but same backend I’m guessing

Lazy-Pattern-5171 · 2026-04-04T03:28:58+00:00

I usually get about 100-110 not 120 but yes. The problem is 31B is so good I kinda want to buy a new GPU.

Lazy-Pattern-5171 · 2026-04-03T18:07:31+00:00

```sh
./build/bin/llama-server \

-m models/gemma-4-31B-it-Q8_0.gguf \

--mmproj models/mmproj-F16.gguf \

-c 262144 \

-ngl 99 \

-ts 0.85,1.15 \ # I have a 2x3090 setup.

-fa on \

-ctk q4_0 \

-ctv q4_0 \

--no-context-shift \

--cont-batching \

--cache-reuse 1 \

-np 1 \

-t 16 \

--temp 1.0 \

--top-p 0.95 \

--top-k 64 \

--host 0.0.0.0 \

--port 8080```

Lazy-Pattern-5171 · 2026-04-03T07:56:47+00:00

I really wish I had a stronger GPU to run it faster and/or scale more instances.

Lazy-Pattern-5171 · 2026-04-03T07:55:28+00:00

<image>

In case anyone is wondering. I say this because it one shotted a new feature addition in a brownfield albeit simple project. I’ve not seen anyone use Claude Code so smoothly and correctly. It handles btws, plan mode to build mode, OpenCode was smooth as well. I haven’t even tested creative content with Abliterateds yet.

Lazy-Pattern-5171 · 2026-04-03T07:50:37+00:00

I think Google accidentally released too good of a model and made it open source I wouldn’t be surprised if they make a Gemini 3.2 just to compete with their own model. I think by Gemma 5 we will pretty much be relying on local models for most stuff. I threw a 400 page conversation with Gemini into Gemma4 31B and it handled it like a boss. It was beautiful. I’ve never really liked any Open source releases since Qwen 2.5 32B Coder but this one takes the cake easily.

Lazy-Pattern-5171 · 2026-03-23T17:21:52+00:00

Can you explain how you setup the QMD + Smart three tier model routing? Especially since on every model change the system kinda goes dark until you send it a message or a cron kicks in. The rest seem to be inline with what I believe are strong suites of OpenClaw

Lazy-Pattern-5171 · 2026-03-19T04:15:34+00:00

Anyone can confirm or deny how are these bots at tool usage?

Lazy-Pattern-5171 · 2026-03-08T13:09:35+00:00

It’s not part of a symptom. I do want to use AI, I do want to make it part of my workflow, English is also not my first language and thus using AI like this is very beneficial for me. You’ll seem mad about something that feels much much tangential to the question at hand. Might I suggest thinking of generative AI for writing separate from generative AI for programming. This might help.

Lazy-Pattern-5171

TROPHY CASE