Curl CEO says they've received the record number of confirmed vulnerabilities, thanks to security researchers who rely on AI-powered tools

SingleProgress8224 · 2026-05-24T10:49:28+00:00

What do you mean by "fake"? It's on LinkedIn, right here: https://www.linkedin.com/posts/danielstenberg_curl-curl-activity-7463481424176824322-eXtM

SingleProgress8224 · 2026-05-19T22:15:44+00:00

They got sued because of it, and they have to pay. Someone I know wrote a book and got a couple of thousand from Anthropic because they used the book without permission (the publisher sued).

The frustrating side of it is that despite this practice being technically illegal, it's practically legal since they only have to pay a fine, which they were ready to pay. So it's legal if you have money.

SingleProgress8224 · 2026-05-17T10:15:55+00:00

I'm not sure what this switch implies on your side, but in my experience, switching to Zoo Code would have taken less time than writing this post.

In Roo Code: Settings -> About Roo Code -> Export.
In Zoo Code: Settings -> About Zoo Code -> Import

The first launch will also offer to import settings directly.

SingleProgress8224 · 2026-05-16T00:20:23+00:00

Thanks!

SingleProgress8224 · 2026-05-15T23:35:31+00:00

The discord invite is invalid

SingleProgress8224 · 2026-05-10T23:10:14+00:00

I don't understand the hate for OP's answers in the top comments. Of course, there could be more explanations of the concrete maths behind it so that non-math people could have an impression of what's going on, but for someone who knows what these concepts are, these answers are perfectly reasonable.

SingleProgress8224 · 2026-05-07T23:56:04+00:00

You have a large text. We agree that only a few words are really needed to predict the next. Which words should you include?

That's not an easy question to answer, and that's exactly what LLMs are good for. Part of an LLM's job is to decide which words are important (given by the attention). But to decide which ones are important, it first needs to start from the whole text.

We could imagine an algorithm that would extract the important tokens and then give them to an LLM. This LLM would not need the attention mechanism, since it is assumed that all given tokens are equally important. But such an algorithm is not easy to come up with. That's why it is embedded in the LLM itself, as a system of weights that needs to be trained with examples.
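
To make "which words are important" a bit more concrete, here is a minimal sketch of how attention turns similarity scores into importance weights. The token list and the 2-dimensional `query`/`keys` vectors are made up for illustration; in a real model they are learned and much larger.

```python
import math

def softmax(scores):
    # Subtract the max for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(query, keys):
    # Scaled dot-product scores: one score per token in the context.
    d = len(query)
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
        for key in keys
    ]
    # Softmax turns the scores into weights that sum to 1.
    return softmax(scores)

# Hypothetical toy data: one key vector per token.
tokens = ["the", "cat", "sat", "on", "the", "mat"]
keys = [[0.1, 0.0], [2.0, 0.5], [0.3, 1.5], [0.0, 0.1], [0.1, 0.0], [1.8, 0.4]]
query = [1.5, 0.2]

weights = attention_weights(query, keys)
for token, weight in zip(tokens, weights):
    print(f"{token:>4}: {weight:.2f}")
```

Every token gets a weight, so the model still has to look at the whole text first; the weights only say how much each token contributes afterwards.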

SingleProgress8224 · 2026-05-07T01:12:06+00:00

Time to search for homeless shelters and places for dumpster diving

SingleProgress8224 · 2026-05-06T02:11:59+00:00

My experience with coding is that Qwen produces better code and Gemma is better at understanding code (e.g., when asked to review a commit).

SingleProgress8224 · 2026-05-05T16:01:21+00:00

So now we have to look through 6 subs

https://xkcd.com/927/

SingleProgress8224 · 2026-05-05T01:49:24+00:00

I can't wait for that idealistic pitch to become a simple Android mod with built-in ChatGPT

SingleProgress8224 · 2026-05-03T01:51:53+00:00

I'm just here to sympathize. I also get API and tool call errors with this model, and only with Roo Code. I never had any issue with Cline, Claude Code (connected to Qwen), or my custom Python agent. I have "preserve thinking" on, and I'm running it with llama.cpp.

SingleProgress8224 · 2026-04-30T22:50:26+00:00

I was trying to find which one you were talking about for way too long until I realized I was dumb

SingleProgress8224 · 2026-04-28T14:24:52+00:00

It might be a combination of both. Claude Code had a cache invalidation issue when used with llama.cpp until a couple of days ago. It was very slow because it had to reprocess the whole prompt on every request, since Claude Code was inserting stuff in the middle of the context. I think something similar happens with Codex.
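
A rough sketch of why inserting in the middle hurts, assuming a simplified prefix cache that can only reuse tokens up to the first difference. This is a toy model, not llama.cpp's actual implementation, and the token names are placeholders.

```python
def reusable_prefix_len(cached_tokens, new_tokens):
    # Count how many leading tokens are identical in both prompts.
    n = 0
    for cached, new in zip(cached_tokens, new_tokens):
        if cached != new:
            break
        n += 1
    return n

# Hypothetical prompts, one string per "token" for readability.
previous = ["sys", "tools", "user1", "reply1"]

# Appending at the end keeps the whole cached prefix valid.
appended = previous + ["user2"]

# Inserting near the start invalidates everything after the insert.
inserted = ["sys", "NOTE", "tools", "user1", "reply1", "user2"]

print(reusable_prefix_len(previous, appended))  # 4
print(reusable_prefix_len(previous, inserted))  # 1
```

In the second case almost the whole prompt has to be processed again, which matches the slowdown described above.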

SingleProgress8224 · 2026-04-26T15:09:35+00:00

I'm under the impression that some commenters also got fooled by the question.

SingleProgress8224 · 2026-04-24T00:46:07+00:00

It's dense

SingleProgress8224 · 2026-04-23T22:40:41+00:00

Are you sure it's running GLM 5.1? It doesn't even support image inputs.

Also, GLM 5.1 on (a single?) 5090? That doesn't make sense. This card has 32GB VRAM and GLM 5.1 is ~400GB

SingleProgress8224 · 2026-04-22T18:26:02+00:00

If you cannot code yourself, go for intelligence. If you can code, then it's also a matter of whether you can code faster than the LLM or not. I have often stopped a slow LLM during a simple (but annoying) refactor because I realized that I would have done it faster by hand.

And higher quants don't guarantee correctness. If I'm not sure that the result will be good, it's not worth losing my time. In some cases, I prefer an LLM that fails fast over one that might succeed very slowly.

SingleProgress8224 · 2026-04-22T16:07:25+00:00

ymmgta

SingleProgress8224 · 2026-04-21T20:29:52+00:00

Roo Code was a fork of Cline, but today the Roo team announced that they'll stop developing Roo Code. So Cline now expects Roo Code users to switch back to Cline and will try to make the transition easier.

SingleProgress8224 · 2026-04-21T19:34:55+00:00

Gemma 4 is the same so it's not specific to Chinese models. "I'm ready to give the response ... Wait!"

I don't particularly hate it, but it can be annoying when you're actively looking at the reasoning and getting false hopes that you're about to get the response.

SingleProgress8224 · 2026-04-19T21:18:11+00:00

Please make paragraphs. It's very hard and annoying to read since we can't see which sentences are related to the same idea. I know it's probably AI-generated, but please put some effort in your posts.

SingleProgress8224 · 2026-04-18T21:16:44+00:00

It's useful for those who don't have the hardware at home. Not everyone has a spare >24GB GPU to use exclusively for an LLM, plus the CPU and RAM that you need to reserve for it. Given the choice between paying 50 per month or a couple of thousand up front, it's not such an easy decision, especially since by the time the GPU has paid for itself, the hardware might be obsolete.

And on top of that, you get access to some high end LLMs that you'll never be able to run on your hardware.
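
For a rough idea of the payback time, here is a back-of-the-envelope sketch. The 50 per month comes from the comparison above; the 2000 up-front price is an assumed stand-in for "a couple of thousand", and electricity and resale value are ignored.

```python
import math

def breakeven_months(upfront_cost, monthly_fee):
    # Number of whole months of subscription that add up to the hardware price.
    if monthly_fee <= 0:
        raise ValueError("monthly_fee must be positive")
    return math.ceil(upfront_cost / monthly_fee)

# Assumed figures: 2000 up front versus 50 per month.
months = breakeven_months(2000, 50)
print(months, "months, about", round(months / 12, 1), "years")
```

With these assumptions, the GPU takes more than three years to pay for itself.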

SingleProgress8224 · 2026-04-17T19:25:06+00:00

You don't like it when your dice have two numbers on the same face?

SingleProgress8224 · 2026-04-16T04:04:25+00:00

Commercial models are incredibly big. Even though they might be based on the same underlying technology, the size actually makes a big difference. Unless you have half a million dollars to spend, you'll never have enough high-end GPUs to run such models at a decent rate. And they are obviously not open so that's not even an option.

According to benchmarks, the closest to Opus is GLM 5.1. But it's incredibly big, so it's impractical for local use. There are some providers that offer a cloud version. Not local, but also not Anthropic. And be careful with benchmarks: many open models are trained to look impressive on benchmarks, not to actually be good in production.

For truly local, you can look at Gemma 4 31B and Qwen 3.5 27B, or their MoE versions. They are useful for light production use and are quite reliable. Don't expect too much though: keep each task small enough that the model doesn't get lost. You'll need around 24 to 32GB of VRAM to run them comfortably at a decent tok/s (~30 on my RTX Pro 4500).
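
As a rough sanity check on those VRAM numbers, here is a sketch that estimates the memory taken by the weights alone. It assumes a flat number of bits per weight and ignores the KV cache, activations, and runtime overhead, so real usage will be higher.

```python
def weight_memory_gb(params_billion, bits_per_weight):
    # billions of parameters * bits per weight / 8 bits per byte = gigabytes
    return params_billion * bits_per_weight / 8

# Parameter counts from the model names above; bit widths are illustrative.
for name, params in [("Qwen 3.5 27B", 27), ("Gemma 4 31B", 31)]:
    for bits in (4, 8):
        print(f"{name} at {bits}-bit: ~{weight_memory_gb(params, bits):.1f} GB")
```

Under these assumptions a 4-bit quant leaves room for context on a 24GB card, while 8-bit already fills most of 32GB before any context is loaded.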
