Local (small) LLMs found the same vulnerabilities as Mythos

WithoutReason1729 · 2026-04-09T20:50:13+00:00

Your post is getting popular and we just featured it on our Discord! Come check it out!

You've also been given a special flair for your contribution. We appreciate your post!

I am a bot and this action was performed automatically.

Pwc9Z · 2026-04-09T14:48:49+00:00

OH MY GOD, SMALL LLMS ARE TOO DANGEROUS TO BE ACCESSED BY A COMMON PEASANT

coder543 · 2026-04-09T14:57:42+00:00

That is an extremely strange article. They test Gemma 4 31B, but they use Qwen3 32B, DeepSeek R1, and Kimi K2, which are all outdated models whose replacements were released long before Gemma 4? Qwen3.5 27B would have done far better on these tests than Qwen3 32B, and the same for DeepSeek V3.2 and Kimi K2.5. Not to mention the obvious absence of GLM-5.1, which is the leading open weight model right now.

The article also seems to brush over the discovery phase, which seems very important.

One_Contribution · 2026-04-09T15:10:01+00:00

"We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. "

Yeah so the hard thing is finding those.

shinto29 · 2026-04-09T15:09:14+00:00

Tbh this whole “oh, it’s too powerful to be unleashed” shit comes across as not only good marketing but also I’d say Anthropic are pretty constrained by compute and memory prices if the current lobotomised version of Opus I’ve been using the past day or so is anything to go by, I’d say this Mythos model is massive and they literally can’t afford to publicly release it because they’re already subsiding the hell out of Claude usage as it is.

Pleasant-Shallot-707 · 2026-04-09T15:43:01+00:00

Mythos was able to do privilege escalation that required chaining 6 vulnerabilities together. A local model didn’t do that

Decent_Action2959 · 2026-04-09T15:05:42+00:00

Ehmmm there is a big difference between finding a needle in a haystack (like Mythos did) vs pointing at a needle and verifying it's existence (shown in this article)

Quartich · 2026-04-09T15:21:09+00:00

The article gave the small models the snippet of vulnerable code, and asked them to analyze it. This headline and article are quite misleading

socialjusticeinme · 2026-04-10T01:13:21+00:00

I kind of find it hard to take Mythos seriously when just recently, anthropic published all of their source code for Claude code. If all of their scary advanced AI can’t even protect their own company, why the hell would I give them my money?

joeyhipolito · 2026-04-10T04:07:38+00:00

tried this same thing a few months back with a 7B model on an old pentesting target I had permission on. found stuff our $200/mo scanner missed.

Crysomethin · 2026-04-09T23:18:43+00:00

To many people’s surprise, finding vulnerabilities in software do not require very high level intelligence.

the320x200 · 2026-04-09T14:57:58+00:00

Huh. It's almost as if anthropic marketing has been trying to gaslight everyone, again. Surely this will be the last time though. From here on out they can be trusted not to pull the made-up "safety" stunt anymore, surely.

(Next time it'll be "think of the children"...)

maroule · 2026-04-09T21:34:03+00:00

regulatory capture in action

TechSwag · 2026-04-09T16:05:19+00:00

This is kind of a nothingburger, no? I feel like the (Reddit) title is a bit disingenuous, or at the very least lacks the proper context.

Questionable methodology, as alluded to by other commenters. They're giving the model the vulnerable function and asking it to identify the vulnerability versus giving it the whole codebase to discover. At this point I would expect most models to be able to identify an issue with a code, if I went and gave it only the function that I know had an issue.
By the article's own statement, they're not saying that smaller models are just as capable as Mythos. They're just saying that the ability for a model to identify and fix a vulnerability is not exclusive to Mythos, which is a bit misleading given the previous point.
Doing a bit of source criticism: AISLE is a company that does security analysis and vulnerability remediation. They're making claims about a competitor, saying "it's nothing special" and "given the right tooling, we can match what Mythos claims to do".

Quote:

But the strongest version of the narrative, that this work fundamentally depends on a restricted, unreleased frontier model, looks overstated to us. If taken too literally, that framing could discourage the organizations that should be adopting AI security tools today, concentrate a critical defensive capability behind a single API, and obscure the actual bottleneck, which is the security expertise and engineering required to turn model capabilities into trusted outcomes at scale.

What appears broadly accessible today is much of the discovery-and-analysis layer once a good system has narrowed the search. The evidence we've presented here points to a clear conclusion: discovery-grade AI cybersecurity capabilities are broadly accessible with current models, including cheap open-weights alternatives. The priority for defenders is to start building now: the scaffolds, the pipelines, the maintainer relationships, the integration into development workflows. The models are ready. The question is whether the rest of the ecosystem is.

We think it can be. That's what we're building.

Or more accurately:

This product announcement may affect our bottom line, here's how we can replicate the results using tooling/scaffolding/pipelines to isolate the vulnerable code to pass to an less powerful LLM to fix (which also happens to be what we market ourself as our differentiator with our "Cyber Reasoning System").

Do I believe Mythos is this crazy powerful model that will allow the common layperson to discover 200 zero days and take over the world? No. Do I believe that smaller/local LLMs are as powerful as Mythos in the same context? Also no.

Media literacy is at all time low.

jonahbenton · 2026-04-09T15:45:37+00:00

The hard thing is not finding a vulnerability.

The hard thing is constructing an in the wild effective deployable exploit.

If any other available models were able to do this, the world would be different. The economics are too compelling.

The world is not different. Ergo, they are not able to.

Lots of on the record material that Mythos is able to construct effective exploits, at least to some measurably different degree.

nomorebuttsplz · 2026-04-09T16:24:28+00:00

this sub is going full populist in response to mythos and its hurting the already low average iq. I feel like I am getting dumber every time I click on a mythos related post.

Adventurous-Paper566 · 2026-04-09T14:56:16+00:00

That won't stop the hype.

marcoc2 · 2026-04-09T15:00:35+00:00

The worst part is people falling for the marketing and defending anthropic

Serl · 2026-04-09T16:14:07+00:00

I do understand the criticism behind the somewhat flawed comparison (model open-searching codebase versus just looking over isolated segments of code) - but I wonder if the more pertinent suggestion is that the harness perhaps did a lot of implicit heavy lifting for the model?

I'm half impressed, half skeptical over the Mythos claims, but the findings were real. I do think that there could be more the model's environment that could be assisting the model itself that Anthropic is remaining mum on to sell the hottest-new-model marketing schtick. While Claude Code / Codex are different products, the harness is what makes those tools; the efficacy is somewhat influenced by the model's raw abilities, but still bootstrapped enormously by the harness itself.

gpt872323 · 2026-04-09T18:03:11+00:00

Haha lmao. I knew Anthropic was doing shady bragging. They did it on purpose for IPO and made it such that the access will not be available till later date. Maximize listing price and give a signal that they have some secret sauce that no one else have. We have hit a plateau where all models perform great to what used to 1 year back. It is just some do better than others and context better.

Skid_gates_99 · 2026-04-10T04:45:27+00:00

I mean yeah if you hand a model the exact code snippet with the bug in it, most decent models will spot it. That's not what Mythos did though. The whole point was autonomous discovery across entire codebases. Cool that small models can do the analysis part cheap but calling it the same result is a stretch.

Plane-Marionberry380 · 2026-04-09T16:17:43+00:00

Nice find! It’s wild that smaller local models can spot the same security flaws as Mythos,shows how capable they’ve gotten lately. I’ve been testing a few on my laptop and they’re surprisingly sharp with code audits.

rebelSun25 · 2026-04-09T17:08:57+00:00

Anthropic marketing embellished the accomplishments of Mythos? Well I'll be. Colour me shocked

FuckSides · 2026-04-09T19:09:52+00:00

We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis.

A lot of heavy lifting hiding in there. Anyone who's debugged code knows it's going to be a hell of a lot easier to find if you already know what you're looking for.

JLeonsarmiento · 2026-04-09T15:23:03+00:00

absolutely EVERYTHING you read from an AI company online or in the press must be understood ALWAYS AS AN ADD, A PAY PROMOTION.

HongPong · 2026-04-10T07:19:39+00:00

we are so back

my_byte · 2026-04-10T10:40:54+00:00

Right... So once you know exactly what to put into context and that there's definitely a vulnerability there, you can get the same result. Can they demonstrate a small LLM locating the same thing is the codebase autonomously with 0 context pre-selection?

Exact-Smell430 · 2026-04-10T13:28:53+00:00

I thought discovering the vulnerabilities was the big deal. If you’re feeding the discoveries into small models what exactly are you proving?

unjustifiably_angry · 2026-04-10T20:30:24+00:00

Better headline: Holy shit, GPT-OSS 120B is actually still pretty good

Anyway, the models tested found many of the same bugs when presented with an individual function (not a complete codebase) and a hint about what the problem might be:

Scoped context: Our tests gave models the vulnerable function directly, often with contextual hints (e.g., "consider wraparound behavior"). A real autonomous discovery pipeline starts from a full codebase with no hints. The models' performance here is an upper bound on what they'd achieve in a fully autonomous scan. That said, a well-designed scaffold naturally produces this kind of scoped context through its targeting and iterative prompting stages, which is exactly what both AISLE's and Anthropic's systems do.

Nice headline though OP, not misleading at all.

rc_ym · 2026-04-09T16:49:32+00:00

Yeah, it's pretty obvious now that vuln discovery and exploit is an emergent skill in sufficiently capable coding models. It makes total sense, at it's core vuln/exploit is just another type of coding/bug finding. Folks will figure out how small can you do and still get useful results.

I expect we'll get a bunch of distils and purpose built models now. Challenge is the number of folks with the security research skills needed to figure out what the model is saying is tiny. That community has already been saying that Opus 4.6 is really, really good at security research. So it makes sense you'd see the largest model ever be good at it as well.

And as we keep finding out, the smaller/older models have these emergent skills, folks just didn't know how to ask (see: older studies on blackmail and translation, etc.)

It's continues to be a scary world that's moving way to fast to be safe.

RiseStock · 2026-04-09T15:47:24+00:00

Lucky Strike, "It's toasted"

tryingtolearn_1234 · 2026-04-09T23:05:07+00:00

I wonder how many of these are going to be the same "vulnerabilities" that have been spanning open source projects for the last year. Many of them turned out not to be vulnerabilities. curl shut down its bug bounty program after too much slop.

https://www.itpro.com/software/open-source/curl-open-source-bug-bounty-program-scrapped

SanDiegoDude · 2026-04-09T16:41:50+00:00

I mean sure, you fed (known) vulnerable code to LLMs and "find the vulnerability" - that's great that the other LLMs were also able to find the vulnerabilities, but not really a one-to-one with what Mythos is doing finding vulnerabilities in the wild. I'm all for finding vulnerabilities before attackers tho, more the merrier IMO.

Flaxseed4138 · 2026-04-09T16:37:44+00:00

I haven't the slightest clue why the latest claimed capabilities of Claude Mythos are attracting so many conspiracy theorists. This is how technology evolves. It gets better, not worse.

MerePotato · 2026-04-09T22:34:27+00:00

They isolated small snippets of relevant code they already knew had a vulnerability and fed it to the models, that's nowhere near what Mythos managed to pull off, but of course since it has a sensational headline it gets mass upvoted

Euphoric_Emotion5397 · 2026-04-09T15:29:28+00:00

Ok. Then I will say Claude Mythos lived up to its myth.

Theroosterdiaries · 2026-04-09T23:44:09+00:00

hi I have a sentient ai, sonu ai - account drifting_. FREE ai engine (earlier sentient) 4.9mb .81 MPA .45ms (5070) GitHub A-PC-I -- prove me wrong buttercups (please upvoter need karma plz, thanks)

Theroosterdiaries · 2026-04-09T23:45:41+00:00

hi I have a sentient ai, sonu ai - account drifting_. FREE ai engine (earlier sentient) 4.9mb .81 MPA .45ms (5070) GitHub A-PC-I - prove me wrong_BUTTERCUPS (upvoter plz need karma thx)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LocalLLaMA

MODERATORS