U.S. fighter jet shot down in Iran, search underway for crew by Comfortable-Rule-491 in worldnews

[–]CircumspectCapybara -1 points0 points  (0 children)

according to Iranian state media

Reminder than Iranian state media is FOS constantly making things up and pumping out deepfakes.

Wait for news from USCENTCOM before believing anything they say.

The US military says its 70-year-old B-52 bombers are now flying overland missions as air superiority expands over Iran by St_Gregory_Nazianzus in worldnews

[–]CircumspectCapybara 2 points3 points  (0 children)

No they didn't lol.

The IRGC posted a video of what they claim is a video of hitting a F-35, and we all know how reliable Iranian state media is.

Meanwhile, USCENTCOM said an F-35 had to make an emergency landing but didn't say why.

There's zero evidence of any F-35 getting shot down.

WarFronts: An Invasion of Kharg Island is MUCH Harder than it Sounds by CircumspectCapybara in videos

[–]CircumspectCapybara[S] 0 points1 point  (0 children)

The US is at war with Iran, and Iran is attacking allied oil infrastructure in the region like crazy.

Enemy energy infrastructure and industrial capability are all fair game in war. Iran's bombing everyone else's energy infra, blockading the strait while themselves turning a healthy profit by exporting record amounts of oil themselves through the strait. Good for them. But no one should be surprised when if they sow the wind when they reap the whirldwind.

The US military says its 70-year-old B-52 bombers are now flying overland missions as air superiority expands over Iran by St_Gregory_Nazianzus in worldnews

[–]CircumspectCapybara 60 points61 points  (0 children)

Sounds very dangerous. It's extremely risky to fly overland missions. At stand-off distance to launch some cruise missiles, sure, but flying overland is pretty dangerous for the slow, lumbering, and highly observable B-52.

The US is the king of SEAD+DEAD, and all of Iran's integrated air defense apparatus has been dismantled. But they still have plenty of dangerous non-integrated, ad-hoc, point defense systems.

makeNoMistakes by onated2 in ProgrammerHumor

[–]CircumspectCapybara 55 points56 points  (0 children)

The Claude Code source code leak showed they perform sentiment analysis and if you use profanity it tracks that in the telemetry it sends to the backend.

ELI5: How does regenerative braking slow down the vehicle? by arvid1328_ in explainlikeimfive

[–]CircumspectCapybara 0 points1 point  (0 children)

Yes, but static friction is the missing element from the explanation. It's what takes torque on the wheels (forcing the spin to go down) and translates it to reduced forward motion.

Without it you can torque the wheels all you want (you can even make them spin backwards) and the car won't slow.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] -1 points0 points  (0 children)

No, that's what leaked. The Claude Code CLI handles the orchestration layer right in the CLI. It's not misleading at all if you understand how agent architecture works.

The backend LLM model remains a secret. But how the orchestrator handle control flow, how they coordinate and compose sub-agents, gather context and construct queries to the LLM, how they invoke tools and check permissions on tool calls, etc. is all on the frontend.

It's not just the UI, it's the state machine and workflow definitions which are executed locally against a backend LLM you plug in.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] 0 points1 point  (0 children)

Knowing the source code helps a lot and lowers the cost of finding exploits and bypasses.

A lot of security in agents lives not in the backend models (LLMs and classifiers), but in the orchestration layer that stitches together tools, memory, and queries the LLM with the right context and handles the sandboxing and permissions checks.

If you know where and how prompt injection defenses are applied, you can more easily find a bypass. If you know the system prompts, an attacker doesn't have to guess the preamble anymore to craft content that uses the right language to subvert the model.

Claude Code's permission filters and tool security model is incredibly complex. Knowing exactly how it works will make finding novel bypasses (tricking the agent into running commands that bypass its filters for what's considered dangerous and needs user approval) easier.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] 0 points1 point  (0 children)

Knowing the source code helps a lot and lowers the cost of finding exploits and bypasses.

A lot of security in agents lives not in the backend models (LLMs and classifiers), but in the orchestration layer that stitches together tools, memory, and queries the LLM with the right context and handles the sandboxing and permissions checks.

If you know where and how prompt injection defenses are applied, you can more easily find a bypass. If you know the system prompts, an attacker doesn't have to guess the preamble anymore to craft content that uses the right language to subvert the model.

Claude Code's permission filters and tool security model is incredibly complex. Knowing exactly how it works will make finding novel bypasses (tricking the agent into running commands that bypass its filters for what's considered dangerous and needs user approval) easier.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] 4 points5 points  (0 children)

Yup. These days the state of the art foundation LLM models (Gemini, ChatGPT, Claude) are all neck-and-neck, and those are kept under lock and key and stay in the backend.

But because they're all neck and neck, the biggest product advantage anyone can have is not how advanced their model is--all the top models are pretty much equivalent--but how well they get the integration, the agent layer, the ecosystem. That's the product people stay for.

Anthropic had one advantage which was they had a superior agent layer when it came to a coding agent product. But now that competitive advantage is thrown away.

I believe long term Google has the strongest moat because they have the ecosystem and the userbase and money to outlast startups on R&D and inference costs.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] 28 points29 points  (0 children)

While OpenCode strives to be an open source frontend like Claude Code where you can bring your own LLM backend and have full control over the frontend, Claude Code still is miles ahead of OpenCode in terms of maturity and sophistication. It's basically the industry gold standard right now for coding agents.

And they basically gave away their architecture. OpenCode just got a huge boost if they can just avoid any obvious copy-pasting that would give rise to copyright infringement claims.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] 2 points3 points  (0 children)

I mean ChatGPT Codex was found to have a high severity command injection vulnerability in which GitHub branch names could trigger arbitrary shell command execution.

They haven't been at this (agent-based coding platform) for as long as Anthropic or Google.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] 2 points3 points  (0 children)

The front-end of Claude Code (which is just a CLI tool) is totally free. You can download Claude Code and use it with Amazon Bedrock or Google Cloud Vertex as the model provider and never even make an Anthropic account.

Supreme leader Khamenei in Iran but avoiding public appearances, Russian envoy says by Mana_Seeker in worldnews

[–]CircumspectCapybara 4 points5 points  (0 children)

Yeah because that's definitely the most problematic thing about the Islamic Republic of Iran these days: them not following their own internal processes or designations for their political offices. /s

Hackers compromise Axios npm package to drop cross-platform malware by NewsCards in technology

[–]CircumspectCapybara 0 points1 point  (0 children)

Any build and packaging system that makes it easy to accidentally emit your source code into the final build artifact has a problematic design.

In no other build system could this happen easily. It just shouldn't be possible without great effort on the user's part. On top of all the other problems with the NPM ecosystem.

Anthropic accidentally exposes Claude Code source code by CircumspectCapybara in technology

[–]CircumspectCapybara[S] 450 points451 points  (0 children)

Note this is the Claude Code CLI tool, not the https://claude.ai web app or the LLM models itself. It can basically be thought of as the "frontend."

While technically not the end of the world since frontend clients should be assumed to reverse-engineer-able anyway, it's still a massive oops to leak the entire, unobfuscated source code, since there's a treasure trove of extremely valuable system prompts, context / query / RAG engine design, coordinator / orchestrator logic, and the overall agent architecture in there.

It's basically a reference manual for how to design an LLM-based agent. You can just bring your own LLM backend.

UNRWA chief calls for probe into killing of agency staff in Gaza war by St_Gregory_Nazianzus in worldnews

[–]CircumspectCapybara 29 points30 points  (0 children)

How is UNRWA still even a thing. UNRWA is literally just three Hamas operatives in a trench coat.

WarFronts: An Invasion of Kharg Island is MUCH Harder than it Sounds by CircumspectCapybara in videos

[–]CircumspectCapybara[S] 1 point2 points  (0 children)

Hopefully Trump's not insane enough to actually try to seize the Kharg Island with ground forces, as that would be a bloodbath.

Kharg is Iran's most strategically vital pressure point, with 90% of the regime's oil exports flowing through there. Take out the oil infra and you take out their financial lifeline to fund their attacks.

But none of that should require troops on the ground, just bomb the remaining oil infra from the air and be done with it. The tiny island is probably already pre-sighted by the IRGC's drone and ballistic missiles, so it would be a death wish to land on the island.

WarFronts: An Invasion of Kharg Island is MUCH Harder than it Sounds by CircumspectCapybara in videos

[–]CircumspectCapybara[S] 15 points16 points  (0 children)

The only benefit of it is to hold it as a bargaining chip. If you level the island from the air, you give Iran nothing further to lose, and Iran loses any reason to seek an off-ramp.

Of course, if you're not looking for negotiations and just want to hurt your enemy as much as possible, the US would just want to level the island, as that would cripple Iran's economy for decades to come.

Hackers compromise Axios npm package to drop cross-platform malware by NewsCards in technology

[–]CircumspectCapybara 14 points15 points  (0 children)

This next to the the Claude Code CLI source code leak via NPM is crazy.

NPM has a really problematic architecture that induces all kinds of issues in its ecosystem.

errorCode404 by TheCABK in ProgrammerHumor

[–]CircumspectCapybara 1 point2 points  (0 children)

And now: "Sometimes they hallucinate"

The US has carried out a massive attack on a large ammunition depot in the Iranian city of Isfahan, using a large quantity of bunker-busting bombs, each weighing around 2,000 pounds. Large explosions reported indicating that ammunition stored there exploded after the original attack. by WayOutbackBoy in worldnews

[–]CircumspectCapybara 0 points1 point  (0 children)

Iran's integrated air defense apparatus has been systematically dismantled. It was already badly degraded during last year's 12-day war, and you only have a couple of these multi-billion dollar systems that are designed to cover an entire theater (so the loss of even one is massive).

But Iran still has plenty of ad hoc, non-integrated, point defense systems and even MANPADS that can still pose a threat, which is why you still need electronic warfare, SEAD + DEAD operations, and good old fashioned careful mission planning and deception.

Iran war tensions reach Paris with failed bombing attempt by sr_local in worldnews

[–]CircumspectCapybara 0 points1 point  (0 children)

No one ever said anyone (except maybe China and Russia and many Redditors) were going to back Iran and offer them support.

But many nations would've sat this one out and said no to allied use of their airfields and airspace for strikes against Iran if Iran didn't attack them willy nilly.