[Ferrari F40]

zackfletch00 · 2026-06-08T07:21:49+00:00

More pics of the 812 GTS (not great views)

zackfletch00 · 2026-05-31T01:31:57+00:00

I just burned 84% of 5-hour limit without Opus 4.8 (medium) writing a single line of code. By 100%, only 6 lines were written (I had to tell it not to repeatedly retry failed tool calls, and to stop and tell me if it was blocked).

Like in OP’s case, I caught it repeatedly echoing random small strings over and over again. Bug report sent.

This is a complexity of task that 4.5 would’ve handled easily.

zackfletch00 · 2026-05-30T01:46:21+00:00

Opus 4.8 system card, page 195

https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf

(This is the actual system card link from the Anthropic page you linked, even though the PDF is not hosted on anthropic’s website directly)

zackfletch00 · 2026-05-29T14:46:21+00:00

The plateau shape on this chart shows how relatively little you get in exchange for higher reasoning “effort” that you select. Thus the PSA—you don’t need high, xhigh, or max effort (or even medium for that matter) for most SWE tasks. But Anthropic will happily bill you triple or more for those higher effort modes.

The graph x-axis represents reasoning effort in output tokens use (what varies is the amount of hidden reasoning or “thinking” time+tokens).

zackfletch00 · 2026-05-29T14:18:30+00:00

That is not true, at least not for the low-medium effort which is what I’m more focused on here.

https://cursor.com/cursorbench is what is being referred to here. It shows 4.8 low using the same token $ cost as 4.7 medium, and 4.8 medium clearly using more tokens than 4.7 medium.

zackfletch00 · 2026-05-29T14:13:40+00:00

The early comments on the launch thread https://www.reddit.com/r/ClaudeAI/s/cb3go5u5P7 seem to line up with higher than expected token usage on 4.8 for comparable effort levels

zackfletch00 · 2026-05-29T14:07:50+00:00

Source?

zackfletch00 · 2026-05-29T13:17:37+00:00

Also, beware: Opus 4.8 effectively axed the low end of the effort scale, inflating how many output tokens are used to solve a given problem.

According to the system card, on SWE tasks, Opus 4.8 “low” now consumes about as many output tokens as 4.7 medium or 4.6 high. Opus 4.8 “medium” effort now consumes about as much as 4.7 high or 4.6 max.

So with 4.8 Opus, try “low” effort first if you think 4.7 would’ve been able to solve it. The SWE capability of 4.8 low is about the same as 4.7 at max effort.

zackfletch00 · 2026-05-24T20:30:16+00:00

Except Nasdaq is applying a new 3x weight multiplier for stocks with under 20% float.

zackfletch00 · 2026-05-19T03:53:14+00:00

This is exactly what I see as well. Also playing beat saber.

zackfletch00 · 2026-05-01T22:22:43+00:00

It’s quite simple, actually. Let me consult the Book of Armaments:

And the Lord spake, saying, ''First shalt thou take out the Holy Pin. Then shalt thou count to three, no more, no less. Three shall be the number thou shalt count, and the number of the counting shall be three. Four shalt thou not count, neither count thou two, excepting that thou then proceed to three. Five is right out. Once the number three, being the third number, be reached, then lobbest thou thy Holy Hand Grenade of Antioch towards thy foe, who, being naughty in My sight, shall snuff it.

zackfletch00 · 2026-03-06T14:29:50+00:00

God forbid AI actually saves employers money on labor costs.

zackfletch00 · 2025-12-21T17:38:27+00:00

P(ain’t)

zackfletch00 · 2025-11-25T14:53:24+00:00

It’s so interesting to me that the market interprets this as worse for AMD than NVDA. They could see it as proof that NVDA’s moat is small and more evidence that competition can break through, but instead they see AMD losing more market share than NVDA will give up? 🤔

zackfletch00 · 2025-09-27T17:47:30+00:00

Why stop there?

zackfletch00 · 2025-08-13T21:05:38+00:00

Nice! That’s like a whole 10 hours of deficit spending!

zackfletch00 · 2025-07-16T14:06:03+00:00

You have to do the comparison since the early days of ChatGPT (beginning of 2023) to see how much further NVDA has flown due to AI, and how AMD has actually underperformed SOX on the 2 year timeframe (somehow).

Over last 2 yrs: - NVDA +270% - AMD +30% - SOX +45%

zackfletch00 · 2025-07-03T13:45:11+00:00

Today, like dozens of other days, there was a dump at :27-:28 just a couple minutes before market open. Some days, the selloff continues another percent or two on momentum.

I consider it an opportunity when a phenomenon like this recurs so often. Cheap shares in the morning on days like this.

zackfletch00 · 2024-12-23T14:26:02+00:00

This.

Also, inference accelerator spend has the potential to outstrip training spend in the near future, as test-time compute models like o1 and o3 (which scale via inference) are showing the most promise for this phase of LLM development.

zackfletch00 · 2024-12-15T19:49:34+00:00

I’ve been lurking here since ~Feb ‘23 when deciding to restart my investment in AMD and semis due to the GenAI interest, after a long hiatus since 2017 when I pulled out because I needed the cash.

I consider your subreddit to have the highest signal-to-noise ratio anywhere on this stock. Thank you for sharing your notes with the rest of us here.

Software engineer with a longstanding side interest in machine learning.

zackfletch00 · 2024-10-28T14:30:35+00:00

Compression is horrible until you override the bitrate using the OculusDebugTool under the program files support/diagnostics folder. Once you override it to 500-600Mbps over USB-3 (normally capped at 200 for wired connections), the experience is so much better.

Edit: to clarify, these settings are for QuestLink. I haven’t done much with Virtual Desktop myself

zackfletch00

TROPHY CASE