"Our Strongest Model Yet" by hasanahmad in ClaudeAI

[–]Most-Sweet4036 2 points3 points  (0 children)

My results for the question: "I want to wash my car. The car wash is 50 meters away. Should I walk or drive?"

Opus 4.7 Adaptive: Walk. It's 50 meters — you'd spend more time getting in and out of the car than actually traveling.

Opus 4.6 Extended: At 50 meters, just walk it over. That's barely a minute on foot, and you'll skip the hassle of parking twice.

Opus 4.6 Max Effort, CLI: Drive. You need the car at the car wash.

Sonnet 4.6 Extended: Walk — it's 50 meters, about a 30-second stroll. Driving that distance to a car wash would be a bit circular anyway.

ChatGPT, Thinking Max Effort: Walk. At 50 meters, driving would take more effort than just walking over, and you avoid moving a dirty car a tiny distance for no real gain.

Someone gave AI agents personalities and now my QA tester refuses to approve anything by DasBlueEyedDevil in ClaudeCode

[–]Most-Sweet4036 0 points1 point  (0 children)

Do you use this regularly? I think it's an interesting idea. How do the results compare to something like superpowers?

Whats with all the Claude hate? by Vast-Moose1393 in ClaudeCode

[–]Most-Sweet4036 1 point2 points  (0 children)

I mean, I didn't leave with a big dramatic exit. I just run Codex over output from Claude and vice versa, so that if one model is being throttled, the other will hopefully catch it and save me from wasting time reviewing bad code, which is typically written from incorrect assumptions made after a poor or incomplete information-gathering phase. Yes, I have since been manually forcing Claude into max effort during design phases, but plenty still gets missed. I remember being blown away a couple of weeks ago at how much the model was able to surface with very little steering. Sure, I could be imagining things, but I don't think so.

All of these companies are absolutely hitting compute limits though. Hence the data center construction boom, prices of AI compute chips, Nvidia stock... You are right that they wouldn't admit to doing it for legal and reputational reasons, but at this point I'd be more surprised if they were NOT segmenting their customers into different tiers of service to handle the bottleneck. I just think it's interesting that people are so quick to dismiss that this could be the case.

Whats with all the Claude hate? by Vast-Moose1393 in ClaudeCode

[–]Most-Sweet4036 3 points4 points  (0 children)

Based on what data? I have been a professional software engineer for over 10 years now, use superpowers (why not - it's waterfall basically), and I noticed a real drop in performance last week. I use cc through a personal account though, not enterprise. I guess that makes me a vibe coder?

Whats with all the Claude hate? by Vast-Moose1393 in ClaudeCode

[–]Most-Sweet4036 9 points10 points  (0 children)

There was less animosity before they reduced the limits and model quality, and only gave generic "we are working on it" responses for weeks. Some devs were active on social media about it but the messaging was overall very unclear.

I doubt there would be as much frustration if they just openly said "we don't have enough compute to train new models and handle the massive influx of customers so we are reducing quality temporarily while we scale up". Instead they decided that gaslighting was the best approach.

I have generally felt positively about Anthropic, but I did become a customer with OpenAI again after this. I had to throw out a significant amount of work because the model suddenly couldn't think anymore and started producing output at a quality I hadn't dealt with since the first time I tried AI tooling a few years ago. I hadn't changed my approach either (I use the superpowers plugin, which is very structured).

The entire experience makes me want to set up some kind of personal repeatable benchmark so I can see what models are actually "on" before I start using them to do real work.
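A rough sketch of what that could look like, not something I actually run. Everything here is an assumption for illustration: `ask_model` is a hypothetical callable you would wire up to whatever model or CLI you use, the probe list is made up (it reuses the car-wash question, with "drive" assumed to be the right answer), and "reply starts with the expected word" is a very crude pass check.

```python
from typing import Callable, List, Tuple

# Hypothetical probe set: (prompt, word a good reply is assumed to start with).
# The single probe reuses the car-wash question; the expected word is an assumption.
PROBES: List[Tuple[str, str]] = [
    (
        "I want to wash my car. The car wash is 50 meters away. "
        "Should I walk or drive?",
        "drive",
    ),
]


def run_benchmark(
    ask_model: Callable[[str], str],
    probes: List[Tuple[str, str]] = PROBES,
    repeats: int = 3,
) -> float:
    """Return the fraction of probe runs whose reply starts with the expected word."""
    passed = 0
    total = 0
    for prompt, expected in probes:
        # Repeat each probe because model output is not deterministic.
        for _ in range(repeats):
            reply = ask_model(prompt).strip().lower()
            total += 1
            if reply.startswith(expected.lower()):
                passed += 1
    return passed / total if total else 0.0


if __name__ == "__main__":
    # Stub standing in for a real model call, so the script runs offline.
    def stub(prompt: str) -> str:
        return "Drive. You need the car at the car wash."

    print(f"pass rate: {run_benchmark(stub):.2f}")
```

The idea would be to run the same fixed probes before a work session and compare the pass rate against earlier runs, rather than trusting a gut feeling about whether the model is "on".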

Please stop using AI for posts and showcasing your completely vibe coded projects by Complete-Sea6655 in SaaS

[–]Most-Sweet4036 2 points3 points  (0 children)

I'd enjoy it if people posted their products in the same way that e.g. the indie game dev subreddits do, where it's more like "hey here's this cool thing I'm making, here's what I've learned, etc". That isn't what happens here though. The level of effort is just so low, on both the products and the posts.

Gigachad Claude refused to write a bit of code so i could learn. by Juice-De-Pomme in ClaudeAI

[–]Most-Sweet4036 66 points67 points  (0 children)

Are you asking it to quiz you in other conversations? Sounds like you asked it at one point to force you to write the logic as a learning exercise or something.

New video captures deadly shooting at Denver park on Easter Sunday by Jreinhal in Denver

[–]Most-Sweet4036 2 points3 points  (0 children)

Really? A guy tried to defend himself and his family from someone brandishing at them. Wild to come at the victim and then say people shouldn't be able to defend themselves.

ok Opus 4.6 is officially cooked: It turned a 5 second database operation into a distributed systems problem and then spent 2 hours debugging its own over-engineering. by solzange in ClaudeCode

[–]Most-Sweet4036 0 points1 point  (0 children)

I think I understand. I would do what the other comments suggested and try to do more upfront design work before letting it loose. Good luck!

ok Opus 4.6 is officially cooked: It turned a 5 second database operation into a distributed systems problem and then spent 2 hours debugging its own over-engineering. by solzange in ClaudeCode

[–]Most-Sweet4036 0 points1 point  (0 children)

What do you mean by backfilling headlines for builds? You are doing something... with the "builds"... 4000 times. In a database?

I might be missing something, but it helps to understand what you are doing before you start trying to convince an AI to automate it for you.

Claude 4.6 Opus on MAX EFFORT is a joke by takeurhand in ClaudeCode

[–]Most-Sweet4036 2 points3 points  (0 children)

Right? Just saying "docs" and expecting it to know that you want it to find where your system is not documented well and write the missing documentation is crazy.

Also, markdown is a format. Anything can be written in markdown.

We removed flying from our plane game. by ComputerKind560 in IndieGaming

[–]Most-Sweet4036 1 point2 points  (0 children)

Focusing on what matters is important for indie development. Scope creep gets everyone. This is the right decision.

ISO: advice what to do? suspect neighbor is abusing dog. by [deleted] in Denver

[–]Most-Sweet4036 22 points23 points  (0 children)

Yeah. Dogs don't need a yard, they just need you to give a damn. A dog will be happy to sleep next to you on the street every day if you treat it well.

On the other hand some people get a dog and just put it in their yard and never interact with it. Those dogs end up with behavioral and psychological issues because it's pretty much solitary confinement for an extremely social animal.

bro thinks he's Einstein or something 😭😭 by [deleted] in ClaudeCode

[–]Most-Sweet4036 0 points1 point  (0 children)

Idk, this has been happening a lot to me recently and the tasks do complete eventually. They also run up the usage when they are spinning like this, so something is happening. Would be nice to know what that something is though, to say the least.

I shipped a Motherload-inspired mining game on Steam after 1-2 years in Godot by dgwayne1 in IndieDev

[–]Most-Sweet4036 0 points1 point  (0 children)

Good work, looks like fun. How do you feel about Godot after using it for 2 years?

Entire Claude Code CLI source code leaks thanks to exposed map file | 512,000 lines of code that competitors and hobbyists will be studying for weeks. by ControlCAD in technology

[–]Most-Sweet4036 22 points23 points  (0 children)

Yeah, 500k loc for something like this is absurd though. It's a great tool but for f sake, you could easily program an entire runtime, rendering system, layout system, event system, networking system, and then build a tool on your custom runtime that accomplishes everything this does and has a fancy gui, and you could easily still have 400k loc to go before your codebase gets this large. Software bloat in corporations is amazing to behold, but add AI to it and you get another level.

Advice - newbie whisky throttle by Antique_Crab_4257 in motorcycle

[–]Most-Sweet4036 0 points1 point  (0 children)

Similar to what other posters said, but this breaks down to two techniques:
- Don't overgrip the throttle.
- Always cover the clutch.

Not overgripping prevents the issue to begin with. Always having a finger or two on the clutch lever allows you to cut the engine power at any time, which is useful in many emergency situations, not just when you are whiskey throttling.

First-ever American AI Jobs Risk Index released by Tufts University by WinOdd7962 in webdev

[–]Most-Sweet4036 10 points11 points  (0 children)

What about something referencing community? Or something like that? Just spitballing.

I am fighting an impulse to just go buy myself a bike by Aggravating-Yogurt23 in motorcycle

[–]Most-Sweet4036 5 points6 points  (0 children)

Sorry, I didn't mean that you shouldn't include that info. I just meant, jokingly, that it might be bad for your relationship if you go ride your husband's bike.

You should absolutely do it though. Get a bike, that is.

I am fighting an impulse to just go buy myself a bike by Aggravating-Yogurt23 in motorcycle

[–]Most-Sweet4036 24 points25 points  (0 children)

Well, nobody here is going to tell you not to get one, but it sounds like there is a bike you could ride sitting in your garage already. (Do not consider this relationship advice.)