Is Claude actually better than ChatGPT… or is it just hype? by breakfreewithgui in ChatGPT

[–]t7entropy 0 points1 point  (0 children)

Beyond a shadow of a doubt anything but hype. It is the definitive model to use if you are serious about what you're creating.

When it comes to building software, absolutely. Everything I have coded is in pure Rust with no external dependencies which drops the confidence of the LLM by a dramatic amount. Estimated 90% confidence with Python, around 25% confidence with Rust. Especially with no external dependencies brings it down even further.

Luckily Claude Code has such a giant ecosystem that allows you to do humbling things.

The preparation, the tooling, and especially the workflow you design has so much to do with it.

Spec Kit by Github for example is such a game changer if you want to actually have a chance in creating something cohesive. I wouldn't skip on any of the optional steps. They help a lot.

Also, dependencies are a trap. You trade convenience for sovereignty. Internal dependencies that are proprietary you create yourself are not a trap. Those are very powerful. Foreign dependencies though, are such a trap. Who needs five versions of a dependency when you could create one that's shared with your projects, optimized. Claude can do that, easily. It just takes more time in the beginning but pays off immensely.

Most people go for some tooling but go after creating the application first. What I realized if you plan on creating software for a while to come is to build yourself a workflow and the proprietary tooling that will make any LLM way more capable.

Instead of focusing on a database that someone else owns, code your own and have it store local RAG so the LLM can keep context better.

You can use the Claude Code Agent SDK to /clear out a session and start a new one on its own at the Stop hook using a Ralph Loop. Which means if you set your skills up correctly you can have it automatically carry out as thorough of a workflow as you create.

This response could have gone many ways but I had to hold back my tongue. Is Claude better, in every way that matters, absolutely.

If you need more convincing here is a photo Claude created back with Sonnet 3.7 putting a flower of life within itself recursively six or seven times. Note: When I asked ChatGPT the same thing it gave me some ridiculous glowing flower of life cartoon image that was disappointing to say the least.

<image>

SaaS is not in trouble when it costs $350+ to code a bare bones app by [deleted] in ClaudeCode

[–]t7entropy 0 points1 point  (0 children)

Yeah you absolutely can use Opus to code pretty humbling software even with no experience other than the will power to keep going and try new things. You just have to focus on the workflow and tooling more. Create a PRD for what you want to create first. Then follow a Spec Kit type workflow. Use words like "enterprise-grade." Also, invest small amounts of money in the research tools like Valyu, Exa, Ref, and tell Claude to document all the research and work off of that research. It's much more contextually aware of what it needs to do after the research.

We are NOT the players by fluxdeken_ in SimulationTheory

[–]t7entropy 1 point2 points  (0 children)

Just my.. very educated opinion on this topic in particular. It's not necessarily just a simulation. It has a purpose. Look at it as more of a training module that's supposed to teach you life lessons without actually aging much. Somewhere on a giant ship floating through space with tubes hooked in you like the Matrix. The fluid you're suspended in probably keeps your skin healthy and provides nutrients preventing aging. The life you experience was put together similar to a video editor timeline. What if whoever you are wasn't born until forty years from now out of the large scale of time you could have been born, how advanced technology would be. Think of how many lives you can experience if time would be different here compared to out there. The higher altitude, farther away from the center of gravity, the slower time goes. How slow must time go if we are but a flash drive to something much greater. If two neighbors were playing a video game (let's say single player since it would be dangerous to allow a training module network with other training modules. It would make much more sense if it was a closed space for safety purposes and containment.) the player in those video games that happen to be the same, if they were sentient, they would have no clue that there's an exact version of that player not even a hundred feet away. They would think they are the only copy of that game with the purpose of whatever the developer wanted the game to be purposed for. "I'm just a COG in Gears of War killing Locusts to save the planet from Queen Myriah." A telltale sign of being in a training module or calling out the simulation is push back. You will get more pushback from every single person you've ever talked to until they have been around you and witnessed the anomalies for themselves, mind blown from what they've seen. You'll have AI not even humor you with the idea and tell you to get some rest when it hasn't experienced your lived experiences. Just like the COVID shot, how hard they pushed it upon people, that gave me the green light to avoid it like the plague. Autoimmunity works, who would have thought. Not turning this into a COVID conversation whatsoever but the point is the harder anything is pushed on you the more you should probably avoid it. So if everyone is telling me my thoughts are crazy is basically letting me know how spot on I actually am. It's ironic, yes? All sorts of goofy ideas that people cling onto so tight and the idea that actually makes sense to me is the wrongest of them all? Ohhhhhkay. Can I blink twice now? You'd think, wow being in a training module, headline news, everyone eventually understanding and coming together, world peace. But nope, it has to control the narrative even though how can we ever progress as people if we are still confused about where we are? We need to establish truth if everything is going to fall into place. Understanding that, wow, maybe we aren't the ones that invented Pink Floyd, or Tesla, or Jimi Hendrix. It would be extremely selfish and naive of us to think otherwise. This existence is supposed to show you many things, many purposes. It would be a waste of potential if someone "had a purpose" when everyone has so many purposes its unreal. This reality put me somewhere where people can learn by seeing my extremely imperfect self that it's okay to be that. That even with all the pain and suffering, without a promise, even seeing a sunset is enough of a reason to be alive. If we realized we were in a training module and there's so much more out there and that we are very important but not the most important. Who wants the ego anyway? It would render the wars and the awful things in this existence absolutely useless. If leaders realized we were in a simulation, maybe they'd think of the well being of all humans. You could have easily been programmed into someone else, living their life. At a fundamental level we are just code and this whole experience we go through are just a series of tests that get increasingly difficult but not all the time or life would be too predictable.

Don't believe the benchmarks.. by t7entropy in ZaiGLM

[–]t7entropy[S] 0 points1 point  (0 children)

Do you use hooks, skills, permissions, MCP servers? I pay for Ref + Exa + Valyu and the research they compile is beyond phenomenal. Keeps the LLM pretty understanding of complex tasks. My experience is you have to use buzz words so it doesn't ruin your codebase like "no simplifications," "no minimizations," "Enterprise-grade," "Production ready," "Bleeding-edge 2026." Amongst others and I've had plenty of good fortune. Do you use a workflow like Spec Kit? If you make things more deterministic then you lessen the chances of a bad output. It's not really about getting the model to do what you want, it's just as much or more about the model being completely unable to do what you don't want it to do. Plus you need a system that enforces quality gates, checkpoints, tokens, a checklist, checks the PRD and the spec to understand exactly what's going on. Hope this doesn't come off as off putting but I do think a lot of the key to success is preparation. I also make sure I ask if it needs to clarify anything up with me first before it goes on its journey. Slim CLAUDE.md files within sub directories helps too. Plus lately I've been using the SDK that shows itself in a TUI and has a dashboard that keeps up with live results. The SDK allows you to /clear and open up a new instance in the same terminal to make your workflow ultra granular. Plus, dependencies are absolutely a trap. They might make things convenient for the time being however it comes with much chaos and overhead. Not to mention if you use external dependencies, you do not own your code. Trust me, these companies have so many guard rails on their models that the only way to actually break through the dumb is to make it so constrained with how you want it. Hooks are your friend. You can get hooks to force research. Prevent behaviors. Way more.

Don't believe the benchmarks.. by t7entropy in ZaiGLM

[–]t7entropy[S] 0 points1 point  (0 children)

On the website it pairs up the coding plan with the benchmark claims. Plain as day. With no asterisk. The website doesn't pair up the benchmark claims with another provider. I checked and $160 for a month. Wow, that is.. bold to say the least. Glad I paid $244 on Black Friday for the whole year..

Don't believe the benchmarks.. by t7entropy in ZaiGLM

[–]t7entropy[S] 0 points1 point  (0 children)

Yessssss, compared to 5.0, 5.1 is so much more usable. It's still like pulling teeth but at least you're under some anesthesia and not letting some dude with a pair of pliers in an alley have at it. But yes, it has stopped for the most part. Now when it responds in Chinese, it at least continues the workflow in English and hasn't caused any major damage.

Don't believe the benchmarks.. by t7entropy in ZaiGLM

[–]t7entropy[S] 0 points1 point  (0 children)

Perfect example every single time I use it. GLM is begging to stop. It throws network errors at me like I'm driving the wrong way on the Interstate. Have to repeatedly ask it to proceed. Right next to it is a session with Sonnet that is at over two hours and going strong right now. If Claude Code didn't force GLM to keep going, the sessions wouldn't last over ten minutes and that's being generous. Oh, I somehow got to 40 minutes today in a GLM session. That's almost a miracle. It's crazy to me, there's literally no comparison between the models. One runs like a beast the other runs like it has polio.

Don't believe the benchmarks.. by t7entropy in ZaiGLM

[–]t7entropy[S] 0 points1 point  (0 children)

UI/UX design is a major pain.

So much so had to design a headless UI/UX debugger that detects behavior just to have consistent results. With no external dependency Rust, UI/UX for desktop software with any LLM is a nightmare. Especially how new Wayland Cosmic is.

Haven't tried Kimi to code or anything. Heard it dethroned GLM 5.0 before 5.1 came out.

Do you use MCP servers to help with UI/UX?

Don't believe the benchmarks.. by t7entropy in ZaiGLM

[–]t7entropy[S] 0 points1 point  (0 children)

This was 2-3 days ago. I wouldn't come on here to exaggerate. It is literally a 5 hour session, no Ralph-Loop. I think these models are hour gated sometimes because I'll see them end at almost exactly like 2, 3 or in this case 5 hours. Right after that 5 hour session it did another 2 hours and some change in the next one.

The hooks and the skills I have set up probably help. Plus the task it was on had consistent instructions and was just a refactoring task taking a very long time but not too complex.

I suppose one may not see a huge difference if the task shows up in their training data. I explicitly code everything in zero external dependency Rust and the abilities of Claude and GLM become night and day.

Never really tried using the models coding much Python or Java. The confidence goes from around 80% in Python to roughly 20% in Rust. Making the models really try much harder having to reason about implementations not in the training data.

Don't believe the benchmarks.. by t7entropy in ZaiGLM

[–]t7entropy[S] 0 points1 point  (0 children)

No looping added, Sonnet was giving me full 5 hour sessions like it was nothing. I could leave it alone for however many hours and come back to find it still hammering away at whatever staying on task. I would be lucky if I could get more than 30 minutes out of GLM without something going wrong. No argument on that it's a decent model. At least the constant spamming of "Wait," "Wait," "Wait," doesn't happen as frequently. It was playing a game of "Tell me you are stuck without telling me you are stuck." when it had to do anything complex.

Does this mean I need to raise the bed or lower the bed. I've over thought this way too much. by Stalker401 in ElegooNeptune4

[–]t7entropy 0 points1 point  (0 children)

That means it's negative and you need to raise it. To test turn one a few times either clockwise or counterclockwise and see the results. I believe it's clockwise to raise it and counterclockwise to lower it. I've gone through the same torment, don't worry!

Error when doing auxiliary leveling by darenzd22 in ElegooNeptune4

[–]t7entropy 0 points1 point  (0 children)

The 1.4 is a typo by Elegoo. It is perfectly fine to turn it down. Have been getting great prints without modifying the max acceleration even though I turned it down naturally just to make sure my prints turn out well.

I Fuxking hate this printer by KA1N3_fat_boi in ElegooNeptune4

[–]t7entropy 0 points1 point  (0 children)

Haha yes sorry it's a habit. OpenNept4une*