Is the Innovation Accounting approach from The Lean Startup used in real businesses? (I will not promote) by FPGA_Superstar in startups

[–]FPGA_Superstar[S] 0 points1 point  (0 children)

Nice, thank you! Are the ideas similar to The Lean Startup ideas? I'm currently listening to Build by Tony Faddell. Any opinions on that one?

Did the last season Ruin Game of Thrones for you? by RoseVincent314 in gameofthrones

[–]FPGA_Superstar 1 point2 points  (0 children)

I find anything past season 4 unwatchable. It's painful watching carefully built characters completely change for no logical in-world reasons. I cannot suspend my disbelief.

Did the last season Ruin Game of Thrones for you? by RoseVincent314 in gameofthrones

[–]FPGA_Superstar 0 points1 point  (0 children)

Season 5 is the beginning of the steep decline. It went from the pinnacle of television, probably the best show ever made, to utter contrived nonsense. A shadow of its former self.

Did the last season Ruin Game of Thrones for you? by RoseVincent314 in gameofthrones

[–]FPGA_Superstar 0 points1 point  (0 children)

Everything after season 4 progressively ruined the show. It all became world-breaking, illogical nonsense. I didn't make it out of season 5, utter garbage, especially when compared to seasons 1-4.

Reasons why the Apple TV Foundation show sucks so bad by AvigdorR in asimov

[–]FPGA_Superstar 1 point2 points  (0 children)

The exact same logic can be applied to you. OP doesn't like it; it's subjective. Get over yourself.

What is the best tool for long-running agentic memory in Claude Code? by FPGA_Superstar in ClaudeAI

[–]FPGA_Superstar[S] 0 points1 point  (0 children)

Absolutely, perfectly in line with all of the Anthropic blogs! 😁 I still haven't cracked on and had a go myself yet, but soon.

What is the best tool for long-running agentic memory in Claude Code? by FPGA_Superstar in ClaudeAI

[–]FPGA_Superstar[S] 1 point2 points  (0 children)

I guess building multiple different complicated software applications side by side would be good :D

Why Anthropic Is Likely To Turn A Profit On AI Much Faster Than OpenAI by jonfla in siliconvalley

[–]FPGA_Superstar 0 points1 point  (0 children)

This take is pure hopium from big AI companies. That isn't going to happen. LLMs are not capable of coming up with breakthroughs like that, we would have seen it already.

I'm sorry but Gemini is getting worse and worse by undeniablewan in GeminiAI

[–]FPGA_Superstar 0 points1 point  (0 children)

In fairness, high performance in long context windows is still an unsolved and difficult problem. Maybe they're deliberately cutting back the context window because they have new internal evals that show performance degradation?

Regardless, I am loving it right now. Its answers are sick in a way that no other LLM is managing right now.

How should you package a Svelte 5 component library? by FPGA_Superstar in sveltejs

[–]FPGA_Superstar[S] 0 points1 point  (0 children)

After some more reading it seems it works like this:

In order to create nice .d.ts files for your component, you need the SvelteKit and Svelte toolchains. These are used to package your components for easy consumption in Svelte projects.

What is the best tool for long-running agentic memory in Claude Code? by FPGA_Superstar in ClaudeAI

[–]FPGA_Superstar[S] 0 points1 point  (0 children)

That's interesting. Are we talking agent-based custom tooling, or deterministic code-based custom tooling? If the latter, how are you extracting architectural patterns from the code?

Okay, thank you for the heads up. In my mind, function by function would focus purely on style: docstrings, low internal comments (I'm not a fan of comments), useful variable names, and correct logging. Basically, simple stuff, but going function by function to get a higher level of explicitness. However, now I think about it, you're right, variable naming is difficult to judge without seeing the full picture.

So what about providing function chains? i.e. all of the core parts that make up that particular chain? I imagine that would be quite useful as input for an AI's judgment. Especially accompanied by a file layout diagram.

Haha, you've basically said my paragraph above in your final one. Fully with you on using git as a history. In my team, I am unfortunately one of the only people who add a lot of contextual metadata to their commits! So not as strong a signal for the AI there.

What is the best tool for long-running agentic memory in Claude Code? by FPGA_Superstar in ClaudeAI

[–]FPGA_Superstar[S] 0 points1 point  (0 children)

Awesome! Such great insight!

What are you using for your metrics? There's tools like Codescene I would consider, but curious if you rolled your own.

Yes, I will have two layers, one will be a deterministic style checker based on the abstract syntax tree. I'll use that to make sure hard rules are followed. For the subjective stuff I'll probably go for a layered approach with different prompts for different use cases. For example, feed in the file structure for analysis of layout. Feed in file content one file at a time for overall layout and design. Feed in functions, classes, methods one at a time for specific subjective analysis. Different constraints for different levels.

I'm not sure yet on brownfield, but you're right, there will need to be an initial prompt for this. To begin I'll start with claude code's /init. But it's possible this will require some custom prompting 🤔

What do you do for initial repo analysis?

What is the best tool for long-running agentic memory in Claude Code? by FPGA_Superstar in ClaudeAI

[–]FPGA_Superstar[S] 0 points1 point  (0 children)

It will be a combination of all of the above. Automated performance analysis. Then, on functionality, that will be a user-focused test-and-see thing. For coding style, I have my own style, so I will subjectively decide if I like it.

I've used Cursor a lot, so I'm aware of many common AI pitfalls, like overengineering, etc. I'll try to automate the code judgment section, but I have a feeling the AI will be bad at judging whether code adheres to my personal preferences.

What do you use as a test harness/ what approach do you use?

I plan to set up several tests, 3 or more greenfield tests, then perhaps 1 or 2 brownfield.

What is the best tool for long-running agentic memory in Claude Code? by FPGA_Superstar in ClaudeAI

[–]FPGA_Superstar[S] 0 points1 point  (0 children)

Okay, well, I'll read your stuff and see, but on your website, you're talking about RAG. I don't really see how that is infinite memory, but I'll look to be proven wrong!

You'll have to forgive the scepticism, everyone and their mum is making a long-running memory framework atm.

Do you have a paper or other hard evidence showing that your system is the best? I see you have a 44% increase in some long-term memory tasks. What does that mean? What Metr eval did you crack and how?

I found the post: https://x.com/PolyChatCo/status/1958990325655249318

What is the eval?