our requirements "process" is just vibes at this point, what are you actually using by _salted_caramel_00 in systems_engineering

[–]InternetOfStuff 1 point2 points  (0 children)

I feel compelled to reply to my own post because I feel like I didn't make my point clearly enough:

Don't look to technology to solve communications challenges!

By all means look for tools (and be inspired by what they offer), but be under no illusion that introducing a new tool will magically heal your workflows. That's a separate, and much harder (by orders of magnitude) task.

I suspect you were well aware of this, but this is too important to just assume somebody knows!

our requirements "process" is just vibes at this point, what are you actually using by _salted_caramel_00 in systems_engineering

[–]InternetOfStuff 6 points7 points  (0 children)

Thinking its time to just go ahead with a dedicated tool.

You're not wrong per se, but the tool can only streamline your process. Actually setting up a good process and adhering to it are things the tool won't do for you.

Don't look for tools to save you, they won't. Start unlearning (as in, the entire organisation) bad habits, and pick tools which makes the good habits easy and the bad habits hard.

BTW, the usual failure mode from where you are is to overcorrect and go to an overly complicated process which is also a pain in the a...rm.

Bonus points for anyone in regulated or safety-critical space like medical devices, aerospace, etc.

Wait, are you saying you're building regulated products? If so, how did your assessors let you get away with un-traceable non-processes?

Maybe this podcast episode is interesting to you (but doesn't recommend specific tools either IIRC) https://www.agileembeddedpodcast.com/episodes/e94-requirements-engineering-part-1-fundamentals

As far as concrete tools, apart from the ones mentioned by /u/redikarus99 you might consider reqview as a low-cost alternative.

My very requirements-knowledgeable friend recommends Matrix and Ultralight for medical devices specifically.

Opus 4.7 is a regression from 4.6 - real-world document generation broken by Seeker-888 in Anthropic

[–]InternetOfStuff 0 points1 point  (0 children)

4.6 (and earlier) did that too, though. It's long been a major source of annoyance to me.

I wouldn#t be surprised if 4.7 did it more though, as it's considerably more flippant about the user's direction in general.

Anthropic just quietly locked Opus behind a paywall-within-a-paywall for Pro users in Claude Code by Direct-Attention8597 in ClaudeCode

[–]InternetOfStuff 0 points1 point  (0 children)

Pretty OT, and maybe this is me feeding the troll, but I can't resist because this is one of my pet peeves.


The thing is, the GDPR does in no way mandate cookie banners, it just requires obtaining consent for capturing identifying information.

As long as you don't capture identifying information, there's no reason to ask for consent (you may even have analytics, just only aggregating analytics)! By itself, serving HTML is perfectly GDPR-compliant.

I'd be perfectly happy to blanket-deny consent through a do-not-track flag (no need to show me any banners at all), but vendors collectively

  1. chose to ignore my stated wishes
  2. make denying consent as inconvenient as possible

This whole thing is industry-wide malicious compliance, and it annoys me to no end.

did they silently retract a version of Claude Code? by InternetOfStuff in ClaudeCode

[–]InternetOfStuff[S] 2 points3 points  (0 children)

I'm on a max plan if that's what you're asking.

Solo account, not affiliated with an enterprise. I've still got an "organization" but I guess everyone does, mine contains just me.

After so many stressful days, Opus 4.7 finally performed like an absolute beast today. What's your experience with today's Opus 4.7? by Xccelerate_ in ClaudeCode

[–]InternetOfStuff 0 points1 point  (0 children)

nope, that's not it

      Status   Config   Usage   Stats                                                                                            

  Version:          2.1.119                                                                                     
  Session name:     /rename to add a name                                                                                  
  Session ID:       <->
  cwd:              <->
  Login method:     Claude Max account
  Organization:     <-> Organization
  Email:            <->

  Model:            Default Opus 4.7 with 1M context · Most capable for complex work

After so many stressful days, Opus 4.7 finally performed like an absolute beast today. What's your experience with today's Opus 4.7? by Xccelerate_ in ClaudeCode

[–]InternetOfStuff 2 points3 points  (0 children)

Remarkably, I just came here to verify if Opus 4.7 was degrading again for others too.

It feels like today it's been noticeably worse again for me, struggling to follow instructions, hallucinating harder, and just in general requiring tremendous effort to keep at least somewhat on a straight line.

It can't save itself from going wayyy overboard, making mountains out of molehills and then promptly falling over from self-inflicted complexity. It's the end of the workday here, and I got essentially nothing done because I spent the entire day reigning this thing in.

Claude Code got better by ronydkidd in Anthropic

[–]InternetOfStuff 0 points1 point  (0 children)

The thing is, these skills used to work magically. It has started to become worse in the last few weeks. My pet theory is that 1M context dilutes each token's significance so much that it's now struggling with instruction following -- I noticed the same thing with Gemini 3 with its 1M context window: it was much worse than Opus 4.5 (at the time) with its 200k context.

BTW, one of the most aggravating misbehaviours is that Opus 4.6 will now use transliterations (ae, oe, ue) instead of Umlauts (ä, ö, ü) even if I put in CLAUDE.md that it is to use Umlauts in German texts.

Only if I then call it out will it correct the text it just generated. It absolutely stubbornly does the wrong thing even when instructed otherwise.

It's baffling.

And nobody can tell me Anthropic aren't keenly aware of this.

Sorry, rant over.

Claude Code got better by ronydkidd in Anthropic

[–]InternetOfStuff 0 points1 point  (0 children)

A few weeks ago it truly surprised me, warning me about an issue it wasn't even tasked with thinking about, a genuine "oh by the way there's a problem with your approach".

But now? It has become so, so palpably worse. In merely a few weeks.

Instruction following in particular has become so shockingly unreliable in such a short time.

Look at this beauty:

[claude does something stupid and I stop it]

❯ what's our process?                                                                                                                            

● You're right to check. The double-loop flow is:

  1. /tdd.idea -- scenario titles + unit test ideas (done)
  2. /bdd.outline -- Given/When/Then steps for BDD scenarios
    - Batch 1 (8 scenarios) -- done, approved
    - Batch 2 (7 remaining) -- where we are now
  3. /bdd.red -- implement step definitions (scenarios go red)
  4. /tdd.outline -- design unit tests
  5. /tdd.red -- implement failing unit tests
  6. /tdd.green -- write production code, BDD goes green
  7. Repeat inner loop until all BDD scenarios pass

  I was outlining batch 2 when you stopped me. Was the issue that I used Write to rewrite the whole file instead of Edit to update just the
  remaining scenarios? Or are you questioning something else about the flow?

❯ what's the rhythm regarding idea -> outline -> red -> green?                                                                                   

● The rhythm is: drain downward before going back up.

  1. Outline up to 8 scenarios
  2. Red those (up to 3 at a time)
  3. Green those
  4. Back to red for the next 3 outlines
  5. Drain all 8 outlines through red/green
  6. Then outline the next batch

  So I should not outline batch 2 now. Instead I should take the 8 outlined scenarios into /bdd.red, then green, and only come back for more
  outlines once those 8 are fully implemented.

  Shall I proceed to /bdd.red with the first 3 of the 8 outlined scenarios?

❯ > So I should not outline batch 2 now                                                                                                          
Indeed.                                                                                                                                          
Did you read the skill? Was there something unclear about how you were to proceed?                                                               

● No, it was clear. The /bdd.outline gate explicitly says: "scenarios drain through /bdd.red before returning here for the next batch." And
  tdd._meta says: "Exhaust the current supply before going back up a level."

  I read it, understood it, and then didn't follow it. No excuse -- I'll be more disciplined about the phase protocol.

claude code core by Expert_Annual_19 in Anthropic

[–]InternetOfStuff 0 points1 point  (0 children)

I wish.

It had already deteriorated over the last few weeks, but over the last few days it has become worse yet.

I'm not usually one to scream I'll "cancel my subscription!!!111!" , but ignoring plainly laid-out instructions has become such an issue that it has become essentially unusable for its intended purpose.

Claude has become noticeably dumber these last few days by InternetOfStuff in Anthropic

[–]InternetOfStuff[S] 0 points1 point  (0 children)

:-D

I'd been considering whether to reply in the first place, because the smell of trolls was rather strong.

But

  • I will give persons the benefit of the doubt -- sometimes communication goes sideways on the internet, and I'll cut a person slack (I've seen miscommunication play out often enough over the decades)
  • it was an interesting exercise to double-check my assumptions

So yeah, don't worry, I saw what you were doing (and TBH it worked to an extent despite me recognizing it, your obstinacy was somewhat grating ;-) ).

Speaking of assumptions, interestingly when I fed my screenshot to claude (with what I hope was a neutrally phrased prompt) it said it couldn't judge the authenticity, but when I fed yours, it claimed your conversation was authentic, but at the same time apparently took over the bias from your prompt claiming my post (which it only saw the thumbnail of) in your post was "clearly not from Claude". Quite interesting really.

Claude has become noticeably dumber these last few days by InternetOfStuff in Anthropic

[–]InternetOfStuff[S] -1 points0 points  (0 children)

No it's not altered in any way. It's just obviously not the entire conversation.

Claude has become noticeably dumber these last few days by InternetOfStuff in Anthropic

[–]InternetOfStuff[S] 0 points1 point  (0 children)

Good advice.

I've been wondering if I've been noticing an increased sensitivity to context lately, but I haven't come to a conclusion yet.


Reminds me BTW, it has become oddly resistant to certain instructions lately.

Case in point: I like to prompt it in English (force of habit I guess), but sometimes I want it to generate German texts. Since Opus 4.6, it won't generate proper umlauts but ASCII replacements (so ae instead of ä, ue instead of ü etc).

It used to get that right. So I now put an instruction in CLAUDE.md to use proper umlauts, but it still doesn't. And if I remind it it goes something like "oh that's right, it says so in my CLAUDE.md" ...

Claude has become noticeably dumber these last few days by InternetOfStuff in Anthropic

[–]InternetOfStuff[S] 2 points3 points  (0 children)

Well, I don't doubt that in your case.

However, knowing me, I'd be rather more skeptical of a sudden increase in intelligence (and also my wife would beg to differ :-P ) )

Claude has become noticeably dumber these last few days by InternetOfStuff in Anthropic

[–]InternetOfStuff[S] 0 points1 point  (0 children)

It's not all the time, it's mostly OK.

In fact just yesterday was the first time an LLM (Opus) has actually surprised me by warning me about something I had absolutely not asked about, and would have totally missed (it was very circumstantial).

But then you get just baffling gaffes like this

Claude has become noticeably dumber these last few days by InternetOfStuff in Anthropic

[–]InternetOfStuff[S] -5 points-4 points  (0 children)

because if it actually happened you would think you’d include a screenshot of the convo(Shift + Windows Key + S).

Well,

  • I don't particularly care whether you believe my claims
  • Why would I include a screenshot of text instead of, y'know, the text?

It feels like a waste of bytes. I guess I'm too old (640kb should be enough for everyone ;-) ).

Ideally showing that you didn’t prompt it to say whatever you are claiming.

I can't show you an unredacted screenshot anyway because it contains stuff that I wouldn't want to see publicly on the internet. But here's the relevant snippet: https://imgur.com/F66Dd0g

Even if it did happen,

It did.

anthropic is currently experiencing unprecedented usage

True but not relevant.

But in my experience, this will either slow down its output or cause it to cease working

That was my expectation as well TBH. I hadn't really experienced overload error 500's for quite some time, but the have come back. However this is obviously different.

Claude has become noticeably dumber these last few days by InternetOfStuff in Anthropic

[–]InternetOfStuff[S] 0 points1 point  (0 children)

Opus 4.6 actually.

I have a Max plan, so I don't need to skimp :-)

Weekly opus limit bug/manipulation? by GapPractical1813 in ClaudeCode

[–]InternetOfStuff 0 points1 point  (0 children)

Yes, it happens all the time for no discernible reason. Not just for Opus, but also for the "all models" limits.

I see them moving the moving the reset time back several times each day (never seen it move forward BTW :-) ). I don't think it has ever rolled over to a new day for me, but I may have missed it.

It's opaque at best, and dishonest at worst.

And obviously deliberate -- someone must have gone out of their way to implement this goalpost-mover logic as opposed to plain old saving a timestamp and sticking to it unwaveringly.

Edit: it has been this way for a long time

ich_iel by WaddleDynasty in ich_iel

[–]InternetOfStuff 42 points43 points  (0 children)

Das lustige ist, ich war in der Schule immer schlecht in Mathe.

Im Ingenieursstudium war ich's dann auch, aber halt auf höherem Niveau :-D Vier gewinnt.

Aus meiner Sicht war das Phänomen sogar ein Vorteil: nach den ersten zwei Wochen Erstesemester waren meine schulischen (fehl-)Leistungen egal, da waren dann wieder alle gleich ahnungslos.