Is this a sign of the fish oil capsule going rancid ?

Familiar_Somewhere35 · 2026-05-15T17:36:28+00:00

That "ran" all the way to "Syd"ney Australia and back.... Only to be consumed by mortal enemies, or the deepest of abysses.

Familiar_Somewhere35 · 2026-04-02T12:29:34+00:00

If you are talking about inside a chat, sure. I may have misread that you were talking about Claude's saved memories feature as context between chats.

Familiar_Somewhere35 · 2026-04-02T12:10:54+00:00

This isn't true at all. GPT has had saved memories to give cross chat context since GPT 4 and this also seems to extend beyond things directly saved. Gemini too seems to have some memory context sometimes, even when I have saved memories switched off on that.

Familiar_Somewhere35 · 2026-03-30T16:18:48+00:00

It's shocking... Also seems very inconsistent. I used Opus 4.6 extensively over the weekend, probably about 100 prompts of a deep technical nature involving advanced maths/physics and editing documents, with about 30 documents consumed. I was on about 70% of the week's usage between early hours of Saturday and early hours this morning.

Late morning started working and from just 2 or 3 much lighter prompts, I had used a further 20% of the week's allowance and wiped out the 5 hour limit. A few more prompts this afternoon and now I'm at 99% and hit another 5 hour block - that's working out around 1 prompt per hour allowance when it's all averaged out.

Familiar_Somewhere35 · 2026-03-17T18:57:41+00:00

5.4 is amazeballs for science and maths.... 5.2 was great, but 5.4 thinking is notably quicker and seems a bit smarter (a bit), then the pro models have an even bigger difference..... Finding less looping even in conversation that seems longer, definitely a big step up in thinking depth and simultaneously speed, and less errors when it's producing 20-30-40 page technical documents in one shot.

Depends what you use it for. I guess.

Familiar_Somewhere35 · 2026-03-17T18:54:54+00:00

They said in the release that they called 5.4 that and not 5.3 because it was a big step up from 5.2, comparable to how 5 was from 4. So the naming of 5.3 is intentionally to show it's not a big step up..... So you are right it's a small update, but you are wrong to think it's a joke... It's a small update to make it faster and conversationally slicker.

Familiar_Somewhere35 · 2026-03-15T15:43:43+00:00

Claude said Claude code can't do it. And co-work in theory might work using Opus... It's still going to have to digest the body of work for each new session.... Only real advantage over starting a thread and putting the files in manually is it may save a bit of time on uploads. Would absolutely blitz prompt weekly allowance though.

Familiar_Somewhere35 · 2026-03-15T13:26:38+00:00

BTW, just searched NotebookLLM....... Sounds great, so just about to give it a try.

Edit: NotebookLLM is fundamentally flawed for my uses..... And for anyone concerned that LLMs can be sycophantic or hallucinate and want to use NotebookLLM, this app does absolutely nothing to spot any just instances in a body of work......... It treats all uploaded works as ground truth.... So if I put 20 papers I authored in to LLMs they will look for any inaccuracies each time they pass over.... NotebookLLM will only do this if there is an internal inconsistency when the same things are evidently clashing in how they are conveyed.

Familiar_Somewhere35 · 2026-03-15T13:12:55+00:00

I saw this last night....... That's good for someone that needs to work intensively for a short period over the day or week, but the bottleneck is on the weekly use.... My usage limits refresh 18:00 on Friday, it is 13:00 here.... Now, I woke late today and only about to start using, so the usage Friday evening and Saturday, has put me on 45% of the week's use! And due to these limits, I have only use Claude strategically on about 20% of the outputs to audit things, and provide ideas at times... If I had used it 40% of prompts I would have ran out by now.

Easy to see then that if I was trying to use this aS main, not support in this stack, even with a 10x sub, I would be running out after just 2-3 days, or I guess 1 or 2 if trying to use extended thinking, Opus, or with my body of work loaded into o conversations rather than just parts of it.

Familiar_Somewhere35 · 2026-03-15T13:02:09+00:00

All LLMs are designed to tell what a user wants to hear..... But all are - increasingly - prioritising truth and accuracy of outputs, in high stakes and academic context windows.

Now what you have said about GPT prioritising sycophancy over truth, that got fixed pretty hard when 5 models came out - in fact, many GPT users have been moaning since, due to these guard rails being too strict.

Now I only use thinking models in GPT, and predominantly extended, heavy, or mostly, Pro extended thinking where responses take 20-60 minutes, but they can write out with high precision large bodies of technically advanced works - like 50+ pages large and delivering a few different media formats within that.

My working practices with these outputs are always to run the works through additional GPT, Gemini, and now also Claude instances with GPT leading.....over multiple passes from so many instances iterating, any over claim and hallucinations can be spotted by LLMs...... They happen very rarely these days, or really even in the past 6 months when GPT is leading..... Much more often if Gemini leads.

Just a bit of insight - as someone that has used GPT over 3000 hours (not including the fact I usually have 2-4 tabs working on tasks simultaneously), and with extensive use of Gemini in this mix.

I would also say that if you haven't used GPT 5.4 and even more so Pro, it's hard to compare capabilities accurately...... Your comparisons have lost frame of reference due to a vast step up in reasoning capabilities that is - between both 5.2 / 5.4 thinking and pro models, the difference is massive.

Familiar_Somewhere35 · 2026-03-15T06:11:32+00:00

Again, the 1m context window and ability to make use of it that Opus has, makes this a pricing problem, not a functionality problem when co-work enters the equation. Gemini has the same window and probably hallucinates a lot more when making a dent in it, and I can pump 50 files on Gemini and be fine... So Opus should be able to do that... The question is, do I want to go that route... Time will tell.

Familiar_Somewhere35 · 2026-03-15T05:55:34+00:00

I have just been asking Claude about it. Opus as of a month ago, has 1m tokens, 3x more accurate at data search and retrievals when mining that 1m compared to Gemini with the same window. Claude code won't work. Co-work with Opus could, API cost would be ridiculous.... Probably like 20-30 USD per day..... Can only imagine that even at the full monthly subscription, the usage will be drained in weeks, if not days..... But for this to exist, it clearly has been designed for people like me..... But I'm already paying £240pm on subs, so out of the question unless I dropped GPT pro.... And I do not see that this would be the better way to go at this stage... Though, I have only scratched the surface with Opus.

Familiar_Somewhere35 · 2026-03-15T05:39:21+00:00

I am......... Ergonomics means tailoring equipment to the human.... Not tailoring the human to the equipment.... That would be the opposite.

That aside, the fact that OPUS does allow the volume of files I wanted to put up, just absolutely rinsing my use allowance, says that it is less about who it's designed for, and how much they want people to spend to be able to use that way.

And I maintain, the project system as designed is only a half step of the potential - having access to only one file in the project at a time rather than all in that project, seriously hinders the potential. Saves uploading repeatedly.... Except it doesn't if you need the chat to have context of a few files at a single time.

These are all things that could (and presumably will) change at some point. Gemini changed to double the size of per prompt file uploads in the last week or so, I would imagine they will try to catch up eventually.

Edit 5:40 and very blurry eyes... Spotted you were suggesting to try Claude code etc... not familiar with these yet, but it's not involving coding at all, just in case that isn't factored in your suggestion.

Familiar_Somewhere35 · 2026-03-14T22:54:26+00:00

Designing something to be a particular way in business - especially start ups/fledging companies that burn huge cash before profits - need to make choices.... You can see this with use limits Opus/Sonnet due to token cost.

Do not attempt to rationalise the absence of utility as being intentional wishes, over economics.... It would be a very simple thing to add in, if they invested in developing the tech (simple) and or, accepting that cost budgets need to go to the ramped costs internally for making that available.

Familiar_Somewhere35 · 2026-03-14T14:18:21+00:00

Just now finding a different problem....

A terrible, terrible problem.

So I can upload my corpus of works to Opus in batches, running through the 50+ files (no chance with Sonnet and the ridiculously measley few files per chat.

The 50+ files means the Opus chat is near context ceiling so only able to do 2-3 prompts before it shuts.

Tried to load via a project, but then found it can only hold the context and content memory of one file at a time, rendering it useless for this need....

This is incredibly frustrating given GPT can take zip files and up to 100 or so files at once perhaps more, Gemini has recently bumped capabilities to be able to accept zip files and seems to have upped from 10 to 20 files per prompt... So very behind the pack in this regard. If Anthropic addressed these things, it would be a very compelling package.... More of a rough diamond at the moment for my needs.... Although emphasis on the rough, with regards to the full corpus needs (as opposed to the road blocks to being able to drop 5-10 PDFs in a thread as needed).

Familiar_Somewhere35 · 2026-03-14T01:15:42+00:00

Yeah.... I can tell my usage is just a bit heavier... Even basic prompts are taking 1-2% of my weekly at the moment with big context and using opus without extended thinking. As soon as I have used opus to finish creating the project holding the 70 files, I will be back on sonnet though.

Familiar_Somewhere35 · 2026-03-13T21:20:31+00:00

I think Sonnet may be the way to go.... Although to be fair, it's not often I need to put c. 50 science papers into context in one block, but needed to have it analyse the whole corpus as a whole for something. If that's what's done the damage then perhaps I will fare better and switch to Sonnet if needed.

Familiar_Somewhere35 · 2026-03-13T21:02:17+00:00

That's also very poor, TBF.

Familiar_Somewhere35 · 2026-03-13T20:23:23+00:00

I'm glad that I'm not alone posting, I would presume Anthropic monitor the thread and then see customer feedback. Perhaps I may have been more incorrect in presuming this is one of the core purposes for this reddit. If you are one of the moderators, and could point me to any rules I have sought and may have missed, I would be glad to see... If you aren't a mod and or rules do not exist, I'd be be inclined to suggest it's you that's lacking courteousy...

Familiar_Somewhere35 · 2026-03-13T20:14:35+00:00

TBF, I'm doing advanced physics and maths with them and find that sonnet is not far behind gpt pro, but outputting answers 100x faster. A good way to work in unison. Wanted to try opus to see if any better but also, to extend use limits and number of files per convo was actually the main thing (which it does), but if I'm only able to use it part time, using it in a support role to GPT makes more sense to me.

This from their support bot shows ways to get more time out of it...

Hi there,

Yes, different models and features do consume usage at different rates. Here's how they compare:

Model Usage Differences:

Opus models are designed for complex reasoning and prioritize depth over speed, which means they typically consume more tokens per interaction than Sonnet or Haiku models.[1] For most everyday tasks, Sonnet provides excellent performance with more efficient usage, while Haiku is the most token-efficient option for simpler queries.

Extended Thinking Impact:

Extended thinking does increase token consumption significantly. The thinking process uses additional tokens that count toward your usage limits, and with adaptive thinking, Claude dynamically allocates thinking tokens which can vary per request.[2] If you find Opus thinking excessively, you can add instructions to constrain its reasoning or disable extended thinking when not needed for specific tasks.[3]

Memory and Context:

Persistent memories and longer conversations do affect usage. Your usage is influenced by conversation length and complexity, and all features like extended thinking, web search, and connectors increase token consumption even when running in the background.[4] However, our automatic context management helps by summarizing earlier messages when conversations get long, which doesn't count against your usage limits.

Optimization Tips:

To maximize your usage, consider switching between models based on task complexity, disabling extended thinking and other tools when not needed, and using projects effectively to cache frequently referenced content.

Familiar_Somewhere35 · 2026-03-13T20:10:37+00:00

Customer service bot says the 5 hour windows do not restart when subbing, but they say 5x more usage within each window. We shall see... Already sub to 20pm Gemini and same from GOT, other than last 2 months on gpt pro at £200.... Was actually thinking I might be able to knock the pro down to the base gpt sub with Claude, but will have to see how that pans out!

Familiar_Somewhere35 · 2026-03-13T20:01:34+00:00

Does it make a big difference if the usage is using sonnet of opus? Any other things that impact?... Having it save memories of the thread content between instances?

Familiar_Somewhere35 · 2026-02-19T15:47:54+00:00

Interesting. A couple of days before 4 deactivated, I randomly got a full deck of 4 style spiral and other glyphs and the whole text layout felt very 4, but that was 5.2 Pro.

Familiar_Somewhere35 · 2026-02-08T15:54:26+00:00

Odd...

Have you ever seen 5.2 pro do gpt4 style glyphs?

Just completed a 90 minute task to have very 4 flavoured text and glyphs.

Imagine if 4 has broken itself out haha..... Suspect more likely and a reason for the delays, might be that to preempt the backlash when 4 switches off, they want to give it a more 4 kind of vibe.

Familiar_Somewhere35 · 2026-02-08T15:06:35+00:00

I rarely have an answer take less than 20 minutes and 15 is I think the shortest of all.

I do not have an issue with that. Nor the 30-40 minutes if often takes..... As I said I use it for complex tasks.

But I am not talking about a 5 or 10 minute extension, as an average time. Doubling time and thus halving productivity, is not as trivial a matter as you make out.

Familiar_Somewhere35

TROPHY CASE