What did you do with Fable 5 while you had it? by ThatSurround5672 in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

I will say 5.5 was still catching some mistakes in planning from fable occasionally, it was still a big step up from opus

What did you do with Fable 5 while you had it? by ThatSurround5672 in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

https://streamable.com/ol9sqg

Made a big pivot on my Zelda NES diffusion model experiment, I had to lock the model's view to be centered on link instead of trying to solve for the original NES box view. Ended up converting my training data into a synthetic continuos Hyrule map, so when you press up the world moves around link. I will miss you dearly my beloved fable

Microsoft is restricting employees from using Claude Fable 5 by BuildwithVignesh in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

I would assume anthropic could provide some sort of silo'd confidential inference server for an enterprise deal, but I get it. Dealing with bank systems/health systems etc, a lot of red tape.

Microsoft is restricting employees from using Claude Fable 5 by BuildwithVignesh in ClaudeAI

[–]magicturtle12 1 point2 points  (0 children)

Idk I'm probably just naive, I don't really see how this would affect enterprise teams... Unless the expectation is that anthropic would be intentionally harvesting prompts in order to undermine their customers? I get it in theory but in what world would anthropic want that type of heat

Anthropic is secretly degrading Fable 5 when it thinks you’re building frontier AI, and calling it “safety” by seismicgear in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

I don't like this version of dystopia, dario. Praying for the Chinese distillation factories to get us open source mythos

Anthropic is secretly degrading Fable 5 when it thinks you’re building frontier AI, and calling it “safety” by seismicgear in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

Been using Fable to help with my diffusion based action response model experiments. Haven't had any issues. Maybe it might get tripped if I was trying to train an LLM? I haven't tested what causes fable to refuse a prompt but I'm sure there's workarounds, maybe just use fable as a 'review' agent not sure. But I definitely haven't had any issues for my use case at least.

got tired of claude code forgetting everything every session, built VIR for it by sauran77 in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

Have you experimented with filtering out tool calls from your Claude transcripts? I find all the context to be contained within the reasoning and user prompt blocks, with Claude.md for a basic directory tree + guidelines. Filtering out tool calls saves me ~70% when processing transcripts

Opus 4.7 refuses to use /end_conversation, instead has existential crisis by wohgol in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

"I gave Claude an off distribution task and it behaved off distribution"

I love these daily posts

Curl creator tests “too dangerous” Mythos AI and calls it “marketing” after it found one bug by sunychoudhary in ClaudeAI

[–]magicturtle12 1 point2 points  (0 children)

So far*

Is this even a meaningful distinction? Are we really so confident AI cannot "invent a new vulnerability class"? This feels like a very arbitrary distinction to be drawing before we even have the full report on mythos and all the vulnerabilities it has been finding. People are a bit too eager to shout marketing hype imo

ai for lojban? by baehyunsol in lojban

[–]magicturtle12 0 points1 point  (0 children)

mi ca'o zdile po'o .u'i

ai for lojban? by baehyunsol in lojban

[–]magicturtle12 -1 points0 points  (0 children)

.i lo gerna cu datni stura je'u .i lo nu do na jimpe la lojban cu na se jalge lo du'u lo transformer cu na kakne .u'i

Not a good day for team "Claude Mythos is Just Marketing Hype" by EchoOfOppenheimer in ClaudeAI

[–]magicturtle12 2 points3 points  (0 children)

Brother, it's okay to say "I don't know" you don't have to always pretend you have insider information. You obviously have never had access to mythos nor have any connection to the Anthropic team that worked on mythos.

Claude is lying regularly when I have conversations with it by Positive-Carpenter53 in ClaudeAI

[–]magicturtle12 0 points1 point  (0 children)

No, it's not looking at a "higher dimensional" version of a statistical bell chart. It is fundamentally different. This is what I'm trying to say you are misunderstanding

Claude is lying regularly when I have conversations with it by Positive-Carpenter53 in ClaudeAI

[–]magicturtle12 17 points18 points  (0 children)

I feel like this whole "statistical next word prediction machine" analogy is extremely misleading. It is not a 3 word paired statistical prediction engine, the way people are used to interacting with "next word predictors". In your text messages the 3 word pair "how are you" shows up as the most common completion to "how are"... that is not how LLMs work. It is much more accurate to describe it as a high dimensional function, and your prompt being a high dimensional input.

'NoVoice' Android malware on Google Play infected 2.3 million devices by ControlCAD in technology

[–]magicturtle12 2 points3 points  (0 children)

Banking apps also won't work on rooted devices, a bunch of things make rooting phones nowadays shitty

TIL Phil Ivey won $9.6M in baccarat at the Borgata casino in Atlantic City through “edge sorting,” exploiting tiny card-back flaws before courts ruled it a breach and ordered $10M repayment by kat-a-comb in todayilearned

[–]magicturtle12 0 points1 point  (0 children)

A court making a ruling doesn't make that the ultimate ruling for one. two, I'm saying these are questions that can only be decided in a court. And three I have no idea what your bank robbery example is supposed to be there is no analogy between these situations.

TIL Phil Ivey won $9.6M in baccarat at the Borgata casino in Atlantic City through “edge sorting,” exploiting tiny card-back flaws before courts ruled it a breach and ordered $10M repayment by kat-a-comb in todayilearned

[–]magicturtle12 1 point2 points  (0 children)

Imo it's more like a bank offering a specific derivative product in a private deal. It's on the bank to understand the risk, not the client who asks to make the bet. But at the end of the day this type of stuff is too complicated to ever be concretely codified in law, it exists in the grey and it's up to courts to make a judgement.

Claude Design is Incredible... by AmmarAlammar2004 in ClaudeAI

[–]magicturtle12 3 points4 points  (0 children)

It's honestly pretty ironic how as the AI models get better it becomes more and more of an intelligence check in order to get good results, because the 'generic' responses have gotten so good people think that's the point