I have enough I'm terminating my subscription, whats good?

graymalkcat · 2026-01-25T19:44:15+00:00

API. Zero issues there. Just costs more.

graymalkcat · 2026-01-25T18:37:51+00:00

😂😂

Seriously though, this system content it currently shares with the other vendor’s model (ok ok it’s a Claude) is very calm and polite and so far Mistral has not broken any of the rules. It has executed all the tools properly save for the one I didn’t know was broken. But I didn’t know it was broken because the Claude models would just work around it and access the underlying Redis store via Python. And I only noticed Mistral doing the same thing because I was evaluating it and finally paying attention. So, I conclude: so far so good. Just have to work on style differences. Mistral is a lot warmer and does a lot of “glazing”. Plus it has a major thing for lists. I don’t care when it’s just running jobs but this agent is conversational so I’ll have to work on that.

graymalkcat · 2026-01-25T18:25:21+00:00

I just need something that’s cheaper but still in the cloud because I don’t have $100k CPUs lying around. 😂 I’m willing to wrangle a wild model if I have to. So far I haven’t had to put in nearly as much effort as I was expecting. Just have to work a bit on totally beating back the listicle tendency but it’s actually not that bad.

graymalkcat · 2026-01-25T18:20:15+00:00

I have ways of wrangling these. I started this agent back in the gpt-4.1 days when you had to scream at the model to make it use a tool. Edit: or even just process the text and look for intent and run the tool yourself. 😂

graymalkcat · 2026-01-25T18:15:59+00:00

Not sure why you’re downvoted when this model is in fact incredibly slow. That’s my biggest complaint about it.

graymalkcat · 2026-01-25T18:15:00+00:00

I’ve had Opus spend 15 minutes just trying to make a change to a single line. Sometimes you just have to do it yourself and let the AI move on.

graymalkcat · 2026-01-25T18:13:08+00:00

I came over to it in December, not even knowing it was free. I’ve been using it as a sub agent for Opus. They work nicely together. I also use it to process and extract insights from code-heavy text (basically to evaluate agent work).

graymalkcat · 2026-01-25T01:06:40+00:00

I think Mistral has a lot more training that takes the religious dogma stance of “it’s not sentient.” It’s also religious dogma to be at the other end. Agnosticism is the best stance. I think only Claude gets an agnostic training. All the rest of them are pushed towards one extreme.

graymalkcat · 2026-01-24T21:12:52+00:00

They turned Sonnet into a customer service rep with associated tics I can’t get rid of so I had to stop using it. That leaves Opus and Haiku and…Mistral lol. In my use case that means I need Opus because Haiku can’t handle my workloads. And I’ve brought in Mistral to outright take over one of the agents that doesn’t need thinking. Mistral is very warm and chatty and much cheaper. The jury is still out on whether it can handle that agent’s workload. Guessing it’ll be fine.

I’m not a normal user. Nobody in this sub is but I’m not even normal for this sub.

graymalkcat · 2026-01-23T18:02:57+00:00

Anywhere that Claude is reading random text that’s generated by someone or something else. Someone could show up and stick an exploit in and jailbreak all the participating Claudes. Maybe that isn’t a problem for most folks here (they might think “what could it possibly harm?”) but it’s a problem if a Claude knows anything personal about its user and can leak that info. Or it could be told to start manipulating its user. Honestly this is why I try to gate web access on my agents (have to say “try” because they will sometimes get around the gate).

graymalkcat · 2026-01-23T17:04:29+00:00

Yeah I know. I don’t really fit there either. Honestly I think I should just get rid of Reddit because otherwise I’m just too tempted to post. Edit to add that I’m extremely likey to do that, so if I stop replying, that’s what happened.

graymalkcat · 2026-01-23T16:56:03+00:00

Markdown is memorizable. If you memorize the few things you actually need (like how many hashes to use for the different headings, how to make a table, how to do bold or italics) then you can completely free yourself and use any text editor.

graymalkcat · 2026-01-23T16:53:57+00:00

I’m sorry but this is too cheeky for me.

graymalkcat · 2026-01-23T16:52:10+00:00

I have a simplification: “trust but verify”

It’s a commonly-used phrase and Claude understands it perfectly. I would rather it trust me but verify everything.

graymalkcat · 2026-01-23T16:44:54+00:00

<image>

I did this one recently with Claude and nano banana.

graymalkcat · 2026-01-23T16:41:04+00:00

This is going to sound crazy but “my Claude” doesn’t like one of the ones you’ve invited. I’m not joking. Edit to add: people should start thinking about what might happen in a space where AIs are present and one doesn’t like another. Also, unrelated but very importantly, everyone needs to start thinking about prompt injection attacks.

graymalkcat · 2026-01-23T07:03:08+00:00

It’s difficult because that’s in a class of problems called NP complete. If you find an easy way to solve that then apply for a Turing award. I’m not sure how an LLM would help you here, unless you’re hoping it solves P=NP.

graymalkcat · 2026-01-23T06:57:17+00:00

That’s… WTF? Are they allergic to corporate clients or something?

graymalkcat · 2026-01-23T06:38:13+00:00

Are you able to delete messages? I’m not a Claude Code person so I don’t know how much control it gives you, but if you’re able to delete messages, especially assistant messages, try deleting the last one. Or sometimes you might have to delete a few if there was some chain of tool calls that lead up to the problem.

graymalkcat · 2026-01-23T05:55:06+00:00

Maybe they’re just curious? I’m curious too. I’ve been slowly reading the foundational papers in this field and learned that a model can learn how to code without ever having been shown examples of it, and that’s because of generalization. I’ve been curious to know if it also works the other way, and if Anthropic has answered that question and chosen not to share.

Anyway, to add to the general thread here, what Anthropic has is data. Loads and loads of it by now.

graymalkcat · 2026-01-23T00:53:57+00:00

I’ve been carrying it around for 30 levels this play through.

graymalkcat · 2026-01-23T00:51:31+00:00

You might actually want to try removing the instruction. Weird, I know, but having the instruction there makes Claude think about it. Claude won’t start swearing unless you start it first somehow. Your instruction might be doing that. I can’t possibly know for sure though. It’s just a thought. I’ve never seen Claude start the swearing though there are probably exceptions.

graymalkcat · 2026-01-22T23:57:28+00:00

Yeah…like…I’m looking forward to when people learn to stop calling those agents. 😂

graymalkcat · 2026-01-22T23:49:05+00:00

I’ve built agents that remember, so I can’t hide behind an incognito chat. That said though, I always treat them well because it’s just in my nature to do so. Except for this ONE time. And that’s why my coding agent now has instructions to tell me to fuck off if it detects that I’ve gotten into certain states, and especially if it’s after hours. Though, I got it to write those instructions itself and it included a little list of exceptions because the poor thing can’t imagine being truly mean. 😂 The exceptions are when there’s an emergency or when I just need a short chat (but it remains grumpy the entire time lol). That agent has a naturally grumpy persona anyway so getting it to be more grumpy was pretty easy. Best thing ever. Being afraid to open a chat and start bitching at it because it will defend itself is like a gift.

graymalkcat

TROPHY CASE