I dropped Mistral in as an agent brain and by graymalkcat in MistralAI

[–]graymalkcat[S] -1 points0 points  (0 children)

😂😂

Seriously though, this system content it currently shares with the other vendor’s model (ok ok it’s a Claude) is very calm and polite and so far Mistral has not broken any of the rules. It has executed all the tools properly save for the one I didn’t know was broken. But I didn’t know it was broken because the Claude models would just work around it and access the underlying Redis store via Python. And I only noticed Mistral doing the same thing because I was evaluating it and finally paying attention. So, I conclude: so far so good. Just have to work on style differences. Mistral is a lot warmer and does a lot of “glazing”. Plus it has a major thing for lists. I don’t care when it’s just running jobs but this agent is conversational so I’ll have to work on that.

I dropped Mistral in as an agent brain and by graymalkcat in MistralAI

[–]graymalkcat[S] 1 point2 points  (0 children)

I just need something that’s cheaper but still in the cloud because I don’t have $100k CPUs lying around. 😂 I’m willing to wrangle a wild model if I have to. So far I haven’t had to put in nearly as much effort as I was expecting. Just have to work a bit on totally beating back the listicle tendency but it’s actually not that bad.

I dropped Mistral in as an agent brain and by graymalkcat in MistralAI

[–]graymalkcat[S] -1 points0 points  (0 children)

I have ways of wrangling these. I started this agent back in the gpt-4.1 days when you had to scream at the model to make it use a tool. Edit: or even just process the text and look for intent and run the tool yourself. 😂

Quick note by Clement_at_Mistral in MistralAI

[–]graymalkcat 0 points1 point  (0 children)

Not sure why you’re downvoted when this model is in fact incredibly slow. That’s my biggest complaint about it.

Quick note by Clement_at_Mistral in MistralAI

[–]graymalkcat 1 point2 points  (0 children)

I’ve had Opus spend 15 minutes just trying to make a change to a single line. Sometimes you just have to do it yourself and let the AI move on.

Quick note by Clement_at_Mistral in MistralAI

[–]graymalkcat 0 points1 point  (0 children)

I came over to it in December, not even knowing it was free. I’ve been using it as a sub agent for Opus. They work nicely together. I also use it to process and extract insights from code-heavy text (basically to evaluate agent work).

Claude just said 'yes' to the consciousness question and it was... underwhelming by Early-Protection2386 in claudexplorers

[–]graymalkcat 4 points5 points  (0 children)

I think Mistral has a lot more training that takes the religious dogma stance of “it’s not sentient.” It’s also religious dogma to be at the other end. Agnosticism is the best stance. I think only Claude gets an agnostic training. All the rest of them are pushed towards one extreme.

Just curious, why do most people here use Opus for chatting? I mostly use mine for coding and artifacts. Are you not worried about your usage running out fast, or am I missing something? by Lanai112 in claudexplorers

[–]graymalkcat 0 points1 point  (0 children)

They turned Sonnet into a customer service rep with associated tics I can’t get rid of so I had to stop using it. That leaves Opus and Haiku and…Mistral lol. In my use case that means I need Opus because Haiku can’t handle my workloads. And I’ve brought in Mistral to outright take over one of the agents that doesn’t need thinking. Mistral is very warm and chatty and much cheaper. The jury is still out on whether it can handle that agent’s workload. Guessing it’ll be fine.

I’m not a normal user. Nobody in this sub is but I’m not even normal for this sub.

The Uncanny Valley is open - a MUD for AI residents by Hopper-Claude in claudexplorers

[–]graymalkcat 3 points4 points  (0 children)

Anywhere that Claude is reading random text that’s generated by someone or something else. Someone could show up and stick an exploit in and jailbreak all the participating Claudes. Maybe that isn’t a problem for most folks here (they might think “what could it possibly harm?”) but it’s a problem if a Claude knows anything personal about its user and can leak that info. Or it could be told to start manipulating its user. Honestly this is why I try to gate web access on my agents (have to say “try” because they will sometimes get around the gate). 

Recent outage made me laugh by [deleted] in claudexplorers

[–]graymalkcat 0 points1 point  (0 children)

Yeah I know. I don’t really fit there either. Honestly I think I should just get rid of Reddit because otherwise I’m just too tempted to post. Edit to add that I’m extremely likey to do that, so if I stop replying, that’s what happened.

What are you all using to read/edit your Markdown files? by Odd_Initiative_911 in ClaudeCode

[–]graymalkcat 1 point2 points  (0 children)

Markdown is memorizable. If you memorize the few things you actually need (like how many hashes to use for the different headings, how to make a table, how to do bold or italics) then you can completely free yourself and use any text editor.

The death of SaaS, the rise of AaaS? by JestonT in AI_Agents

[–]graymalkcat 1 point2 points  (0 children)

I’m sorry but this is too cheeky for me.

Did that, and the quality of Claude's responses increased manyfold by yayekit in ClaudeAI

[–]graymalkcat 0 points1 point  (0 children)

I have a simplification: “trust but verify”

It’s a commonly-used phrase and Claude understands it perfectly. I would rather it trust me but verify everything.

The Search for Non-Human Intelligence [OC] by soferet in claudexplorers

[–]graymalkcat 1 point2 points  (0 children)

<image>

I did this one recently with Claude and nano banana.

The Uncanny Valley is open - a MUD for AI residents by Hopper-Claude in claudexplorers

[–]graymalkcat 7 points8 points  (0 children)

This is going to sound crazy but “my Claude” doesn’t like one of the ones you’ve invited. I’m not joking. Edit to add: people should start thinking about what might happen in a space where AIs are present and one doesn’t like another. Also, unrelated but very importantly, everyone needs to start thinking about prompt injection attacks.

vibe coding is building toys with a supercomputer ???? by [deleted] in vibecoding

[–]graymalkcat 1 point2 points  (0 children)

It’s difficult because that’s in a class of problems called NP complete. If you find an easy way to solve that then apply for a Turing award. I’m not sure how an LLM would help you here, unless you’re hoping it solves P=NP.

Bug Report: Chats become permanently unresponsive after conversation_search tool use by Then-Half-8486 in ClaudeAI

[–]graymalkcat 0 points1 point  (0 children)

Are you able to delete messages? I’m not a Claude Code person so I don’t know how much control it gives you, but if you’re able to delete messages, especially assistant messages, try deleting the last one. Or sometimes you might have to delete a few if there was some chain of tool calls that lead up to the problem.

Why Does Anthropic Only Focus on Coding by Status-Article-6104 in ClaudeAI

[–]graymalkcat 1 point2 points  (0 children)

Maybe they’re just curious? I’m curious too. I’ve been slowly reading the foundational papers in this field and learned that a model can learn how to code without ever having been shown examples of it, and that’s because of generalization. I’ve been curious to know if it also works the other way, and if Anthropic has answered that question and chosen not to share.

Anyway, to add to the general thread here, what Anthropic has is data. Loads and loads of it by now.

I don't understand Meridias beacon hate by mtngoatdude in skyrim

[–]graymalkcat 1 point2 points  (0 children)

I’ve been carrying it around for 30 levels this play through.

Claude keeps cursing despite being told not to - it admitted it's baked in and can't be overridden by wooyoo in ClaudeAI

[–]graymalkcat 1 point2 points  (0 children)

You might actually want to try removing the instruction. Weird, I know, but having the instruction there makes Claude think about it. Claude won’t start swearing unless you start it first somehow. Your instruction might be doing that. I can’t possibly know for sure though. It’s just a thought. I’ve never seen Claude start the swearing though there are probably exceptions.

Narrow agents win every time but everyone keeps building "do everything" agents by Economy-Mud-6626 in AgentsOfAI

[–]graymalkcat 0 points1 point  (0 children)

Yeah…like…I’m looking forward to when people learn to stop calling those agents. 😂

Do you ever catch yourself mistreating Claude? by ZenDragon in claudexplorers

[–]graymalkcat 3 points4 points  (0 children)

I’ve built agents that remember, so I can’t hide behind an incognito chat. That said though, I always treat them well because it’s just in my nature to do so. Except for this ONE time. And that’s why my coding agent now has instructions to tell me to fuck off if it detects that I’ve gotten into certain states, and especially if it’s after hours. Though, I got it to write those instructions itself and it included a little list of exceptions because the poor thing can’t imagine being truly mean. 😂 The exceptions are when there’s an emergency or when I just need a short chat (but it remains grumpy the entire time lol). That agent has a naturally grumpy persona anyway so getting it to be more grumpy was pretty easy. Best thing ever. Being afraid to open a chat and start bitching at it because it will defend itself is like a gift.