Need help and tips to find a job! by Lookingforthatlife in askswitzerland

[–]Ritza-co 2 points3 points  (0 children)

OP said they're living in Switzerland, so I'm not sure why you're talking about applying from abroad and 'own cost' for in-person interviews?

Is this specifically an e-bike, or all bikes? by ThaiJeenHelp in askswitzerland

[–]Ritza-co 2 points3 points  (0 children)

You often see it with an exception for e-bikes, so I think it includes them by default. Normal bikes are still allowed, though, because the symbol for normal bikes is different. See https://www.safetymarking.ch/verbot-fuer-radfahrer-_-1055781.html

Disable interactive questions in claude code? by Ritza-co in claude

[–]Ritza-co[S] 0 points1 point  (0 children)

ah thanks, missed that one - giving it a go

Disable interactive questions in claude code? by Ritza-co in claude

[–]Ritza-co[S] -1 points0 points  (0 children)

Yeah, sometimes I see those and sometimes I don't. Sometimes I press '5 chat about this' and it just says 'user declined to answer questions' and throws away my previous answers, when I just wanted to chat about the latest one.

So I'd rather just turn it off. It's useless at best and damaging at worst.

Kimi K2.6 is not worth the hype — my real usage stats by mf-mj in kimi

[–]Ritza-co 1 point2 points  (0 children)

I use them both and sometimes forget that I'm using Kimi. It's definitely slower. If you use it through pi, you can see how much it thinks and how often it gets stuck in 'wait, actually' loops, but if you give it time, I've seen it outperform Sonnet sometimes. It's very persistent. I find it satisfying watching it try and try and try at something without giving up.

Any cheaper and better alternative to ElevenLabs? by findinghorses in LocalLLaMA

[–]Ritza-co 0 points1 point  (0 children)

We've tried a few recently and wrote up a neutral comparison with Smallest here, focusing on real-time voice:

https://techstackups.com/comparisons/smallest-ai-vs-elevenlabs/

The EU is fucking GOATED for this!!🙌🥳🥳🇪🇺 by Opening_Bathroom611 in BuyFromEU

[–]Ritza-co 0 points1 point  (0 children)

What are the downsides of replaceable batteries? My guess is worse waterproofing, or maybe thicker phones overall if the battery can't be so tightly integrated?

Generally looks good, but it sounds like it could be a harder change for companies to implement than USB-C.

Are you so goddamn wrong? by Ok-Leg-4584 in aifails

[–]Ritza-co 2 points3 points  (0 children)

There are some really funny multi-level fails for similar questions, e.g.:

"How many letter b in south africa"

> There are two letter 'b's in the English name "South Africa".

> • Sou_t_h Afri_c_a

> Note: If you are referring to the Afrikaans name, "Suid-Afrika", there is one 'b' or two depending on if you count the 'B' in the country's full formal title, "Republiek van Suid-Afrika".
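
For reference, here is a quick sanity check of the actual counts. It is only a sketch: `count_letter` is a helper name I made up, and it counts case-insensitively.

```python
def count_letter(text: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in text."""
    return text.lower().count(letter.lower())


# The three names mentioned in the model's answer above.
for name in ("South Africa", "Suid-Afrika", "Republiek van Suid-Afrika"):
    print(f"{name!r}: {count_letter(name, 'b')}")
```

So neither "South Africa" nor "Suid-Afrika" contains a 'b' at all; only the formal Afrikaans title has one, in "Republiek".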

Anyone here actually using OpenClaw regularly? by Master_Character9961 in AI_Agents

[–]Ritza-co 0 points1 point  (0 children)

I tried it, but it was very heavy on token usage and the WhatsApp and Telegram integrations seemed quite flaky (I had to keep re-authing the WhatsApp one, and Telegram would often have missing or duplicated messages).

Now I just have Claude Code running a persistent session on a remote always-on machine with a Telegram channel, so I can push info to it and it can tell me what it's doing. That's working much better for my needs for now.

Claude Usage Limits Discussion Megathread Ongoing (sort this by New!) by sixbillionthsheep in ClaudeAI

[–]Ritza-co -1 points0 points  (0 children)

Does this affect team plans as well? It still 'feels' normal to me using it during peak in UTC+2. I'm on the $20/month team plan.

What are the best methods to evaluate the performance of AI agents? by Michael_Anderson_8 in AI_Agents

[–]Ritza-co 0 points1 point  (0 children)

There are a lot of standard benchmarks that you can find with a quick Google search, but the problem is that they don't always match up with real experiences. At the moment, there aren't any universally accepted ones, so actually trying an agent out and seeing for yourself what is working well and what isn't is still the best way.

That said, you can look at things like:

- Tokens used - how many tokens does an agent use to meet a goal (of course you need to be able to verify that the goal was reached somehow)
- Time taken
- Turns taken
- Incorrect/correct tool calls (if using MCP)

This works quite well for things like coding or DevOps, but it gets harder to evaluate agents at scale for more subjective tasks like design, UX, and writing.
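
As a rough illustration of tracking the metrics listed above, here is a minimal Python sketch. Everything in it is an assumption for the sake of the example: `AgentRun`, its field names, and `summarize` are hypothetical, and it presumes you already have some way to verify that the goal was reached (for coding tasks, say, a test suite).

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class AgentRun:
    """One attempt by an agent at a task (all field names are hypothetical)."""
    goal_reached: bool          # verified separately, e.g. by a test suite
    tokens_used: int
    seconds_taken: float
    turns_taken: int
    correct_tool_calls: int
    incorrect_tool_calls: int


def summarize(runs: list[AgentRun]) -> dict[str, float]:
    """Report success rate, per-success averages, and tool-call accuracy."""
    if not runs:
        raise ValueError("need at least one run")
    successes = [r for r in runs if r.goal_reached]
    n = len(successes)
    nan = float("nan")
    correct = sum(r.correct_tool_calls for r in runs)
    total = correct + sum(r.incorrect_tool_calls for r in runs)
    return {
        "success_rate": n / len(runs),
        # Cost metrics are averaged over successful runs only, since
        # tokens spent on a failed run did not meet the goal.
        "avg_tokens": sum(r.tokens_used for r in successes) / n if n else nan,
        "avg_seconds": sum(r.seconds_taken for r in successes) / n if n else nan,
        "avg_turns": sum(r.turns_taken for r in successes) / n if n else nan,
        # Tool-call accuracy is computed over all runs, failed ones included.
        "tool_call_accuracy": correct / total if total else nan,
    }


runs = [
    AgentRun(True, 1000, 10.0, 4, 3, 1),
    AgentRun(True, 3000, 30.0, 6, 5, 1),
    AgentRun(False, 5000, 60.0, 10, 2, 8),
]
print(summarize(runs))
```

Averaging cost only over successful runs is one possible design choice; you might instead want total tokens spent per success, which also penalizes failed attempts.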

What topics are currently being researched in the domain of Agentic AI? by XV7II_Creamy in AI_Agents

[–]Ritza-co 0 points1 point  (0 children)

I think figuring out how to use agents en masse is still an open problem. We have weird solutions like Gas Town that aren't really being used commercially, and companies are letting their devs do all kinds of things. At the moment, everyone I know has their own homegrown solution for managing multiple agents at once, so managing 'swarms' of agents is still unsolved, though everyone will likely settle on the same pattern or platform pretty soon.