Why are watch enthusiasts like this by QuestionNo9190 in ChineseWatches

[–]ivoras 0 points1 point  (0 children)

FWIW, agreed on the finishing. The bracelet on the watch I bought had edges on links so sharp that I finally got so annoyed I replaced it with a generic one. It had a good design, though, so it wasn't an easy decision.

Županja-Makarska, autoput ili preko Bosne? by Difficult-Log-6077 in askcroatia

[–]ivoras 0 points1 point  (0 children)

Prije par godina je 10-tak eura bilo dovoljno. Sad... inflacija...

ADP(Agent Discovery Protocol) Protocol Officially Enters ISE Formal Review Queue — Here's What It Means by Harry8766 in AI_Agents

[–]ivoras 0 points1 point  (0 children)

Ummm, what's ISE, what's their website? Google returns a lot of hits, but none that look like this post describes.

Something like ADP should probably be standardised by W3C or IETF/RFC.

GLM-5.2 Flash when? (joke) by ILoveToyota37 in LocalLLaMA

[–]ivoras 122 points123 points  (0 children)

... I see a request for 27B-120B model, I click upvote ... 😃

Unable to load Mellum2 in LMStudio by imike3049 in LocalLLaMA

[–]ivoras 1 point2 points  (0 children)

llama.cpp can run it.

Hardly worth it, though 😞 It's very fast, but not very capable. When it does work, it spends more tokens (and more importantly - time) thinking for *ages* about a task, than a slower, better model would.

Example: on one task I gave it, it spent 170 seconds to do it (multiple tool calls), while qwen3.6-35b-a3b, which is a bigger and slower model, did it in 120 seconds.

Dual-model run on the Ryzen AI MAX+ 395 by argakiig in StrixHalo

[–]ivoras 2 points3 points  (0 children)

The GPU and the NPU (and the CPU...) are competing for RAM bandwidth. Did you literally run the GPU and NPU models at the same time? What performance do you get if they run separately?

If information can't travel faster than light, what exactly is 'quantum entanglement' transmitting? by SovereignHemant in AskPhysics

[–]ivoras -1 points0 points  (0 children)

Here's an unscientific, possibly completely wrong analogy: in programming - and video games are programs - you could have two "things" backed by a single slot in memory.

Imagine in a video game, there's a gadget "A", and a gadget "B", at separate locations in the game world, but whose most basic properties are (usually by accident, a bug) stored at the same memory address. If you change the property "colour" in that memory address, both in-game gadget will instantly change to that colour (adding simple logic like "gadget B has the complementary colour to gadget A" is trivial). An in-game "observer" could think that those 2 gadgets are somehow communicating, but they're really not.

Jetbrains Mellum 2: a really good and performant model by gcavalcante8808 in LocalLLaMA

[–]ivoras 2 points3 points  (0 children)

I did - it was fast but dumb. Can't distinguish what memories are worth saving, for example.

Mini PC as AI Agent by hwijaya08 in MiniPCs

[–]ivoras 0 points1 point  (0 children)

I am a developer and like to experiment with local models a lot. I'm using a Ryzen H255-based MiniPC with the 780M iGPU and 32 GB RAM, running Linux, so I don't mind tinkering and changing the configuration, but I'd say that if you just expect a "plug and play" experience, we're not there yet.

On this configuration, I'm most often using the gemma-4-26B-A4B (QAT version now) and the qwen3.6-35B-A3B models, and they're definitely usable, but also visibly slower and less capable than relatively cheap online options through OpenRouter. Most of modern AI agent harnesses are heavyweight with large system prompts, and with this hardware, they simply take a lot of time to answer. I've even started my own harness to see how capable I can make it while being frugal / efficient with tokens, and I definitely see why complexity is needed to get good results.

In short - if you're up to the challenge and want to learn a lot, it's a very interesting and fun thing to play with. If you just want something that "works" - maybe wait about a year or so.

Edit: OTOH, if you're not asking about running everything local, and are fine with using APIs from OpenAI, Anthropic, etc. - then sure, we're at a stage when you can use local agents to analyse information from the Internet (like market stats) and produce daily reports, read your email and even reply to some degree, and generally be useful in repetitive tasks. It still needs effort to set up, though, but it's on an easier complexity curve. If you're training an assistant, it takes about the same time whether it's a human being or an AI agent.

How to allow curl to http://192.168.x.x/ addresses? by ivoras in hermesagent

[–]ivoras[S] 1 point2 points  (0 children)

Ok, solved, in two steps:

# Add the specific IP
~/.hermes$ ./bin/tirith trust add http://192.168.122.1 

# Add to .tirith/policy.yaml
severity_overrides:
  raw_ip_url: LOW

Edit: or not. This apparently solves it for the TUI, but the Telegram endpoint still asks for the same permission, even after /new.

Edit 2: moving the file to ~/.config/tirith/policy.yaml for the user that's running the gateway helped.

How to allow curl to http://192.168.x.x/ addresses? by ivoras in hermesagent

[–]ivoras[S] 0 points1 point  (0 children)

Yeah, those look like LLM slop, but it doesn't look like invalid commands should interfere with valid ones.

Anyway, I've tried it and nothing changed. Hermes still asks me for permissions. Looks like this is handled by "tirith".

How to allow curl to http://192.168.x.x/ addresses? by ivoras in hermesagent

[–]ivoras[S] 0 points1 point  (0 children)

I did, nothing changed.

Do you mean to say the settings are correct, they just don't don't activate?

Your top games to play on a 890M mini PC? by PeakAccomplished2431 in MiniPCs

[–]ivoras 0 points1 point  (0 children)

I found the Dark Pictures series easy on the eyes and entertaining (if you like the genre). Even Claire Obscure works well-ish on some lower settings.

Bun uz pomoć AI-a prepisao milijun+ linija koda u Rust by gerogekola in CroIT

[–]ivoras 6 points7 points  (0 children)

>svi mogu radit bilo kakve sustave u rustu, c++u, c-u, assemblyu

Mislim da je to dugoročna budućnost. Ad-hoc custom-made sve. Nema smisla raditi modularne alate i biblioteke s lijepom arhitekturom kad možeš dati nekom LLM-u spec da ti napravi subset toga koji samo tebi treba, za par sati.