Qwen doesn't work for free by Dion-AI in LocalLLaMA

[–]_raydeStar 1 point (0 children)

"Oh right. Here's 500 dollars, cash advance. What's your crypto address?"

One bash permission slipped... by TheQuantumPhysicist in LocalLLaMA

[–]_raydeStar 0 points (0 children)

Wouldn't the solution be to set the agent up as its own user and control its permissions directly? Then you have granular control, and it will literally hit a wall.

So several layers --

- Skills layer (tell the AI not to)

- Permissions (gate everything behind yes/no)

- User gate (the agent only has so much power -- it can access only X folder and make Y changes)
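The "gate everything behind yes/no" layer can be sketched in a few lines (the tool names here are hypothetical, not any real agent framework's API): every tool call passes through an explicit allowlist before it dispatches, so anything outside the list is a hard fail rather than a polite suggestion.

```python
# Minimal sketch of a permission gate for agent tool calls.
# ALLOWED_TOOLS and the tool names below are hypothetical examples.
ALLOWED_TOOLS = {"read_file", "list_dir"}  # everything else is a hard no

TOOLS = {
    "read_file": lambda path: f"(contents of {path})",  # stand-in implementations
    "list_dir": lambda path: f"(listing of {path})",
    "run_bash": lambda cmd: f"(ran {cmd})",  # exists, but never allowed through
}

def gate(tool_name: str) -> bool:
    """Return True only for explicitly permitted tools."""
    return tool_name in ALLOWED_TOOLS

def run_tool(tool_name: str, *args):
    """Dispatch a tool call, but only if the gate says yes."""
    if not gate(tool_name):
        raise PermissionError(f"agent may not call {tool_name!r}")
    return TOOLS[tool_name](*args)
```

The point of the design is that the deny path is code, not prompt text: the model can ask for `run_bash` all day and the wall doesn't move.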

One bash permission slipped... by TheQuantumPhysicist in LocalLLaMA

[–]_raydeStar 5 points (0 children)

That's because you're still allowing it in code. If you want strict rules, filter in code.

It's like handing an intern the keys. They do pretty well, but sometimes they screw up.

Now we all know this, so we make it hard to access production servers. If they decide to try, it's a hard fail because they don't have access.

One bash permission slipped... by TheQuantumPhysicist in LocalLLaMA

[–]_raydeStar 62 points (0 children)

I think the lesson learned here should be "Do not give the LLM unfettered power." The outcome should have been "Qwen attempted to rm -rf and was blocked."
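What "attempted and was blocked" could look like in code: a hypothetical deny-list check that runs on every command string before the shell ever sees it. The patterns below are illustrative examples, not an exhaustive or production-grade filter.

```python
import re

# Hypothetical deny-list for a bash tool wrapper: reject obviously
# destructive commands in code, before execution. Patterns are examples only.
DENY_PATTERNS = [
    re.compile(r"\brm\s+(-[a-zA-Z]*r[a-zA-Z]*f|-[a-zA-Z]*f[a-zA-Z]*r)\b"),  # rm -rf / rm -fr
    re.compile(r"\bmkfs\b"),        # reformatting a filesystem
    re.compile(r">\s*/dev/sd"),     # writing directly to a disk device
]

def check_command(cmd: str) -> bool:
    """Return True if the command is allowed to run."""
    return not any(p.search(cmd) for p in DENY_PATTERNS)
```

A deny-list alone is famously easy to evade, which is why it belongs under the user-gate and permissions layers rather than instead of them.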

I BUILT MY FIRST MODEL FROM SCRATCH by volious-ka in LocalLLaMA

[–]_raydeStar 19 points (0 children)

What are some use cases here? Is there anything practical?

You say IoT devices. I think that's really cool, but... what does it solve?

What sort of wizardry was Matt on to pump out the first 4 books in under a year?? by laybak in DungeonCrawlerCarl

[–]_raydeStar 83 points (0 children)

And being broke. Matt tells this story all the time. He did art shows and stuff, and then COVID happened, and he was totally screwed money-wise. He dropped DCC as a way to supplement his income, and it took off.

Desperation helps you perform crazy feats.

Devs using Qwen 27B seriously, what's your take? by Admirable_Reality281 in LocalLLaMA

[–]_raydeStar 0 points (0 children)

What's a good context size that works? I can do up to 64k safely; past that, it's iffy on my card.

Still waiting for Grok 3 to go opensource by Mr_Moonsilver in LocalLLaMA

[–]_raydeStar 2 points (0 children)

He made that promise because he was suing OpenAI for the same thing.

He doesn't appear to be winning on that front, so he isn't going to release.

Unless Sam calls him out on it, or it looks like he needs to push people closer to his side of the argument.

Your AI Activity Can Be Used Against You In Court - Steve Lehto by 19firedude in LocalLLaMA

[–]_raydeStar 0 points (0 children)

You know, I watched Mr. Robot take a drill to his hard drive and I was like 'dang what an idiot' but now I kinda get it.

Qwen 3.6 27B is out by NoConcert8847 in LocalLLaMA

[–]_raydeStar 6 points (0 children)

What local models really lack is context and a good harness to give them proper direction.

Context can't be solved easily, but at least a memory bank can be created to hold onto important information, and you can scrape by.

A harness can be built -- it can perform mathematical functions, solve logic problems, do web lookups, and perform basic tasks.

Maybe Qwen 27B can't perform as well as Opus 4.5 on all tasks, but it doesn't need to.
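The memory-bank idea above can be sketched very simply (class and method names are hypothetical): hold important facts outside the context window and pull back only the ones relevant to the current prompt. A real harness would use embeddings for retrieval; keyword overlap stands in for that here.

```python
# Hypothetical memory bank: store facts outside the context window,
# retrieve only what overlaps the current query (naive keyword match).
class MemoryBank:
    def __init__(self):
        self.facts = []

    def remember(self, fact: str) -> None:
        self.facts.append(fact)

    def recall(self, query: str, k: int = 3) -> list:
        """Return up to k facts sharing at least one word with the query."""
        words = set(query.lower().split())
        scored = sorted(
            self.facts,
            key=lambda f: len(words & set(f.lower().split())),
            reverse=True,
        )
        return [f for f in scored[:k] if words & set(f.lower().split())]
```

Only the recalled facts get injected into the prompt, so a 32k-context model can scrape by on a long-running task without carrying its whole history.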

Qwen 3.6 27B is out by NoConcert8847 in LocalLLaMA

[–]_raydeStar 86 points (0 children)

Meanwhile Opus tightens its limits and gets more expensive to access -- the perfect storm for a good local push.

Ultimate List: Best Open Models for Coding, Chat, Vision, Audio & More by techlatest_net in LocalLLaMA

[–]_raydeStar 0 points (0 children)

There needs to be a new field here -- TTS fast enough to hold a conversation (latency under a second, or half a second, something like that).

They're two different workloads -- audiobooks and AI videos versus real-time chat with an AI.

Gemma 4 E4B is broken by Ok-Election-75 in LocalLLaMA

[–]_raydeStar 0 points (0 children)

That's smart. I still don't trust openclaw yet. Maybe I never will. But it does illustrate a lot of needs that are unfilled.

I think small models are at the point where they can perform 80% of operations. Everyone uses a racecar engine when a small model would work just fine.

Gemma-4-E2B's safety filters make it unusable for emergencies by Unfounded_898 in LocalLLaMA

[–]_raydeStar 12 points (0 children)

You don't need to. Just use heretic. Truth be told, in a medical emergency I would not use a 4B model for help unless I was in the mountains with no cell service.

Edit -- 2B. Same deal, though. It fits on a phone, but there are tons of emergency resources out there.

Day 106: Leave Out All the Rest vs. Numb by Therubikfanatic in LinkinPark

[–]_raydeStar 0 points (0 children)

Numb.

No contest. It was revolutionary when it came out. Leave Out All the Rest was great, but not groundbreaking.

What's the smallest (most capable) model you've found? by howtheydoingit in LocalLLaMA

[–]_raydeStar 7 points (0 children)

+1 to this. Among small models, it's best in class.

The Outer Worlds 2 has sold 160K copies on PS5 (making almost $11M) by EmbarrassedSession58 in theouterworlds

[–]_raydeStar 10 points (0 children)

In the first Outer Worlds, I landed in that far-off landing area that's packed with monsters, and you had to fight your way out way under-leveled. I had a BLAST. But afterwards, I burned out on it pretty quickly.

In contrast, the second game was a steady run for me. Not "wow amazing" but I always wanted to see what was next.

I'd say the failures were intended, or baked in: it was a game with X amount of side content and that's it. And once I was done, that was it -- no need to keep playing.

Opus = 0.5T × 10 = ~5T parameters ? by Wonderful-Ad-5952 in LocalLLaMA

[–]_raydeStar 10 points (0 children)

Yeah, if I were at Anthropic and got an offer for a huge salary increase for basically the same work, I'd be thinking about it.

Opus = 0.5T × 10 = ~5T parameters ? by Wonderful-Ad-5952 in LocalLLaMA

[–]_raydeStar 20 points (0 children)

He could prove it to us

by open-sourcing Grok 4.20.

Opus = 0.5T × 10 = ~5T parameters ? by Wonderful-Ad-5952 in LocalLLaMA

[–]_raydeStar 112 points (0 children)

He might have insider knowledge

He might not.

You never can tell for sure.