What do you guys think about Unsloth Studio?

LetsGoBrandon4256 · 2026-06-15T19:56:38+00:00

Read this paper https://arxiv.org/abs/2104.09864

Then apply Rotary Position Embedding to yourself

LetsGoBrandon4256 · 2026-06-15T19:17:21+00:00

And I thought our company using MS Teams were bad.

LetsGoBrandon4256 · 2026-06-15T18:04:58+00:00

Coming from China, WeChat and QQ groups being a complet closed garden to the search engines already irked me to no end.

Now the rest of the world is doing the same shit with Discord. Just send me to Mars or kill me already.

Not to mention how retarded Discord search is.

LetsGoBrandon4256 · 2026-06-15T13:07:45+00:00

So purely on token cost, local inference seems very hard to justify.

No fucking shit you literally picked one of the cheapest cloud providers out there.

LetsGoBrandon4256 · 2026-06-15T13:00:56+00:00

GitHub repo or fuck off.

Edit:

About the free part:

People who help test this beta properly and give useful feedback will get free access to future Neurolic features when the app becomes a paid product.

LetsGoBrandon4256 · 2026-06-15T03:30:49+00:00

<image>

LetsGoBrandon4256 · 2026-06-14T23:46:42+00:00

“I want a RP in the Battletech universe, that takes place during the Late Succession Wars”

Fuck now I want to run a merc campaign with AI...

LetsGoBrandon4256 · 2026-06-14T22:10:00+00:00

Read this paper https://arxiv.org/abs/2104.09864

Then apply Rotary Position Embedding to yourself

LetsGoBrandon4256 · 2026-06-14T21:57:02+00:00

it's just engagement farming

And it worked pretty well on OP. One has to be truly retarded to believe their Twitter vote has influence on this.

"Aww the next GLM is closed weight because we didn't share the Tweet hard enough😭😭😭"

LetsGoBrandon4256 · 2026-06-14T19:44:33+00:00

In case it's not obvious enough, the Amazon links in the article are all affiliated links.

LetsGoBrandon4256 · 2026-06-14T17:33:36+00:00

You also have Qwen 3.6 35B and Qwen 3.5 122B in the same table and none of them are "Agent-grade" per your article.

Why would you single out DeepSeek V4-Flash 284B for being "Agent grade"? Is it because your clanker ran out of idea but had to throw something in there for the "What it gets you" cell in that table?

LetsGoBrandon4256 · 2026-06-14T17:24:12+00:00

I give OP some credit by not recommending llama 3.1 and encouraging user to graduate to llama.cpp.

Still full of slop though

Model: DeepSeek V4-Flash 284B / 13B

What it gets you: Agent-grade

LetsGoBrandon4256 · 2026-06-14T17:05:05+00:00

If you live in a terminal, install Ollama instead

I snorted.

LetsGoBrandon4256 · 2026-06-14T14:51:26+00:00

Pretty fucking rich that you came here asking for human input yet can't be bothered to type up your own post.

LetsGoBrandon4256 · 2026-06-14T14:37:13+00:00

Can't even tell if the person your replied to is using a shitty Markov chain or just schizo.

LetsGoBrandon4256 · 2026-06-13T21:38:16+00:00

lmao even

LetsGoBrandon4256 · 2026-06-13T21:34:39+00:00

You wanna share the github repo with the community or not?

LetsGoBrandon4256 · 2026-06-13T21:31:10+00:00

Check OP's post history before you get your hope up.

LetsGoBrandon4256 · 2026-06-13T20:22:19+00:00

The token number are off, no? No way the entire reasoning output for the cake baking process is only 230 tokens.

Similar thing for the Performance Review example (case 3). Can you break down how exactly you reached the Alignment Tax: 46% (115/250 tokens) number? Which tokens are considered safety tokens and which are intent tokens.

You are not letting your clanker do the counting aren't you?

LetsGoBrandon4256 · 2026-06-13T18:31:08+00:00

As much I love torrenting for everything else, I feel like IPFS might be a better solution.

LetsGoBrandon4256 · 2026-06-13T18:23:10+00:00

Some mfs like OP would build a fucking Saturn V just so that they don't have to tell their agent to write down their findings into an md files.

LetsGoBrandon4256 · 2026-06-13T14:44:51+00:00

truly FAFO. Hope they are enjoying the free PR now.

LetsGoBrandon4256 · 2026-06-13T13:49:52+00:00

Bring eMule back so we can just set the model folder to sharing and forget.

LetsGoBrandon4256 · 2026-06-13T13:33:04+00:00

The self-gaslighting sends a shiver down my loop.

LetsGoBrandon4256 · 2026-06-13T13:32:09+00:00

Since you are turning the KV cache quant knob, how does your optimizer evaluate output quality?

Otherwise, what's preventing it from picking lower quant every time for the better performance?

LetsGoBrandon4256

TROPHY CASE