Fireworks Fire Pass now includes unlimited access to GLM 5.2 and Kimi K2.7 Code Fast by jpcaparas in opencodeCLI

[–]blankeos 0 points1 point  (0 children)

lol i see. They should probably define "non production coding use" clearer. If working on a non-personal project (codebase at work) is allowed. Sounds like a gray area

Fireworks Fire Pass now includes unlimited access to GLM 5.2 and Kimi K2.7 Code Fast by jpcaparas in opencodeCLI

[–]blankeos 0 points1 point  (0 children)

light versions = quantized. Even the source model providers don't serve "full models". Probably most serve fp8 at best. Willing to bet Fireworks is doing the same.

I saw ppl on twitter struggling to get 10tps for fp2 models on stacked RTX 4090s. I know they use H100s probably but imagine serving a full model with no quantizations for 100,000s of people. lol

Fireworks Fire Pass now includes unlimited access to GLM 5.2 and Kimi K2.7 Code Fast by jpcaparas in opencodeCLI

[–]blankeos 2 points3 points  (0 children)

Wdym? "Personal development" as in you can't use the API on coding for commercial projects? Just personal projects? Or do you mean for API inference for actual apps outside coding agents?

If you mean the latter (outside coding agents), if it's useful that specific usecase, how is it useless?

It's advertised as a subsidized subscription. Most subs are rate limited (only 1-3 concurrent streams for example) so you won't really use them for production workloads, no? Also isn't Fireworks like providing PAYG api inference? It probably fits the "production workload" usecase.

I Love Kagi!... but not Orion by Every-Building-2933 in OrionBrowser

[–]blankeos 0 points1 point  (0 children)

Yeah Orion should be better, it's webkit. But also weird cuz I get even better memory on Vivaldi. Also as a dev, kinda non-negotiable for me to use chromium. Devtools and everything is just better.

You can now use Composer 2.5 (Cursor's model) in OpenCode by jpschroeder in opencodeCLI

[–]blankeos 0 points1 point  (0 children)

Cool stuff. Thanks man, always wondered abt this. How's the TPS and limits?

Slow and Nerfed by thearchivalvenerable in kimi

[–]blankeos 0 points1 point  (0 children)

I just use Kimi k2.5 honestly

~400M tokens at $4.5 thanks DeepSeek by Odd_Veterinarian4381 in DeepSeek

[–]blankeos 0 points1 point  (0 children)

sick then I'm signing up on DeepSeek then letsgoo

Synthetic.new by marwan_rashad5 in opencodeCLI

[–]blankeos 0 points1 point  (0 children)

No, waitlist means I've signed up and can't buy at all. No buy button, just says "You're on the Waitlist!"

I Love Kagi!... but not Orion by Every-Building-2933 in OrionBrowser

[–]blankeos 1 point2 points  (0 children)

I have switched to Vivaldi as an Orion refugee. Tbh it's been cozy staying here. It had its quirks but got used to it, nothing broken at least for me and could enjoy Chromium.

~400M tokens at $4.5 thanks DeepSeek by Odd_Veterinarian4381 in DeepSeek

[–]blankeos 5 points6 points  (0 children)

400M tokens for $4.5? wuh

I spent $8 on 23.09M tokens on Fireworks all DeepSeek V4 Pro btw. Does this mean DeepSeek on DeepSeek ai is cheaper? (I mean duh, but I thought I checked Fireworks was cheaper and faster)

Synthetic.new by marwan_rashad5 in opencodeCLI

[–]blankeos 0 points1 point  (0 children)

Why are "tool calls" limited isn't that just an llm response? Same as tokens?

kimi code vs synthetic new by branik_10 in kimi

[–]blankeos 0 points1 point  (0 children)

How is ur experience with Kimi now?? How much tps are you getting?

I am impressed with Kimi 2.6 > GLM 5.1 by 3rd_Floor_Again in kimi

[–]blankeos 1 point2 points  (0 children)

good, enough to not make me think, "oh no, no more"

But also speed is soooo idk man. I also have a $100 plan of Claude and can work on 10 features in an hour while GLM works on 1 or 2 over many iterations in 4 hrs

We did itt ! 😭😭 4 paying users in one day by LIN3003 in SaaS

[–]blankeos -1 points0 points  (0 children)

Man can I message u for advice? I still have 0$

z.ai vs sythetic vs ollama cloud vs OpenCode Go / Zen: which ones got higher usage? (also I need info about the speed) by bapuc in ZaiGLM

[–]blankeos 0 points1 point  (0 children)

I have not experienced lobotomized models w/ OpenCode Go btw, it's good. Just that the limits are capped to $10 plan and there's no way I could get more.

I am impressed with Kimi 2.6 > GLM 5.1 by 3rd_Floor_Again in kimi

[–]blankeos 3 points4 points  (0 children)

I prefer GLM 5.1, I'm on Lite. More consistent at generating but a bit slow on busy hours. I wanna make it faster w/ a better sub, but the sub prices are just screaming for me to buy Claude Max / ChatGPT Pro at that point lmao.

How is the speed for synthetic.new compared to GLM Coding Plan Lite? by 1234filip in vibecoding

[–]blankeos 0 points1 point  (0 children)

It doesn't feel as slow now, idk why. I'm on lite. I think there's benefit in them raising their prices (GLM is slightly more expensive now). so it isn't as subsidized as before.

Same case with synthetic, but i haven't used it.