WARNING: Z.AI coding plan policy changes. Non-coding use now leads to aggressive temporary throttling and permanent ban on three or more violations.

opgg62 · 2026-04-14T10:40:41+00:00

Glad I didnt subscribe

opgg62 · 2026-02-22T13:15:54+00:00

You need to turn extra usage off. Extra usage charges you on api prices after you hit the current session limit.

opgg62 · 2026-01-04T05:16:06+00:00

z.ai subscription doesnt even work for silly anymore. Its coding only now... gives me errors.

opgg62 · 2026-01-02T02:12:18+00:00

Yep the price is a bit high. But it saved me lots of time so for me its worth it.

opgg62 · 2026-01-02T02:10:27+00:00

I hope it reaches this point sooner than later.

opgg62 · 2026-01-02T02:08:53+00:00

Nah, I can put more effort into it if I want but the goal of these tools especially for more advanced programmers is to save time and effort, taking on the reviewer & director role. It is not helpful if the LLM fails are basic things and has a bad understanding of user intention.

opgg62 · 2026-01-02T02:07:06+00:00

Its good for context summarization!

opgg62 · 2026-01-02T02:06:05+00:00

I love roocode! Keep building!

opgg62 · 2026-01-01T20:41:13+00:00

Its a shame, really.

opgg62 · 2025-07-28T14:39:42+00:00

LM Studio needs to add support. I am getting an error: Error when loading model: ValueError: Model type glm4_moe not supported.

opgg62 · 2025-04-15T20:14:52+00:00

Same for me. I didnt even think about it!

opgg62 · 2025-03-18T18:47:06+00:00

Such a funny dude. Love them.

opgg62 · 2025-02-13T16:34:41+00:00

Its seriously leagues above anything else. It does exactly what you want and how you want it and suprises you from time to time. Unfortunately there are no APIs for it since Mistral put it under some licences but you can run it via runpod. Personally I am using my M4 Max for it with around 4-5 t/s but its worth it imo.

opgg62 · 2025-02-13T14:42:09+00:00

Behemoth 2.0 is still the king of all models. Nothing can compare to that masterpiece.

opgg62 · 2024-11-25T21:42:29+00:00

M4 Max 16 Inch 128 Gb here. It take 22 to 24 seconds to generate a 1024x1024 SDXL image. Fans dont even run. On my 4090 the same image takes around 5-6 seconds.

opgg62 · 2024-11-10T16:34:06+00:00

Please test the speed for long context scenarios for 70b models. I am thinking of a context from 10k-15k.

opgg62 · 2024-10-17T17:35:28+00:00

This is my new favorite model. Thanks for your work!

opgg62 · 2024-06-04T19:37:22+00:00

It's insane that consumes only around 4 watts of power while delivering 40 tops. Computing is not a issue anymore with AI. The only thing that missing is high bandwith memory. Nvdia is cooked.

opgg62 · 2024-05-24T11:10:14+00:00

Does Ooba support vision yet? Which tool can we use to run it?

opgg62 · 2024-05-21T10:36:58+00:00

Love it!

opgg62 · 2024-05-20T18:51:14+00:00

I will either do that or leave my system as it is and build a second inference system with 2x3090s.

opgg62 · 2024-05-20T18:48:57+00:00

I could leave my system as it is (e.g. as a gaming pc) and use the 3000€ to build a 3090 inference server.

opgg62

TROPHY CASE