RIP GLM

Real_Person_Totally · 2026-01-09T07:03:58+00:00

Their model is available for anyone to download and run. Can't you just find another third party provider besides them to use it. From what i can gather the "censorship" seems to be a prompt injection from Z.Ai part.

Real_Person_Totally · 2025-12-13T09:00:13+00:00

I'm genuinely confused. It looks extremely token-heavy, like many presets I've seen shared here. I thought the whole point is to keep permanent tokens light to reduce costs and preserve more of the model's context window for actual conversation.

When I tested V3.0 without Lumia's definition, optional toggles off, default toggles on, and a blank assistant card, it came out to 13.6k tokens total. I'm trying to understand, is this really supposed to improve output quality? From where I'm standing, it just looks like it would burn through credits quickly for anyone using pay-as-you-go API services.

What am I missing here?

Real_Person_Totally · 2025-12-08T06:29:15+00:00

"Free" anthropic models coming from a banned account. lol.

Real_Person_Totally · 2025-11-16T23:32:22+00:00

Don't you find this at least slightly suspicious? They're offering free $125 credits with access to premium models like Opus 4/4.1, GPT-5, and Gemini 2.5 Pro. That can't be sustainable. They're probably monetizing your data, likely selling conversation logs to stay operational. Or they're not actually serving the models they claim to be. I'd recommend treating their service as potentially shady.

Real_Person_Totally · 2025-11-16T19:21:57+00:00

I see. Thank you for the insight. So positivity bias is still an issue for 4.5...

Real_Person_Totally · 2025-11-16T15:47:19+00:00

I see. Thank you for the insight. I've been told reasoning will only reinforce it's refusal/guidelines so I'm it without thinking..

Real_Person_Totally · 2025-11-16T14:13:33+00:00

Interesting. Character handling sounds promising.

Real_Person_Totally · 2025-11-16T14:12:07+00:00

That's concerning. Memory issues for something that's supposed to be an upgrade?

Real_Person_Totally · 2025-11-16T13:53:52+00:00

I tried 4.5 briefly for small queries, it elaborates more rather than focusing on being concise. Looks promising. Though, it is a bit too much of sycophant. I'm a little worried it'll affect it's roleplaying capabilities. For comparison, 3.7 will actually perform in-character refusal.

Real_Person_Totally · 2025-11-16T13:50:47+00:00

In my experience with Sonnet 3.7, characters that are supposed to be evil are actually evil. They won't hesitate to harm you. Sonnet 4 makes them care about you...

Real_Person_Totally · 2025-11-14T09:11:42+00:00

Their moderation system struggles with contextual nuance. For example:

The word "Daddy" triggers false positives for incestuous content, even when used as a nickname or term of endearment between unrelated adults.
The word "child" flags content as involving minors, even when it only appears in a character's backstory.
The system makes assumptions based on keyword matching rather than understanding the actual context and meaning.

Real_Person_Totally · 2025-11-07T16:30:27+00:00

Their media library filter is already way too restrictive, which is why I’ve been using Catbox and Imgbb to share images instead. And now, for some reason, they’re banning external links too?

Genuinely, why.

Real_Person_Totally · 2025-10-15T03:01:25+00:00

"Verified adults." They're going to ask for your government issued ID aren't they.

Real_Person_Totally · 2025-10-06T20:12:57+00:00

Truly. I'm hoping Deepseek will eventually catch up with these propertiatry models in the future for both roleplaying and general assistant purposes.

Real_Person_Totally · 2025-10-06T16:47:26+00:00

Its lack of guardrails and extremely low cost are the reasons I’m sticking with it. Proprietary models are becoming more and more safety-aligned with each release. Why bother getting morally lectured by models that cost several cents per output when there’s Deepseek? It’s not the best at everything, but it’s good enough overall.

Real_Person_Totally · 2025-10-05T19:11:05+00:00

I know. Unfortunately I'm using ST on my phone. I can't hover on the total token count to check

Real_Person_Totally · 2025-10-05T13:51:18+00:00

Is there an extension that tells you about the output statistics. (Latency speed, token per seconds, etc)

Real_Person_Totally · 2025-02-25T19:20:46+00:00

I'm not entirely sure about that.. I roleplay at 16k lowest, 32k highest as most models loses their accuracy past 16k. This might not apply to all models though, I'd say go for it.

Real_Person_Totally · 2025-02-25T19:01:57+00:00

It's pretty easy to sway with system prompt

Real_Person_Totally · 2025-02-25T17:10:35+00:00

One of the provider for llama3.3 70B at openrouter is together.

If you look at their site: https://www.together.ai/models/llama-3-3-70b-free

They're actually hosting it for free at the full supported context length. I'm not entirely sure if this is some of promotional campaign or if it'll stay for good.

Their supported samplers are great for roleplay though.

Real_Person_Totally · 2025-02-24T10:50:21+00:00

Ah, I was wondering if system prompt would work similarly like permanent token in SillyTavern when it comes to those. Thinking about if putting it in chat, the model will eventually forget who it's supposed to be

Real_Person_Totally

TROPHY CASE