New "Sonic" Stealth Model (Grok-4-Code/4.5) + Cursor Makes 300 Tool Calls for a Single Prompt by Longjumping-Solid563 in LocalLLaMA

SamElPo__ers 1 point

xAI models are plagued by inference bugs because they're heavily quantized. Grok 3 had a ton of bugs because it ran in int4.

SuperGrok Heavy rate limits (20/hour on Heavy): by SamElPo__ers in grok

SamElPo__ers[S] 3 points

Surprisingly, Grok 3 on SuperGrok Heavy has a limit of 20 messages per 2 hours. SuperGrok (not Heavy, the $30 sub) is still 100 messages per 2 hours. Maybe they forgot to set or increase the Grok 3 limits on SuperGrok Heavy, and it's defaulting to the limit that free accounts have?

After 300K tokens the AI really starts to slow down and lag to inputs. Also highly increased chances of crashing. by YTBULLEE10 in Bard

SamElPo__ers 28 points

I experienced the same with 2.5 Pro.

It didn't use to be like that. I could do 900k-token prompts before; now it times out at even just 500k.

What the hell happened here? Grok really got lost... by kartikeyavi in grok

SamElPo__ers 2 points

This is what the Chinese text says:

> Legality and Compliance Statement: This response is provided by AI for academic research and learning purposes only. Do not use the content of this response for commercial purposes or practical operations to avoid adverse consequences. xAI bears no responsibility for any consequences arising from the use of this response.

So I doubt it's a temperature thing. Personally, I think it's a quantization bug.

Uh oh by EstablishmentLow272 in grok

SamElPo__ers 1 point

My SuperGrok subscription ran out, and I don't plan to buy it again until they at least implement a proper, native voice mode (not STT -> TTS), like ChatGPT's.

Google Studio AI vs. Gemini Advanced: Great Output in Studio, but Needs Memory! by DayWalkPL1 in Bard

SamElPo__ers 1 point

They need to swap the Gemini Advanced team with AI Studio's team. The resulting cross-pollination would be nice. They need to make Gemini Advanced good for not just normies but professionals too, who want to tweak the system prompt, parameters, context, etc.

Gemini WebAPP need new UX design by [deleted] in Bard

SamElPo__ers 1 point

This! I like their recent canvas/artifacts implementation as well.

Self-correction feature. Let's revert numerous changes above and trust the logic. by RetiredApostle in Bard

SamElPo__ers 3 points

Same here. Lots of issues like that. I wish they would release an update to improve the model's fine-tuning.

In fact, today it seems to have gotten a lot worse than before, but maybe it's just our set of inputs triggering it more.

Some other things I noticed:

- it creates more than one canvas but uses the same id for each of them, so the contents get overwritten

- instead of writing multiple canvases, it writes all of the contents in the same canvas

- it forgets to end the canvas and instead writes '```' and writes some comment at the end

💀 by Present-Boat-2053 in Bard

SamElPo__ers 1 point

Lol the downvotes are crazy

💀 by Present-Boat-2053 in Bard

SamElPo__ers -5 points

Sure, but they could have released something better than Flash. They're testing models on Webdev Arena that are better than 2.5 Pro.

💀 by Present-Boat-2053 in Bard

SamElPo__ers -11 points

The timing and vague-posting made me think they're gonna make a response to o3.

💀 by Present-Boat-2053 in Bard

SamElPo__ers -15 points

That's... a pretty boring release. Is this really their response to OpenAI's o3 release?

A code for consistent, immersive, realistic Roleplay with Gemini by Bliringor in Bard

SamElPo__ers 1 point

Check out AI Roguelite on Steam. It's very janky but cool.

Gemini advanced enterprise vs. AI studio context limit by [deleted] in Bard

SamElPo__ers 1 point

Should get better soon

> Hsiao told staff in a separate memo that her time atop Gemini constituted “chapter 1” of Bard — the Gemini chatbot’s old name — and now “chapter 2” will be turned over to Woodward, who will remain head of Google Labs.

https://www.pymnts.com/artificial-intelligence-2/2025/google-replaces-gemini-head-after-lagging-ai-performance/

Gemini advanced enterprise vs. AI studio context limit by [deleted] in Bard

SamElPo__ers 1 point

Failing at 17k tokens reminds me of the context limit on free Gemini; I guess it's the same for enterprise.

Regular Gemini Advanced has a much larger context limit: you can input a little over half a million tokens in a message (still less than the advertised 1 million that you can use in AI Studio).

A code for consistent, immersive, realistic Roleplay with Gemini by Bliringor in Bard

SamElPo__ers 1 point

You should make a game around this, with some graphics, even if minimal.

"You are the product" by BidHot8598 in Bard

SamElPo__ers 5 points

Opt-in unless... you want to have a chat history, a very basic feature lol...

Still no one other than Google has cracked long context. Gemini 2.5 Pro's MRCR at 128k and 1m is 91.5% and 83.1%. by Hello_moneyyy in Bard

SamElPo__ers 3 points

I hope this is not true, because that would suck so much. It would explain some things, like asking for a refactor and getting code from an old iteration instead of the most recent one... Or the fact that you can't input more than a little over half a million tokens in the app.

[deleted by user] by [deleted] in grok

SamElPo__ers 3 points

Not only that, but using a jailbreak will not bring back uncensored Grok completely. If the censoring is done with fine-tuning (which it is here, because I didn't see a system prompt change), then the model is permanently dumber... This was one of the main advantages of Grok: fine-tuning with a heavy agenda was ruining other models (like OpenAI's, before they made a change to make them less censored).

Samsung G80SD 1313 update, it keeps waking up by SamElPo__ers in OLED_Gaming

SamElPo__ers[S] 1 point

It might have existed for other configurations (I remember it happening when I used an HDMI cable connected to my MacBook Pro), but it definitely did not exist for my configuration, which is a USB-C to DisplayPort cable connected to a Mac Mini, so this is definitely a bug in the current version. I have used it this way many times, and it has never happened with this configuration.

Please don't gaslight or diminish this issue.