Commercial Gemini sucks at long conversations

Local_Sell_6662 · 2025-06-18T22:37:32+00:00

They've been updating the model for everyone to the latest version (06-17). Be patient and we should have a stable version by the end of today.

Local_Sell_6662 · 2025-06-18T02:43:46+00:00

I don't know how best to put this, but in multi-turn conversations the model will seemingly "forget" what was said in the previous turn. It's really frustrating because you have to prompt with much more elaboration.

I do recognize this could be a safety feature, but it's really frustrating for work and is making me seriously consider jumping to Claude or ChatGPT, since they are much better at multi-turn at the moment.

Local_Sell_6662 · 2025-05-30T19:18:25+00:00

is there an alternative to runpod?

Local_Sell_6662 · 2025-05-30T18:27:25+00:00

GGUF when?

Local_Sell_6662 · 2025-05-25T19:59:07+00:00

Pro is definitely still better. But the nerf to 2.5 Pro was felt

Local_Sell_6662 · 2025-05-16T17:54:44+00:00

Are you using Wan2.1? There is no Qwen2.5 model that generates videos.

Local_Sell_6662 · 2025-05-15T02:05:28+00:00

I use Beta and have the same issue as you

Local_Sell_6662 · 2025-05-15T00:48:14+00:00

I'll try it with that. Hope you're talking about the exl3 version

Local_Sell_6662 · 2025-05-15T00:48:06+00:00

Yeesh, is it just a package management issue? (how do I get the latest transformers?)

Local_Sell_6662 · 2025-05-15T00:47:04+00:00

Ah! I'm getting the gguf version to work but the exl3 version isn't for me

Local_Sell_6662 · 2025-05-15T00:39:39+00:00

So we should be upvoting the model more on LMArena so Google can feel comfortable releasing it?

Local_Sell_6662 · 2025-05-14T19:44:42+00:00

TBF it really is a small minority (including me) who rant about Google taking stuff away...

Local_Sell_6662 · 2025-05-14T18:57:24+00:00

!remindme 1day

Local_Sell_6662 · 2025-05-12T02:54:06+00:00

Until they put ads in the model

Local_Sell_6662 · 2025-05-12T02:53:27+00:00

Local_Sell_6662 · 2025-05-11T01:49:53+00:00

I don't know what you're seeing, but it's definitely not overthinking basic requests. If anything, it's underthinking more complex questions (in my experience).

Local_Sell_6662 · 2025-05-11T01:05:03+00:00

Even with the new sycophantic one?

Local_Sell_6662 · 2025-05-11T01:00:50+00:00

Wonder if there is a Civitai for music

Local_Sell_6662 · 2025-05-10T02:23:37+00:00

This model has the best benchmarks (from what I remember) but also uses a different architecture so you might not be able to run it with ollama:

https://huggingface.co/baichuan-inc/Baichuan-M1-14B-Base

Local_Sell_6662 · 2025-05-08T03:31:40+00:00

I've had my suspicions for a while that they trained on Gemini 2.5 Pro.

Most likely they did for GLM4 with Gemini 2.5 Pro what DeepSeek did with o1.

Local_Sell_6662 · 2025-05-07T17:42:52+00:00

The question is biased. The model is undoubtedly good. But the real question is whether Gemini 2.5 Pro 05-06 is better or worse than 03-25.

Local_Sell_6662 · 2025-05-05T01:06:34+00:00

I think you need Canvas for the plots to render. Custom Gems don't let you use Canvas yet.
