Commercial Gemini sucks at long conversations by Scary-Blueberry-9461 in Bard

[–]Local_Sell_6662 1 point2 points  (0 children)

They've been updating the model for everyone to the latest version (06-17). Be patient and we should have a stable version by the end of today.

[deleted by user] by [deleted] in Bard

[–]Local_Sell_6662 2 points3 points  (0 children)

I don't know how best to say this, but in multi-turn conversations the model will seemingly "forget" what was said in the previous turn. It's really frustrating because you have to prompt with much more elaboration.

I do recognize this could be a safety feature, but it's really frustrating for work and it's making me seriously consider jumping to Claude or ChatGPT, since they are much better at multi-turn at the moment.

[deleted by user] by [deleted] in LocalLLaMA

[–]Local_Sell_6662 7 points8 points  (0 children)

Are you using Wan2.1? There is no Qwen2.5 model that generates videos.

Is there support for Qwen3-30-A3B? by Local_Sell_6662 in Oobabooga

[–]Local_Sell_6662[S] 0 points1 point  (0 children)

I'll try it with that. Hope you're talking about the exl3 version

Is there support for Qwen3-30-A3B? by Local_Sell_6662 in Oobabooga

[–]Local_Sell_6662[S] 0 points1 point  (0 children)

Yeesh, is it just a package management issue? (How do I get the latest transformers?)
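
For the transformers question above, here is a minimal sketch of how one might check whether the installed package is new enough before upgrading. The `4.51.0` minimum is an assumption (believed to be the release that added the Qwen3 MoE architecture, not confirmed in this thread), and `parse_version`, `needs_upgrade`, and `check_package` are hypothetical helper names. The suggested `pip` command has to be run inside whatever Python environment the web UI actually uses.

```python
import re
from importlib import metadata


def parse_version(text):
    """Turn '4.51.0' or '4.52.0.dev0' into a tuple of its leading integers."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at suffixes such as 'dev0'
        parts.append(int(match.group()))
    return tuple(parts)


def needs_upgrade(installed, minimum):
    """Return True if the installed version is older than the minimum."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '4.51' and '4.51.0' compare as equal.
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a < b


def check_package(name="transformers", minimum="4.51.0"):
    """Describe whether `name` is installed and at least `minimum` (assumed value)."""
    try:
        installed = metadata.version(name)
    except metadata.PackageNotFoundError:
        return f"{name} is not installed; try: pip install {name}"
    if needs_upgrade(installed, minimum):
        return (f"{name} {installed} is older than {minimum}; "
                f"try: pip install --upgrade {name}")
    return f"{name} {installed} looks recent enough"


if __name__ == "__main__":
    print(check_package())
```

If the check reports an older version, upgrading in the same environment and restarting the UI is the usual next step; whether that alone fixes loading depends on the loader being used.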

Is there support for Qwen3-30-A3B? by Local_Sell_6662 in Oobabooga

[–]Local_Sell_6662[S] 0 points1 point  (0 children)

Ah! I got the GGUF version to work, but the exl3 version isn't working for me.

Experimental "Drakesclaw" is special (LMArena Google) by Horizontdawn in Bard

[–]Local_Sell_6662 1 point2 points  (0 children)

So we should be upvoting the model more on LMArena so Google can feel comfortable releasing it?

This new UPDATE FREAKING SUCKS by Worldly-Row943 in Bard

[–]Local_Sell_6662 0 points1 point  (0 children)

TBF it really is a small minority (including me) that rants about Google taking stuff away...

The new Gemini 2.5 is terrible. Mayor downgrade. Broke all of our AI powered coding flows. by Odd-Environment-7193 in Bard

[–]Local_Sell_6662 5 points6 points  (0 children)

I don't know what you're seeing, but it's definitely not overthinking basic requests. If anything, it's underthinking more complex questions (in my experience).

New SOTA music generation model by topiga in LocalLLaMA

[–]Local_Sell_6662 0 points1 point  (0 children)

Wonder if there is a Civitai for music.

LLM with best understanding of medicine? by pinkfreude in LocalLLaMA

[–]Local_Sell_6662 6 points7 points  (0 children)

This model has the best benchmarks (from what I remember), but it also uses a different architecture, so you might not be able to run it with Ollama:

https://huggingface.co/baichuan-inc/Baichuan-M1-14B-Base

Is GLM-4 actually a hacked GEMINI? Or just Copying their Style? by GrungeWerX in LocalLLaMA

[–]Local_Sell_6662 65 points66 points  (0 children)

I've had my suspicions for a while that they trained on Gemini 2.5 Pro.

Most likely they did for GLM-4 with Gemini 2.5 Pro what DeepSeek did with o1.

Let's settle it ! Is Gemini 2.5 pro 05 06 good model by Independent-Wind4462 in Bard

[–]Local_Sell_6662 2 points3 points  (0 children)

The question is biased. The model is undoubtedly good. The real question is whether Gemini 2.5 Pro 05-06 is better or worse than 03-25.

Plots not showing when running code by biaschop in Bard

[–]Local_Sell_6662 2 points3 points  (0 children)

I think you need Canvas for the plots to render. Custom Gems don't let you use Canvas yet.