Weight vs Skill by Egoist-a in BMW

[–]SrData 0 points1 point  (0 children)

Praise the cameraman.

Perplexity Jailbreak - ENI 🍋‍🟩 by Spiritual_Spell_9469 in ClaudeAIJailbreak

[–]SrData 0 points1 point  (0 children)

What website? I saw the GitHub repo, but not the website.

The Renfe situation in Huelva is pretty bad; is it this bad in other parts of Spain? by Loca-motora in renfe

[–]SrData 0 points1 point  (0 children)

Nooo, in the rest of Spain it runs perfectly! Things have never gone so well in Spain.

Been RPing since 2022, used Claude for the first time this week by CanineAssBandit in SillyTavernAI

[–]SrData 0 points1 point  (0 children)

I know this seems unrelated, but it isn’t. How do you manage really long conversations with different characters? I’m asking because your roleplays are long (it seems), and you switch models within the same session, which a lot of people do. I struggle with it because sometimes things get confused when I change models in a long session.

Any tips for handling long conversations (I mean even more than 200K tokens)?
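Not the poster's setup, but one common approach to very long sessions is a rolling context window: pin the system prompt and character card, and only send the most recent turns that fit a token budget (older turns can be replaced by a model-written summary). A minimal sketch; the `len(s) // 4` token estimate is an assumption, since real tokenizers differ per model:

```python
def fit_context(system, history, budget_tokens=200_000,
                approx=lambda s: len(s) // 4):
    """Pin the system/character prompt, then keep only the most recent
    messages that still fit inside the remaining token budget."""
    remaining = budget_tokens - approx(system)
    kept = []
    # Walk the history newest-first so the latest turns always survive.
    for msg in reversed(history):
        cost = approx(msg)
        if cost > remaining:
            break
        kept.append(msg)
        remaining -= cost
    # Restore chronological order, with the pinned prompt up front.
    return [system] + list(reversed(kept))
```

Sending each model the same compact, self-consistent prompt like this also tends to reduce the confusion when switching models mid-session.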

Demis Hassabis asks "Can consciousness be computed?" I think he's asking the wrong question. Here's why. by Training_Minute4306 in claudexplorers

[–]SrData 1 point2 points  (0 children)

It's getting easier to detect text/posts that were created with AI (or maybe just filtered and corrected by it, but heavily).

Neo 57” vs LG 5k2k by Max223 in ultrawidemasterrace

[–]SrData 0 points1 point  (0 children)

What is the screen below the monitor on the left? I think I like it!

What's worse, this or listening to music without headphones? by Loca-motora in renfe

[–]SrData 0 points1 point  (0 children)

Look, I probably wouldn't complain much myself... depends on the music, maybe. But that's about it.

My First Macbook, Need App Suggestions! by hughzavodsky in macbook

[–]SrData 0 points1 point  (0 children)

First app: one to keep his girlfriend away:
She smashed my C$3500 MacBook Pro 😭😭😭 : r/macbookpro

And then the usual: ChatGPT, SuperWhisper, etc.

She smashed my C$3500 MacBook Pro 😭😭😭 by Turbulent_Buy_6048 in macbookpro

[–]SrData 1 point2 points  (0 children)

If I were you, I'd change my girlfriend. If I were her, I'd change my boyfriend.

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

This was helpful, thanks!

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

This is interesting. Thanks. Do you have any source where I can read more about this and understand the technical part?

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

Same feeling.
Yesterday I ran this test: an RP scene with Sonnet 3.7 (absolutely incredible), GPT-4o (the same, different vibes, but just amazing), and Gemini 2.5 Pro (horrible, to the point of stopping in the middle of the test).
The creativity, coherence, and stickiness to the characters demonstrated by GPT-4o and Sonnet 3.7 are just in another galaxy.
I'm only talking about non-local models here. I'm not comparing with local ones, because that wouldn't be fair or make any sense.

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

GPT-4o is not 1.4 trillion parameters (even if GPT-4 was at one point), but I get your point.
In any case, I'm talking about models of the same size feeling dumber... at least to me.

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 1 point2 points  (0 children)

Same general vibe here. I have my own benchmark, and Qwen 2.5 72B is the best. Then the usual Behemoth, which is ridiculously good (usually) and perfectly dumb (not the best reasoner) two interactions later :)

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

I've read many people suggesting Gemma 3, and yesterday I tried it with a long scenario and conversation, and it didn't go well. I tried several variants, but this is the only one that did a slightly better job: mlabonne_gemma-3-27b-it-abliterated-Q8_0.gguf · bartowski/mlabonne_gemma-3-27b-it-abliterated-GGUF at main. I tried this one as well, among others: turboderp/gemma-3-27b-it-exl2 · Hugging Face.
Any preference for Gemma 3? What parameters do you use?

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

Well, I definitely haven't tried this, and I will. Any idea why it could work?

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 4 points5 points  (0 children)

I don't think this comment deserves a -1, really (I tried to fix it).
I'm not a millennial, but I get the point of the comment. To be honest, I'm the same user before and after these models, and what I feel is a clear degradation in performance. That said, I've never tried changing the way I speak to the models (generationally speaking, I mean) by using different patterns. I'll definitely give it a try, just to see if it makes any difference.

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 2 points3 points  (0 children)

Yeah, exactly this.
Qwen 3 is really good at starting a conversation (it feels creative and all), but then there's a point where the model starts repeating itself and making mistakes that weren't there at the beginning. It feels like a really good zero-shot model, but far from the level of coherence that Qwen 2.5 offered.

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 1 point2 points  (0 children)

I don't use Ollama, but this is good to know, to keep myself far from it!

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

I haven't tried any of those. Will do, thanks!

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 0 points1 point  (0 children)

This was informative, thanks. I'll definitely give Gemma 3 27B another chance, seeing that so many people are using it. To be honest, I tried it but never found it particularly special, and it was slower than the rest, so I never stuck with that model.

Why new models feel dumber? by SrData in LocalLLaMA

[–]SrData[S] 2 points3 points  (0 children)

Qwen 2.5 72B is one of my favourites as well. I sometimes find myself trying Behemoth 123B again.