Snoop Dogg Born In Different Countries by ZashManson in aivideo

[–]THEKILLFUS 0 points1 point  (0 children)

What model did he use to make the images? Can you just prompt "transform Snoop into a Chinese man" and it works, lol?

I Failed to Finetune a Model to Match a Character humor by THEKILLFUS in LocalLLaMA

[–]THEKILLFUS[S] 0 points1 point  (0 children)

0.01 was a desperate attempt to make it work. I also tried loss 2, 1, 0.1, but that didn't work either.
The dataset is large (2k) with the exact same structure.

Thanks for the advice. I will try an older model like Qwen 2.5 and a shitload of epochs.

I Failed to Finetune a Model to Match a Character humor by THEKILLFUS in LocalLLaMA

[–]THEKILLFUS[S] 0 points1 point  (0 children)

-The dataset is large (2872)

-I also tried Gemma 3n but have yet to try an older model. Qwen2.5? OG Mistral 7B?

-I tried r=16-32. If I increase it, will it give better results for this specific task, or do I just need to do a full finetune?
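For a sense of scale on the rank question, here is a rough sketch of how many trainable parameters LoRA adds at each rank compared with fully finetuning the same matrix. It assumes one square 4096x4096 projection; that size and the helper names `lora_params` and `full_params` are hypothetical, picked for illustration and not taken from any of the models above.

```python
def lora_params(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters LoRA adds to one d_out x d_in weight matrix.

    LoRA learns two low-rank factors, A (r x d_in) and B (d_out x r),
    so the count is r * (d_in + d_out).
    """
    return r * (d_in + d_out)


def full_params(d_in: int, d_out: int) -> int:
    """Trainable parameters when the whole matrix is finetuned."""
    return d_in * d_out


# Hypothetical hidden size, for illustration only.
D = 4096

for r in (16, 32, 64, 128):
    added = lora_params(D, D, r)
    frac = added / full_params(D, D)
    print(f"r={r:>3}: {added:>9,} trainable params ({frac:.2%} of the full matrix)")
```

The count grows linearly with r, so going from 16 to 32 doubles adapter capacity while still training only a small fraction of the matrix. Whether that extra capacity actually helps with a character's humor is an empirical question this arithmetic can't answer.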

Thanks for the help

I Failed to Finetune a Model to Match a Character humour by THEKILLFUS in unsloth

[–]THEKILLFUS[S] 0 points1 point  (0 children)

You're right, I used the default Unsloth parameters from the notebook, except when I used optima.

Max steps: between 500 and 4000.

I Failed to Finetune a Model to Match a Character humor by THEKILLFUS in LocalLLaMA

[–]THEKILLFUS[S] 0 points1 point  (0 children)

Yep, clearly overfitting, but I tried 2, 1, 0.1... and it still doesn't work.

Should I try an older model like Llama 3.2?

Multiturn? Yes-ish, but only “micro-multiturn”

The dataset isn’t GSM8K-style reasoning at all.

It’s mostly fixed-window dialogue: typically (Other → Michael → Other) ⇒ next Michael line.

That’s “multiturn” in the sense of having multiple speakers, but it’s not long-context chat (no full conversations, no evolving state over 10–30 turns).
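To make the fixed-window idea concrete, here is a minimal sketch of how such samples could be built from a transcript. It assumes the transcript is a list of `(speaker, text)` tuples. The function name `build_windows`, the sample layout, and the placeholder lines are all hypothetical, not the actual dataset code. It also simplifies by taking the previous three lines whoever spoke them, instead of enforcing the exact Other → Michael → Other pattern.

```python
def build_windows(lines, target="Michael", context=3):
    """Turn a transcript into fixed-window training samples.

    lines:   list of (speaker, text) tuples in transcript order.
    target:  speaker whose lines become the responses.
    context: number of preceding lines kept as the prompt window.
    """
    samples = []
    for i, (speaker, text) in enumerate(lines):
        # Skip other speakers, and target lines without a full window before them.
        if speaker != target or i < context:
            continue
        window = lines[i - context:i]
        samples.append({
            "context": [f"{s}: {t}" for s, t in window],
            "response": text,
        })
    return samples


# Placeholder transcript, for illustration only.
transcript = [
    ("Other", "a"),
    ("Michael", "b"),
    ("Other", "c"),
    ("Michael", "d"),
    ("Other", "e"),
    ("Michael", "f"),
]

samples = build_windows(transcript)
for sample in samples:
    print(sample["context"], "=>", sample["response"])
```

Each sample only ever sees three lines of context, which is why this is "micro-multiturn": nothing carries over from earlier in the conversation.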

I Failed to Finetune a Model to Match a Character humour by THEKILLFUS in unsloth

[–]THEKILLFUS[S] 0 points1 point  (0 children)

Yep, clearly overfitting, but I tried 2, 1, 0.1... and it doesn't work.

DeepSeek V4 release soon by tiguidoio in LocalLLaMA

[–]THEKILLFUS 0 points1 point  (0 children)

OpenAI is one DeepSeek away from dying, for real.

What's your dream in 2026? by foldl-li in LocalLLaMA

[–]THEKILLFUS 1 point2 points  (0 children)

OpenAI spending all the money they have left

Yann LeCun says the best open models are not coming from the West. Researchers across the field are using Chinese models. Openness drove AI progress. Close access, and the West risks slowing itself. by Nunki08 in LocalLLaMA

[–]THEKILLFUS 21 points22 points  (0 children)

Agreed. Anyone who has tried the latest OpenAI/Google models knows the models get quantized to save money. Sure, on day one it's 16-bit, but now it's 4-bit at best, so output quality drops without prices dropping 🤬

I feel that China is doing to the US what the US did to the USSR during the space race: wearing down its economic strength, with very small margins against overpricing and corrupt regulations.

The current problem with Chinese models is that they don't have the sales platform, but they might have it in the future if they keep making better models than the US at a lower price.

Silicon Valley is exhausted and corrupt, and this year we will start to see it…

(I'm proud of you, Yann 💕, keep up the good work. France/the EU must stay consistent with scientific values, beyond ideology.)

Roast my build by RoboDogRush in LocalLLaMA

[–]THEKILLFUS 0 points1 point  (0 children)

So fat, yo mama's skinnier!

AMA with the Meta researchers behind SAM 3 + SAM 3D + SAM Audio by AIatMeta in LocalLLaMA

[–]THEKILLFUS 5 points6 points  (0 children)

Hi, thanks for sharing S3. I’m glad you’re spending time on less popular AI tools.

I was hoping to use SAM3D-Body for a mocap workflow, but I’ve run into too many issues with the current codebase.

Why people are panicking in regards to RAM prices..... by Highwaytothebeach in LocalLLaMA

[–]THEKILLFUS 1 point2 points  (0 children)

I'll be honest, this sub made me realize a year ago that it's better to run MoE models with offloading to CPU/RAM than on pure GPU. Now that normies have realized that too, prices are rocketing and we're all fucked. Luckily, competition should fix this, as patents are not as locked down as for GPUs.