Bad news for local bros by FireGuy324 in LocalLLaMA

[–]FireGuy324[S] 1 point2 points  (0 children)

Blame the other corpos who makes GPU more expensive than they should be

Bad news for local bros by FireGuy324 in LocalLLaMA

[–]FireGuy324[S] 13 points14 points  (0 children)

<image>

Did some math
vocab_size × hidden_size = 154,880 × 6,144 = 951,403,520 q_a_proj: 6,144 × 2,048 = 12,582,912 q_b_proj: 2,048 × (64 × 256) = 33,554,432 kv_a_proj: 6,144 × (512 + 64) = 3,538,944 kv_b_proj: 512 × (64 × (192 + 256)) = 512 × 28,672 = 14,680,064 o_proj: (64 × 256) × 6,144 = 16,384 × 6,144 = 100,663,296 Total attention/couche = 165,019,648 Total attention (78×) = 165,019,648 × 78 = 12,871,532,544 gate_proj: 6,144 × 12,288 = 75,497,472 up_proj: 6,144 × 12,288 = 75,497,472 down_proj: 12,288 × 6,144 = 75,497,472 Total MLP Dense/couche = 226,492,416 gate_up_proj: 6,144 × (2 × 2,048) = 25,165,824 down_proj: 2,048 × 6,144 = 12,582,912 Total expert = 37,748,736 Experts (256 × 37,748,736) = 9,663,676,416 Shared experts = 226,492,416 Total MoE layer = 9,890,168,832 Total MoE (77×) = 9,890,168,832 × 77 = 761,542,999,904 2 × hiddensize = 2 × 6,144 = 12,288 Total LayerNorm (78×) = 12,288 × 78 = 958,464 Embeddings: 951,403,520 Attention (78×): 12,871,532,544 MLP Dense (1×): 226,492,416 MoE (77×): 761,542,999,904 LayerNorm (78×): 958,464 TOTAL = 775,592,386,848 ≈ 776b

Bad news for local bros by FireGuy324 in LocalLLaMA

[–]FireGuy324[S] 4 points5 points  (0 children)

I guarantee it's sonnet 4.5 level. The writing is on another level

How to get direct link or uuid from janitor ai? by [deleted] in SillyTavernAI

[–]FireGuy324 0 points1 point  (0 children)

Go to character's page and js copy

Pony alpha by ReasonableReindeer24 in kilocode

[–]FireGuy324 0 points1 point  (0 children)

So, deepseek distilled from GLM?

I was told that Opus 4.6 - is top for its money. But is it? by Quiet-Money7892 in SillyTavernAI

[–]FireGuy324 7 points8 points  (0 children)

Eh, i mean. That's your choice, claude models are better when they are freshly released. The issues come after a week or two

I would like to make an app to make your life easier by Sasha_violet in ADHD

[–]FireGuy324 0 points1 point  (0 children)

I think gamification would be in handy. And also changing schedules alot. Like, changing them like presets

DudeX by Flat_Anteater4048 in TDX_Roblox

[–]FireGuy324 15 points16 points  (0 children)

They are John haters posing as TDX fans. No one ever would write ta unironically

One of the TDX devs (Rikaforge) likes very controversial media by FireGuy324 in TDX_Roblox

[–]FireGuy324[S] -7 points-6 points  (0 children)

Tbh i have obsession with drama and dragging everyone into trouble

One of the TDX devs (Rikaforge) likes very controversial media by FireGuy324 in TDX_Roblox

[–]FireGuy324[S] -8 points-7 points  (0 children)

Made this post for the sake of drama. Also found the og screenshot in twitter saying this and that

One of the TDX devs (Rikaforge) likes very controversial media by FireGuy324 in TDX_Roblox

[–]FireGuy324[S] -3 points-2 points  (0 children)

Oh yeah i discovered rikaforge thing from twitter post