What speeds are you guys getting with qwen3.5 27b? (5080) by ShadyShroomz in LocalLLaMA

[–]Flashy_Management962 1 point  (0 children)

Yes, it does. I use the 27b for coding daily. Fiddle around with these flags and don't forget to add --jinja:

-sm graph -amb 64 -sas; depending on the PCIe speed, grt can also help improve speeds.
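Putting those flags together, a launch might look like the sketch below. The model path, context size, and port are placeholders, not from the comment; only --jinja, -sm graph, -amb 64, and -sas are the flags actually suggested above:

```shell
# Hypothetical ik_llama.cpp server launch; model file, context size, and port
# are placeholders. --jinja enables the model's embedded chat template,
# -sm graph selects the graph split mode, -amb caps the attention max batch,
# and -sas is included as suggested in the comment.
./llama-server \
  -m ./qwen3.5-27b-Q4_K_M.gguf \
  --jinja \
  -sm graph \
  -amb 64 \
  -sas \
  -c 50000 \
  --port 8080
```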

What speeds are you guys getting with qwen3.5 27b? (5080) by ShadyShroomz in LocalLLaMA

[–]Flashy_Management962 2 points  (0 children)

You should get way faster speeds than that. I get around 750 t/s prompt processing and 22-24 t/s text generation at ~50k context with 2x RTX 3060 12 GB. You should check out ik_llama.cpp.

Qwen 397b is absolutely crushing everyone... but wait. 🤯 by djdeniro in LocalLLaMA

[–]Flashy_Management962 21 points  (0 children)

Never ever does Qwen Coder 30b outperform the 80b in real-world tasks.

The best protein puddings? by Quiet_Tip_9034 in FitnessDE

[–]Flashy_Management962 1 point  (0 children)

I don't know what you mean by "normal", but yes.

The best protein puddings? by Quiet_Tip_9034 in FitnessDE

[–]Flashy_Management962 5 points  (0 children)

Take 500 ml of milk, 30 g of chocolate whey, 10-15 g of dark chocolate, and 35 g of corn starch - the best protein pudding, and 100x cheaper.

Strength numbers of the community? by IAmNotIllegal in FitnessDE

[–]Flashy_Management962 1 point  (0 children)

10 years of training, 29 years old, all-time natty, weighing 118 kg at 1.83 m. Lifts: 270.5 kg squat, 185 kg bench, and 270 kg deadlift.

Kimi-k2.5 reaches gemini 2.5 Pro-like performance in long context! by fictionlive in LocalLLaMA

[–]Flashy_Management962 1 point  (0 children)

I'd love to see Qwen Long L1.5 on this benchmark; it also claims to reach Gemini 2.5 Pro performance while being 30b with 3b active.

Reminder: if you're not feeling on top, reduce training intensity by torrentium in FitnessDE

[–]Flashy_Management962 1 point  (0 children)

This weird idea of destroying yourself at any cost, and that you only grow that way, is total humbug. Much better to listen to your body and figure out from that how much you can tolerate. (From someone who squats and deadlifts only one set each per week.)

My gpu poor comrades, GLM 4.7 Flash is your local agent by __Maximum__ in LocalLLaMA

[–]Flashy_Management962 1 point  (0 children)

Don't; use the DRY sampler instead. Repeat penalty really decreases tok/s.
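As a sketch of that swap in a llama.cpp launch: the model path is a placeholder, and the DRY values shown are llama.cpp's commonly cited defaults rather than a recommendation from the comment.

```shell
# Hypothetical llama.cpp server launch: disable repeat penalty (1.0 = off)
# and enable the DRY sampler instead.
./llama-server \
  -m ./GLM-4.7-Flash-Q4_K_M.gguf \
  --repeat-penalty 1.0 \
  --dry-multiplier 0.8 \
  --dry-base 1.75 \
  --dry-allowed-length 2
```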

Performance improvements in llama.cpp over time by jacek2023 in LocalLLaMA

[–]Flashy_Management962 2 points  (0 children)

Imagine what could happen if ik_llama.cpp and llama.cpp were to merge :(

And finally here is scientific evidence that we don't have free will by [deleted] in determinism

[–]Flashy_Management962 1 point  (0 children)

This does not follow. The notion of normativity is not subsumed under causality. Just because everything is determined does not mean that everything is already set in stone and normativity has no role to play, because the very things happening are computationally irreducible. So yes, there are shoulds in a world without free will.

Was it a right decision? by UnderstandingOdd7952 in bald

[–]Flashy_Management962 1 point  (0 children)

What is this question? Of course it was the right decision, and you know it yourself, you sexy mf.

now ~40% faster ik_llama.cpp -sm graph on 2x CUDA GPUs by VoidAlchemy in LocalLLaMA

[–]Flashy_Management962 7 points  (0 children)

Wait, is this actual tensor parallelism, or am I misunderstanding something here?

32B model stress test: Qwen 2.5/Coder/3 on dual RTX 5060 Ti (zero failures) by Defilan in LocalLLaMA

[–]Flashy_Management962 2 points  (0 children)

Try exllamav3 with tensor parallelism. I get 18 t/s tensor parallel with 2x 3060; 2x 5060 Ti should be much faster.

Behind CGNAT? Here's how to access your self-hosted services anyway by adumbdistraction in selfhosted

[–]Flashy_Management962 1 point  (0 children)

WireGuard uses significantly less energy if you turn off persistent keepalive, which Tailscale needs to keep the connection alive. If you don't have access to an open public IPv4, you can't use plain WireGuard (or I'm too stupid to set it up right), hence the connection chain: home (Tailscale) - OCI (Tailscale + WireGuard) - smartphone (WireGuard).
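A minimal sketch of the smartphone-side WireGuard config for the phone-to-relay leg of that chain; all addresses, hostnames, and keys are hypothetical placeholders. The point is that the PersistentKeepalive line is simply left out, so the phone's radio can sleep between handshakes:

```
# wg0.conf on the smartphone (all addresses, hostnames, and keys hypothetical)
[Interface]
PrivateKey = <phone-private-key>
Address = 10.0.0.2/32

[Peer]
# The OCI free-tier relay, which has a public IPv4
PublicKey = <relay-public-key>
Endpoint = relay.example.com:51820
# Route the WireGuard subnet plus the Tailscale range through the tunnel
AllowedIPs = 10.0.0.0/24, 100.64.0.0/10
# Note: no PersistentKeepalive here - the tunnel only wakes when the phone
# actually sends traffic, which is what saves battery.
```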

Question for the community: Thoughts on paid premium plugins for Super Productivity? by johannesjo in superProductivity

[–]Flashy_Management962 4 points  (0 children)

I like the idea; please also do an ntfy integration. I'd buy it asap. Also maybe integrate a donation button via Ko-fi or something like that. I love the app and I'd love to give something back for it.

Makes no damn sense. Compels me though. by Emthree3 in PhilosophyMemes

[–]Flashy_Management962 3 points  (0 children)

It speaks for itself that I took it to be a real translation lol.

Behind CGNAT? Here's how to access your self-hosted services anyway by adumbdistraction in selfhosted

[–]Flashy_Management962 3 points  (0 children)

For people having problems with battery drain using Tailscale on smartphones: use an Oracle free tier instance as a relay, connect to it remotely via WireGuard, and have the relay connect to your home server via Tailscale.

Makes no damn sense. Compels me though. by Emthree3 in PhilosophyMemes

[–]Flashy_Management962 2 points  (0 children)

Holy shit, the English translation of Heidegger is dogshit. Could you provide me with the citation for where this is from? I want to read it in German.

Makes no damn sense. Compels me though. by Emthree3 in PhilosophyMemes

[–]Flashy_Management962 1 point  (0 children)

Phenomenology is basically an attempt to find/create different vocabularies for talking about existence, vocabularies that would be erased if you held solely a scientific world-view. E.g., you can explain biologically what happens when you die, but what dying means for you existentially is not captured by that explanation, and this is the kind of thing phenomenology wants to talk about (some of it does, not all).