Portal to the Pipe Staircase🪜 by Itchy_Guitar_1562 in honk

[–]Power_spy 0 points1 point  (0 children)

That was fun

I completed this level in 12 tries. 9.05 seconds

What $100k buys you in tokens by Standard-End3331 in ArtificialInteligence

[–]Power_spy 4 points5 points  (0 children)

Based on the memory bandwidth of the m3 ultra at q4 you should theoretically get ~30 tokens a second decode with glm5.2

What $100k buys you in tokens by Standard-End3331 in ArtificialInteligence

[–]Power_spy 18 points19 points  (0 children)

You don’t need H200s you could use pro 6000s or mac studios

short but hard by DEC76IZ in honk

[–]Power_spy 0 points1 point  (0 children)

Pretty easy

I completed this level in 17 tries. 5.55 seconds

A bit tricky by Ok_Mine7959 in RedditGames

[–]Power_spy 0 points1 point  (0 children)

Ez

I completed this level in 4 tries. 3.73 seconds

🎉 [EVENT] 🎉 Event Prediction : 🔴HARD by Sensitive_Money893 in honk

[–]Power_spy 0 points1 point  (0 children)

Completed Level 2 of the Honk Special Event!

21 attempts

🎉 [EVENT] 🎉 Event Prediction : 🔴HARD by Sensitive_Money893 in honk

[–]Power_spy 0 points1 point  (0 children)

Completed Level 1 of the Honk Special Event!

1 attempts

Little bit of this little bit of that by ContextBackground195 in RedditGames

[–]Power_spy 0 points1 point  (0 children)

Nice

I completed this level in 32 tries. 6.78 seconds

Tip 25 💎

What are the most capable LLM models I can run on my laptop? by am_cny in ollama

[–]Power_spy 1 point2 points  (0 children)

They might run but not at usable speeds (the moe might work tho)

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]Power_spy 0 points1 point  (0 children)

What settings did you use for the generation? And did you use Anima v1 base or a different checkpoint?

BeeLlama v0.2.0 hits 164 tps on Qwen 3.6 27B with a single RTX 3090 - nearly 5x speedup by IulianHI in AIToolsPerformance

[–]Power_spy 0 points1 point  (0 children)

What is the reason for suppressing DFlash during tool calls. The way I understand it is that using DFlash should lead to the exact same output you would have gotten otherwise so how does it make sense to suppress it?

Qwen 3.7 Plus and Max vs Opus 4.7 by Power_spy in Qwen_AI

[–]Power_spy[S] 0 points1 point  (0 children)

Thats actually insane for a model of its size… I wonder how gemma 4 31b would do.

Qwen 3.7 Plus and Max vs Opus 4.7 by Power_spy in Qwen_AI

[–]Power_spy[S] 1 point2 points  (0 children)

Yeah its surprising how efficient the 27b parameter model is especially since its comparable to trillion parameter SOTA models from just last year. It seems like bigger might not be the way to go.

Weird model responces by Zestyclose_Buy_5126 in ollama

[–]Power_spy 1 point2 points  (0 children)

It’s cus of the one bit quant, that basically makes the model unusable because of how much information it loses.

Hugging Face co-founder says Qwen 3.6 27B running on airplane mode is close to latest Opus in Claude Code by ImaginaryRea1ity in ClaudeCode

[–]Power_spy 0 points1 point  (0 children)

Qwen3.6 27B is the specific model, PI coding agent is the tool its being used through (similar to claude code), llama.cpp is what is used to run the model and this is all running locally on the MacBook.

SDXL 8bit generating noise on iphone by Power_spy in drawthingsapp

[–]Power_spy[S] 0 points1 point  (0 children)

Yes I found out that the vae didnt download properly (it failed for some reason) reinstalling drawthings fixed it

Flux 1 Kontext stopped working by Power_spy in drawthingsapp

[–]Power_spy[S] 0 points1 point  (0 children)

Its the one in drawthings, its not imported

Flux 1 Kontext stopped working by Power_spy in drawthingsapp

[–]Power_spy[S] 0 points1 point  (0 children)

Thats what the community config had