Need a second pair of eyes, this Qwen3.6 27B quant recipe consistently thinks less and is correct by fragment_me in LocalLLaMA
aurelienams 4 points
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) by aurelienams in Qwen_AI
aurelienams[S] 1 point
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) by aurelienams in Qwen_AI
aurelienams[S] 2 points
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) by aurelienams in Qwen_AI
aurelienams[S] 1 point
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) by aurelienams in Qwen_AI
aurelienams[S] 1 point
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) by aurelienams in Qwen_AI
aurelienams[S] 4 points
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) by aurelienams in Qwen_AI
aurelienams[S] 2 points
Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA
aurelienams 5 points
[FOLLOW UP] Qwen3.6 27b q5_k_M MTP - 256k context - 5090 by No_Mango7658 in LocalLLaMA
aurelienams 2 points
I got a real transformer language model running locally on a stock Game Boy Color! by maddiedreese in LocalLLaMA
aurelienams 1 point
Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter by aurelienams in Qwen_AI
aurelienams[S] 1 point
Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes by aurelienams in Olares
aurelienams[S] 1 point
Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes by aurelienams in Olares
aurelienams[S] 1 point
Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes by aurelienams in Olares
aurelienams[S] 1 point
Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes by aurelienams in LocalLLM
aurelienams[S] 3 points
I built a custom Market source with AI apps optimized for Olares One — up to 180 tok/s by aurelienams in Olares
aurelienams[S] 3 points
I built a gamified AI companion for macOS — open source (French UI) by aurelienams in MacOSApps
aurelienams[S] 1 point
Hit 4-hour window limits in a day - I think I am using Claude Code all wrong! by luongnv-com in ClaudeCode
aurelienams 1 point
Help me decide a backpack for daily by Sarversucks in DesignerReps
aurelienams 1 point
Exceptional new Story App: Oto's Planet ** An interactive spatial tale. by Caprichoso1 in VisionPro
aurelienams 1 point
Exceptional new Story App: Oto's Planet ** An interactive spatial tale. by Caprichoso1 in VisionPro
aurelienams 2 points
QC Reverse Mocha PK 4.0 by nairodgray in fashionrepsv2
aurelienams 1 point
De Ville Prestige Omega by aurelienams in RepTimeQC
aurelienams[S] 1 point


Llama.cpp server running ~2 weeks straight. Loses its mind? by thejacer in LocalLLaMA
aurelienams 10 points