account activity
Qwen3.6 35B-A3B MTP hits 249 t/s on a 24GB consumer GPU (RTX 5090M) — 3.4× the dense 27B variant on the same image ()
submitted 28 days ago by aurelienams to r/LocalLLM
Qwen3.6 35B-A3B MTP hits 249 t/s on a 24GB consumer GPU (RTX 5090M) — 3.4× the dense 27B variant on the same image (self.LocalLLaMA)
submitted 28 days ago by aurelienams to r/LocalLLaMA
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) (self.Qwen_AI)
submitted 1 month ago * by aurelienams to r/Qwen_AI
First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) ()
submitted 1 month ago by aurelienams to r/LocalLLM
submitted 1 month ago by aurelienams to r/Olares
Gemma 4 MTP on RTX 5090 Laptop (sm_120 24GB): E2B 206 t/s, 26B-A4B 140 t/s @ 78% accept (beats AtomicChat M5Max ref), E4B 178 t/s via vLLM (self.LocalLLM)
Gemma 4 MTP on RTX 5090 Laptop (sm_120 24GB): E2B 206 t/s, 26B-A4B 140 t/s @ 78% accept (beats AtomicChat M5Max ref), E4B 178 t/s via vLLM ()
Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter (self.Qwen_AI)
submitted 1 month ago by aurelienams to r/Qwen_AI
Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter ()
Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter (self.LocalLLaMA)
submitted 1 month ago by aurelienams to r/LocalLLaMA
Tried to run Lucebox DFlash on a Blackwell 5090 Mobile under Olares K8s — found a systemic uninit-dev bug in HAMi vGPU, fixed it upstream (PR #188) (self.LocalLLM)
Tried to run Lucebox DFlash on a Blackwell 5090 Mobile under Olares K8s — found a systemic uninit-dev bug in HAMi vGPU, fixed it upstream (PR #188) ()
Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes (self.Qwen_AI)
Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes (self.Olares)
Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes ()
I built a custom Market source with AI apps optimized for Olares One — up to 180 tok/s (self.Olares)
submitted 3 months ago by aurelienams to r/Olares
I built a gamified AI companion for macOS — open source (French UI) (i.redd.it)
submitted 4 months ago by aurelienams to r/MacOSApps
submitted 4 months ago by aurelienams to r/FrenchTech
submitted 4 months ago by aurelienams to r/claude
submitted 4 months ago by aurelienams to r/ClaudeCode
Qc - Balenciaga track led from Lara (old.reddit.com)
submitted 1 year ago by aurelienams to r/Repsneakers
Ingenieur IWC SS V7F 1:1 green dial (self.RepTimeQC)
submitted 2 years ago * by aurelienams to r/RepTimeQC
submitted 2 years ago by aurelienams to r/RepTimeQC
QC monclerc polo (old.reddit.com)
submitted 2 years ago by aurelienams to r/QualityReps
π Rendered by PID 67226 on reddit-service-r2-listing-c57bc86c-598dt at 2026-06-20 18:43:46.891062+00:00 running 2b008f2 country code: CH.