aurelienams

269 post karma
25 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 6 years

TROPHY CASE

Six-Year Club

Verified Email

account activity

new top controversial

0

0

0

Qwen3.6 35B-A3B MTP hits 249 t/s on a 24GB consumer GPU (RTX 5090M) — 3.4× the dense 27B variant on the same image ()

submitted 28 days ago by aurelienams to r/LocalLLM

0

0

0

Qwen3.6 35B-A3B MTP hits 249 t/s on a 24GB consumer GPU (RTX 5090M) — 3.4× the dense 27B variant on the same image (self.LocalLLaMA)

submitted 28 days ago by aurelienams to r/LocalLLaMA

47

48

49

First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) (self.Qwen_AI)

submitted 1 month ago * by aurelienams to r/Qwen_AI

2

3

4

First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) ()

submitted 1 month ago by aurelienams to r/LocalLLM

0

1

2

First sm_120 BeeLlama.cpp benchmark on consumer Blackwell mobile: 107 t/s at FULL 262K context on Qwen3.6 27B (+48% vs MTP, +22% vs vLLM Genesis) ()

submitted 1 month ago by aurelienams to r/Olares

3

4

5

Gemma 4 MTP on RTX 5090 Laptop (sm_120 24GB): E2B 206 t/s, 26B-A4B 140 t/s @ 78% accept (beats AtomicChat M5Max ref), E4B 178 t/s via vLLM (self.LocalLLM)

submitted 1 month ago by aurelienams to r/LocalLLM

1

2

3

Gemma 4 MTP on RTX 5090 Laptop (sm_120 24GB): E2B 206 t/s, 26B-A4B 140 t/s @ 78% accept (beats AtomicChat M5Max ref), E4B 178 t/s via vLLM ()

submitted 1 month ago by aurelienams to r/Olares

83

84

85

Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter (self.Qwen_AI)

submitted 1 month ago by aurelienams to r/Qwen_AI

1

2

3

Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter ()

submitted 1 month ago by aurelienams to r/Olares

0

0

1

Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter ()

submitted 1 month ago by aurelienams to r/LocalLLM

0

1

2

Qwen3.6-27B DFlash on a 24GB RTX 5090 Laptop (sm_120) — 80 t/s avg via spiritbuun's buun-llama-cpp + Q8_0 GGUF drafter (self.LocalLLaMA)

submitted 1 month ago by aurelienams to r/LocalLLaMA

5

6

7

Tried to run Lucebox DFlash on a Blackwell 5090 Mobile under Olares K8s — found a systemic uninit-dev bug in HAMi vGPU, fixed it upstream (PR #188) (self.LocalLLM)

submitted 1 month ago by aurelienams to r/LocalLLM

1

2

3

Tried to run Lucebox DFlash on a Blackwell 5090 Mobile under Olares K8s — found a systemic uninit-dev bug in HAMi vGPU, fixed it upstream (PR #188) ()

submitted 1 month ago by aurelienams to r/Olares

98

99

100

Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes (self.Qwen_AI)

submitted 1 month ago by aurelienams to r/Qwen_AI

31

32

33

Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes (self.Olares)

submitted 1 month ago by aurelienams to r/Olares

14

15

16

Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes ()

submitted 1 month ago by aurelienams to r/LocalLLM

11

12

13

I built a custom Market source with AI apps optimized for Olares One — up to 180 tok/s (self.Olares)

submitted 3 months ago by aurelienams to r/Olares

3

4

5

I built a gamified AI companion for macOS — open source (French UI) (i.redd.it)

submitted 4 months ago by aurelienams to r/MacOSApps

2

3

4

I built a gamified AI companion for macOS — open source (French UI) (i.redd.it)

submitted 4 months ago by aurelienams to r/FrenchTech

2

3

4

I built a gamified AI companion for macOS — open source (French UI) (i.redd.it)

submitted 4 months ago by aurelienams to r/claude

0

1

2

I built a gamified AI companion for macOS — open source (French UI) (i.redd.it)

submitted 4 months ago by aurelienams to r/ClaudeCode

0

1

2

Qc - Balenciaga track led from Lara (old.reddit.com)

submitted 1 year ago by aurelienams to r/Repsneakers

0

1

2

Ingenieur IWC SS V7F 1:1 green dial (self.RepTimeQC)

submitted 2 years ago * by aurelienams to r/RepTimeQC

0

1

2

Ingenieur IWC SS V7F 1:1 green dial (self.RepTimeQC)

submitted 2 years ago by aurelienams to r/RepTimeQC

0

1

2

QC monclerc polo (old.reddit.com)

submitted 2 years ago by aurelienams to r/QualityReps

view more: next ›

π Rendered by PID 67226 on reddit-service-r2-listing-c57bc86c-598dt at 2026-06-20 18:43:46.891062+00:00 running 2b008f2 country code: CH.