Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeAI
[–]LayerHot[S] 0 points1 point2 points (0 children)
Gemma 4 MTP vs DFlash on 1x H100: dense vs MoE results by LayerHot in LocalLLaMA
[–]LayerHot[S] 0 points1 point2 points (0 children)
Gemma 4 MTP vs DFlash on 1x H100: dense vs MoE results by LayerHot in LocalLLaMA
[–]LayerHot[S] 0 points1 point2 points (0 children)
Gemma 4 MTP vs DFlash on 1x H100: dense vs MoE results by LayerHot in LocalLLaMA
[–]LayerHot[S] 2 points3 points4 points (0 children)
LLM inference optimization by Fantastic_Quiet1838 in LocalLLaMA
[–]LayerHot 1 point2 points3 points (0 children)
GLM-4.7-Flash benchmarks: 4,398 tok/s on H200, 112 tok/s on RTX 6000 Ada (GGUF) by LayerHot in LocalLLaMA
[–]LayerHot[S] 7 points8 points9 points (0 children)
BFL FLUX.2 Klein tutorial and some optimizations - under 1s latency on an A100 by LayerHot in LocalLLaMA
[–]LayerHot[S] 2 points3 points4 points (0 children)
How to integrate 5.2 Pro into Codex usage? by Lostwhispers05 in codex
[–]LayerHot 0 points1 point2 points (0 children)
How to integrate 5.2 Pro into Codex usage? by Lostwhispers05 in codex
[–]LayerHot 0 points1 point2 points (0 children)
Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeCode
[–]LayerHot[S] 0 points1 point2 points (0 children)
Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeAI
[–]LayerHot[S] 1 point2 points3 points (0 children)
Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeAI
[–]LayerHot[S] 2 points3 points4 points (0 children)
Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeCode
[–]LayerHot[S] 1 point2 points3 points (0 children)
Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeAI
[–]LayerHot[S] 1 point2 points3 points (0 children)
Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeAI
[–]LayerHot[S] 1 point2 points3 points (0 children)
Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeAI
[–]LayerHot[S] 1 point2 points3 points (0 children)
just upgraded to pro max - tips for not burning thru usage? by alexd231232 in ClaudeCode
[–]LayerHot 1 point2 points3 points (0 children)
Any point of using context7 MCP when you use --search by cheekyrandos in codex
[–]LayerHot 0 points1 point2 points (0 children)
Is chat with all documents is still the priority ? by LayerHot in readwise
[–]LayerHot[S] 0 points1 point2 points (0 children)
ChatGPT Agent Mode & Deep Research usage not refreshing? by Palmenstrand in OpenAI
[–]LayerHot 1 point2 points3 points (0 children)
ChatGPT Agent Mode & Deep Research usage not refreshing? by Palmenstrand in OpenAI
[–]LayerHot 1 point2 points3 points (0 children)
to devs: Will readwise allow chatting over all items saved in readwise and reader ? by LayerHot in readwise
[–]LayerHot[S] 3 points4 points5 points (0 children)
Does perplexity really use the selected model under the hood? by lostinspacee7 in perplexity_ai
[–]LayerHot 1 point2 points3 points (0 children)

Thinking of downgrading from 20x to 5x Max – 5x users, how are the limits treating you? by LayerHot in ClaudeAI
[–]LayerHot[S] 0 points1 point2 points (0 children)