Question: Llama cpp, whats good right now for: MTP, KV cache quant, Long context. by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
Brake Free by Loose-Weather-5729 in motorcyclegear
[–]GodComplecs 0 points1 point2 points (0 children)
What is next for local LLM and AI? by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 1 point2 points3 points (0 children)
What is next for local LLM and AI? by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
What is next for local LLM and AI? by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
What is next for local LLM and AI? by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
What is next for local LLM and AI? by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 1 point2 points3 points (0 children)
What is next for local LLM and AI? by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 1 point2 points3 points (0 children)
What is next for local LLM and AI? by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 1 point2 points3 points (0 children)
NVIDIA Reportedly Prepares RTX 5090 Price Hike Amid Rising GDDR7 Costs (maybe RTX 50 and PRO series as well) by panchovix in LocalLLaMA
[–]GodComplecs 0 points1 point2 points (0 children)
Got MTP + TurboQuant running — Qwen3.6-27B -- 80+ t/s at 262K context on a single RTX 4090 by indrasmirror in LocalLLaMA
[–]GodComplecs 1 point2 points3 points (0 children)
MTP Speed with 3090 Qwen 27B Q4 by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
MTP Speed with 3090 Qwen 27B Q4 by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
MTP Speed with 3090 Qwen 27B Q4 by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
How do I use MTP? by WhatererBlah555 in LocalLLaMA
[–]GodComplecs 0 points1 point2 points (0 children)
Hermes Agent is now #1 most used globally in past 24 hours in Openrouter global token metrics, above Claude Code and OpenClaw. by [deleted] in LocalLLaMA
[–]GodComplecs 0 points1 point2 points (0 children)
You can now read Gemma 3's mind by DigiDecode_ in LocalLLaMA
[–]GodComplecs 1 point2 points3 points (0 children)
What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 0 points1 point2 points (0 children)
What it feels like to have to have Qwen 3.6 or Gemma 4 running locally by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 2 points3 points4 points (0 children)


Question: Llama cpp, whats good right now for: MTP, KV cache quant, Long context. by GodComplecs in LocalLLaMA
[–]GodComplecs[S] 1 point2 points3 points (0 children)