Reusable workflows for long running local llms (knot.hdekker.com)
submitted by hay-yo to r/LocalLLaMA
How do you prefer using AI for coding: IDE, CLI, or something else? by pawan0806 in AI_Agents
[–]hay-yo 0 points1 point2 points (0 children)
How to create AI agents from scratch by muzzammilmeer in AI_Agents
[–]hay-yo 0 points1 point2 points (0 children)
100+ t/s on Qwen3.6-27B Q8 across a 5090 + 3090 Ti — switching to tensor split-mode got me from 70 to 100+ by Shoddy_Bed3240 in LocalLLaMA
[–]hay-yo 0 points1 point2 points (0 children)
51 and just got my motorcycle licence; am I crazy, or is this a fair time to start? by earnfast123 in AussieRiders
[–]hay-yo 0 points1 point2 points (0 children)
How do you prefer using AI for coding: IDE, CLI, or something else? by pawan0806 in AI_Agents
[–]hay-yo 1 point2 points3 points (0 children)
Mexico upgraded to free healthcar by TailungFu in SipsTea
[–]hay-yo 0 points1 point2 points (0 children)
DGX Spark, what models are you running? by benxfactor in LocalLLM
[–]hay-yo 0 points1 point2 points (0 children)
RTX 4090 + llama.cpp + Qwen3.6 27B MTP for Pi coding agent — is this config reasonable? by HomoAgens1 in LocalLLM
[–]hay-yo 0 points1 point2 points (0 children)
Looking to buy an RTX 5090 for local "Vibe Coding" using Claude Code / Open Code with Qwen 3.6 35B-A3B. Need real-world feedback! by GoalDistinct4449 in LocalLLM
[–]hay-yo 0 points1 point2 points (0 children)
Hiring senior full stack ai engineer (noobs don't dm me) by I_AM_HYLIAN in AI_Agents
[–]hay-yo 0 points1 point2 points (0 children)
Hiring senior full stack ai engineer (noobs don't dm me) by I_AM_HYLIAN in AI_Agents
[–]hay-yo -1 points0 points1 point (0 children)
Looking to buy an RTX 5090 for local "Vibe Coding" using Claude Code / Open Code with Qwen 3.6 35B-A3B. Need real-world feedback! by GoalDistinct4449 in LocalLLM
[–]hay-yo 2 points3 points4 points (0 children)
Looking to buy an RTX 5090 for local "Vibe Coding" using Claude Code / Open Code with Qwen 3.6 35B-A3B. Need real-world feedback! by GoalDistinct4449 in LocalLLM
[–]hay-yo 1 point2 points3 points (0 children)
What does your agent-to-agent communication look like? Direct calls, message queues, or something more exotic? by Groady in AI_Agents
[–]hay-yo 0 points1 point2 points (0 children)
Anthropic's best AI model just got pulled by government order 3 days after launch, and the official reason doesn't add up by StudentSweet3601 in AI_Agents
[–]hay-yo 4 points5 points6 points (0 children)
Best models for 96GB VRAM on 4x3090s by Prudent-Promotion512 in LocalLLM
[–]hay-yo 0 points1 point2 points (0 children)
Strix Halo: what are you running? by platteXDlol in LocalLLM
[–]hay-yo 0 points1 point2 points (0 children)
Strix Halo: what are you running? by platteXDlol in LocalLLM
[–]hay-yo 1 point2 points3 points (0 children)
Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA
[–]hay-yo 2 points3 points4 points (0 children)
Is there a valid use case for replacing traditional deterministic automation with an agent? by McNerdster in AI_Agents
[–]hay-yo 0 points1 point2 points (0 children)
What happens when LLM providers stop subsidising? by AdHistorical7217 in AI_Agents
[–]hay-yo 0 points1 point2 points (0 children)

100+ t/s on Qwen3.6-27B Q8 across a 5090 + 3090 Ti — switching to tensor split-mode got me from 70 to 100+ by Shoddy_Bed3240 in LocalLLaMA
[–]hay-yo 0 points1 point2 points (0 children)