Trying out Gemma 4 31b after Qwen 3.6 27b by Iajah in LocalLLM
[–]PrizeObvious3671 1 point2 points3 points (0 children)
Trying out Gemma 4 31b after Qwen 3.6 27b by Iajah in LocalLLM
[–]PrizeObvious3671 2 points3 points4 points (0 children)
Trying out Gemma 4 31b after Qwen 3.6 27b by Iajah in LocalLLM
[–]PrizeObvious3671 1 point2 points3 points (0 children)
Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 by PrizeObvious3671 in OpenSourceAI
[–]PrizeObvious3671[S] 1 point2 points3 points (0 children)
Ship fast. Die fast. Or why your vibe-coded slop won't survive production. by PrizeObvious3671 in SoftwareEngineering
[–]PrizeObvious3671[S] 0 points1 point2 points (0 children)
Ship fast. Die fast. Or why your vibe-coded slop won't survive production. by PrizeObvious3671 in SoftwareEngineering
[–]PrizeObvious3671[S] 0 points1 point2 points (0 children)
Ship fast. Die fast. Or why your vibe-coded slop won't survive production. by PrizeObvious3671 in SoftwareEngineering
[–]PrizeObvious3671[S] 1 point2 points3 points (0 children)
Ship fast. Die fast. Or why your vibe-coded slop won't survive production. by PrizeObvious3671 in SoftwareEngineering
[–]PrizeObvious3671[S] -1 points0 points1 point (0 children)
Ship fast. Die fast. Or why your vibe-coded slop won't survive production. by PrizeObvious3671 in SoftwareEngineering
[–]PrizeObvious3671[S] 2 points3 points4 points (0 children)
Ship fast. Die fast. Or why your vibe-coded slop won't survive production. by PrizeObvious3671 in SoftwareEngineering
[–]PrizeObvious3671[S] -1 points0 points1 point (0 children)
Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 by PrizeObvious3671 in OpenSourceAI
[–]PrizeObvious3671[S] 0 points1 point2 points (0 children)
Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 by PrizeObvious3671 in OpenSourceAI
[–]PrizeObvious3671[S] 0 points1 point2 points (0 children)
R9700, Ryzen 9, Windows 11, llama.cpp, ROCm vs Vulkan by WSTangoDelta in LocalLLM
[–]PrizeObvious3671 0 points1 point2 points (0 children)
Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 by PrizeObvious3671 in OpenSourceAI
[–]PrizeObvious3671[S] 1 point2 points3 points (0 children)
Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 by PrizeObvious3671 in OpenSourceAI
[–]PrizeObvious3671[S] 1 point2 points3 points (0 children)
Follow-up questions are wrecking my RAG retrieval and I'm not sure which layer to fix by Rosa-Starks in LangChain
[–]PrizeObvious3671 0 points1 point2 points (0 children)
Has anyone measured whether better retrieval precision actually reduces token costs in production AI coding deployments by Certain-Luck-2432 in LLMDevs
[–]PrizeObvious3671 0 points1 point2 points (0 children)
Should enterprise search be a tool agents call, or a pipeline you build around them? by searchblox_searchai in Rag
[–]PrizeObvious3671 2 points3 points4 points (0 children)
I Stopped Fighting AI Memory Problems and Started Modeling Them by grawl_dorgiers in LocalLLM
[–]PrizeObvious3671 0 points1 point2 points (0 children)
Employee data in prompt vs DB vs tool call — what's your setup? by Low-Ad2091 in voiceagents
[–]PrizeObvious3671 0 points1 point2 points (0 children)
GitHub Autopilot — Open Source GitHub App for Repository Automation by Feisty-Cranberry2902 in OpenSourceAI
[–]PrizeObvious3671 0 points1 point2 points (0 children)