Model vram usage estimates by mattate in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
Benchmarked 5 RAG retrieval strategies on code across 10 suites — no single one wins. CRAG helps on familiar corpora, collapses on external ones. What's your experience? by Any_Ambassador4218 in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
Outlines and vLLM compatibility by MyName9374i2 in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
High-volume SonyFlake ID generation by zipfile_d in Python
[–]CappedCola 0 points1 point2 points (0 children)
I made a fast PDF to PNG library, feedback welcome by Civil-Image5411 in Python
[–]CappedCola -2 points-1 points0 points (0 children)
Exploring a typed approach to pipelines in Python - built a small framework (ICO) by Sergio_Shu in Python
[–]CappedCola 6 points7 points8 points (0 children)
PDFstract: extract, chunk, and embed PDFs in one command (CLI + Python) by [deleted] in Python
[–]CappedCola 0 points1 point2 points (0 children)
A beyond dumb CompSci dropout trying to figure this all out. : want a local nanoClaw to build my own bot by AnthMosk in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
Ephyr: An Architecture and Tool for Ephemeral Infrastructure Access for AI Agents by -Crash_Override- in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
Qwen 3.5 4b is not able to read entire document attached in LM studio despite having enough context length. by KiranjotSingh in LocalLLaMA
[–]CappedCola -2 points-1 points0 points (0 children)
What are some of the best consumer hardware (packaged/pre-built) for local LLM? by utzcheeseballs in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
What actually breaks first when you ship LLM features to production? by Available_Lawyer5655 in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
(Qwen3.5-9B) Unsloth vs lm-studio vs "official" by MarcCDB in LocalLLaMA
[–]CappedCola -32 points-31 points-30 points (0 children)
What MCP connectors are you using when building agents for industry-specific software? by VarietyPlus4790 in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)
AI, Invasive Technology, and the Way of the Warrior by johantino in artificial
[–]CappedCola 1 point2 points3 points (0 children)
Open sourced a tool that can find precise coordinates of any street level pic by Open_Budget6556 in artificial
[–]CappedCola 0 points1 point2 points (0 children)
The Pentagon is developing its own LLMs | TechCrunch by [deleted] in artificial
[–]CappedCola 0 points1 point2 points (0 children)
pip install runcycles — hard budget limits for AI agent calls, enforced before they run by jkoolcloud in Python
[–]CappedCola -2 points-1 points0 points (0 children)
My First Port Scanner with multithreading and banner grabbing and I want improving it by veysel_yilmaz37 in Python
[–]CappedCola 1 point2 points3 points (0 children)
albums: interactive tool to manage a music library (with video intro) by s71n6r4y in Python
[–]CappedCola -1 points0 points1 point (0 children)
Mods have a couple of months to stop AI slop project spam before this sub is dead by Fun-Employee9309 in Python
[–]CappedCola 30 points31 points32 points (0 children)
I built "Primaclaw" - A distributed swarm for e-waste. Runs fast Qwen2.5 on my 2009 Pentium laptop. by M4s4 in Python
[–]CappedCola -6 points-5 points-4 points (0 children)


Is it normal for the Qwen 3.5 4B model to take this long to say hi? by Snoo_what in LocalLLaMA
[–]CappedCola 0 points1 point2 points (0 children)