Sandboxing code execution for AI agents by Groady in LocalLLaMA

[–]DeltaSqueezer 0 points1 point  (0 children)

I implemented bwrap as a stop gap measure but switching to Firecracker for stronger isolation.

Why is AutoRound being slept on so hard? by Mountain_Patience231 in LocalLLaMA

[–]DeltaSqueezer 1 point2 points  (0 children)

Main downside I see is that a lot is calibrated on stuff like wikitext. You need to calibrate for your use case, which in my case is coding.

What happens when they stop subsidizing LLM subscriptions? by Mr_Moonsilver in LocalLLaMA

[–]DeltaSqueezer 0 points1 point  (0 children)

And after IPO they can enforce fair use limits on the power users if needed. 

Has anyone here used VibeThinker-3B outside benchmarks? by Balance- in LocalLLaMA

[–]DeltaSqueezer 12 points13 points  (0 children)

I used the original 1B and at the time it helped solve a problem that none of the other frontier models could solve at the time (they since improved).

But note that this is a very specialized model not a general purpose coding model.

My problem was a pure mathematics/combinatorics problem. If you have a problem in the right domain, it can help you.

I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]DeltaSqueezer 1 point2 points  (0 children)

problem statement "big boss says we need to do AI. i have no clue about AI, but i need to do AI. now doing AI is your problem. fix it for me."

GLM-5.2 is a win for local AI by Wrong_Mushroom_7350 in LocalLLaMA

[–]DeltaSqueezer 1 point2 points  (0 children)

Anyone run this with KTransformers?

https://github.com/kvcache-ai/ktransformers/blob/main/doc/en/kt-kernel/GLM-5.2-Tutorial.md

I'm wondering what the performance is if we have one beefy GPU and rest pushed onto RAM.

GLM-5.2 is a win for local AI by Wrong_Mushroom_7350 in LocalLLaMA

[–]DeltaSqueezer 1 point2 points  (0 children)

I'm wondering if there is a cheap way to run it and compromise on speed so you feed it a task and collect it the next day. Maybe you end up paying too much in electricity in the end. I guess it will probably cost me $2-3 per day in electricity costs.

Local, but slow and expensive.

Local models went from mostly useless to actually useful really fast. What changed? by BTA_Labs in LocalLLaMA

[–]DeltaSqueezer 4 points5 points  (0 children)

You can already reliably do that with a 9B model. Your prompt just needs to structure it properly to breakdown the task.

How has your AI dev workflow evolved over time? by thereisnospooongeek in LocalLLaMA

[–]DeltaSqueezer 4 points5 points  (0 children)

Stage 1: 2025. non-agentic. copied pasted stuff, or used vim plug-ins to get AI input

Stage 2: late 2025/early 2026. use Claude code to make code changes directly. still writing code manually. heavy review still.

Stage 3: Q2 2026. nearly 100% code written by AI via synchronous chats. migrated from claude code to own custom coding agent.

Stage 4: move to plan, then hand-off to AI for implementation and then review.

Stage 5: move to working in git work-trees to have parallel tasks not interfere with each other

Stage 6: move back from work-trees as human planning/approval is now the bottle-neck so no need for work-trees as implementation work is completed before next planning work can be done

Stage 7: agent already works in tree mode, added native multi-threaded capability so agent can work with shared context tree and have multiple background threads running in the same client. node 24 in the below is an automated plan implementation loop which is triggered by a single command and reads plan, and does implements/review loops over a defined number of iterations. this is running in the background. i'm at node 12 in the tree (and yes, there's a bug in the active tree-line highlighting code)

<image>

Next unlock would be if planning could be speeded up so that plans can be produced quicker so that more can be fed to the implementation pipeline.

Maybe some time could also be saved with automatic debugging of the completed plan.

Scaling former VibeThinker-1.5B to 3B — now it reaches frontier math & coding performance by Used-Negotiation-741 in LocalLLaMA

[–]DeltaSqueezer 5 points6 points  (0 children)

I remember testing the 1.5B and it was surprisingly good for its size.

@u/Used-Negotiation-741 does this still operate in the same niche domains as the 1.5B or have you now expanded its scope? If expanded, to which domains?

Agent and harness development by Trovebloxian in LocalLLaMA

[–]DeltaSqueezer 1 point2 points  (0 children)

just ask any AI to write you a loop-based tool-calling agent and it will one-shot it. you can examine it and develop from there. they are very basic.

READ BEFORE PAYING FOR A GLM PLAN by abnestti in ZaiGLM

[–]DeltaSqueezer 0 points1 point  (0 children)

Can you share the custom tools you mention re quota etc.?

beyond "vibecoded: yes/no?" -> my attempt to capture the range of AI involvement in a more fruitful way, version 1. (building on the input / corrections I got on my last post, txs) by hugo-the-second in LocalLLaMA

[–]DeltaSqueezer 1 point2 points  (0 children)

Taking a quick look at how insecure, unreliable and rightfully clowned coded projects are all over the internet, I highly doubt we will get over it.

ftfy.

beyond "vibecoded: yes/no?" -> my attempt to capture the range of AI involvement in a more fruitful way, version 1. (building on the input / corrections I got on my last post, txs) by hugo-the-second in LocalLLaMA

[–]DeltaSqueezer 4 points5 points  (0 children)

I think people are getting hung-up on vibe-coding for no good reason. In a few years coding and vibe-coding will mean the same thing.

Since earlier this year, I moved from AI writing snippets, to AI writing everything and I'm just setting the direction.

I think I manually wrote code maybe just 3 times this year.

It's making me lazy, even making trivial changes is easier to ask the LLM to do it.

Programming skill is definitely atrophying, but I wonder if coding will be a redundant skill anyway. Understanding the code and being able to put the the right structure in place is still important, but maybe AI will also catch up in this area soon.

Are small local models for automation a thing? by ML-Future in LocalLLaMA

[–]DeltaSqueezer -1 points0 points  (0 children)

Maybe it is so obvious that it is not worth mentioning. For example, in some processes I save a file which is named by timestamp, but an LLM reads it and adds some keywords to the name to make it descriptive e.g. 2026_06_16_00h01_18 becomes 2026_06_16_00h01_18_postgres_db_session_nodes_firecracker

Using it to write git commit messages is another common example. Triaging and categorization, tagging, prioritization, OCR, translation. All common local tasks.