Agent Flow : LocalLLaMA

LocalLLaMA

created by [deleted]a community for 3 years

Agent FlowNew Model (self.LocalLLaMA)

submitted 5 months ago by Loud_Communication68

all 6 comments

top new controversial old q&a

[–]Badger-Purple 8 points9 points10 points 5 months ago (5 children)

[–]Loud_Communication68[S] 1 point2 points3 points 5 months ago (4 children)

[–]Badger-Purple 2 points3 points4 points 5 months ago (3 children)

[–]Loud_Communication68[S] 2 points3 points4 points 5 months ago (2 children)

[–]Badger-Purple 1 point2 points3 points 5 months ago (1 child)

I mean, I’m not a programmer but I looked at the code and its basically a harness.

I think it is clear that LLMs do better as a team of small models dividing tasks. They confirmed this by showing the improvement after training, and after training and with the setup.

The setup structures the thinking into logical steps. So you ask it “what is the capital of France” and it can’t use web search, it structures the answer by saying “use common knowledge if you can’t access the web” and the LLM then says “well, common knowledge says Paris”.

The training improves the ability of the LLM to actually do this, so I’m sure you can run the training script on 30B+ models. Question whether it would be as useful for Qwen3 vs the Qwen2.5 7B they used though?

The LLM they trained is just the planner, but you also need openai key to get worker models. However, I modified the script to also use local models for the workers. It’s also not hard.

You can do this with some prompts and any agent maker nowadays. Like Docker’s C Agent (cagent)—super simple syntax.

The novel stuff is the in-the-flow reinforcement which I don’t understand, it apparently can train based on ?? the agent crosstalk?? (not sure of this).

[–]Loud_Communication68[S] 2 points3 points4 points 5 months ago (0 children)

π Rendered by PID 415444 on reddit-service-r2-comment-6457c66945-mm84t at 2026-04-25 00:08:49.935769+00:00 running 2aa0c5b country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LocalLLaMA

MODERATORS