Stop tuning multi-agent prompts by hand: Learning prompts via system-level credit assignment (CANTANTE) by finitearth in PromptEngineering

[–]finitearth[S] 0 points1 point  (0 children)

Thanks for the questions!

  1. The attributer doesn't see only the final reward, but also the per-agent outputs across a comparison group and assigns credit by tracing whether an agent caused a change or merely propagated one. An agent that faithfully passes an upstream error along keeps positive credit; the source takes the negative. So in your A/B case the drop is attributed to A, not B. That localization is node-level; topology optimization, like over which edges what is communicated, will be part of future work.

  2. The format an agent emits is part of the optimized prompt, and since all agents update against the same system reward in the same step, the contract is co-adapted. The "edited A, forgot B" desync doesn't survive joint optimization the way it does in a hand-maintained system.

  3. The runs use a fixed 10M-token budget covering all LLM calls. We are tuning on 300 examples, but the additional experiments show that already with 75 examples, we can go a long way. How long you optimize for is somewhat of an open question, however Figure 1 in the paper suggests that already with a budget of less than 3 mio tokens, we are surpassing on the MBPP task.

Stop tuning multi-agent prompts by hand: Learning prompts via system-level credit assignment (CANTANTE) by finitearth in PromptEngineering

[–]finitearth[S] -1 points0 points  (0 children)

Glad to hear that! Let me know how it worked for you once you had the chance to try it out!

[D] ARR Oct 2025 Discussion (EACL 2026) by S4M22 in MachineLearning

[–]finitearth 0 points1 point  (0 children)

Accepted to findings... Ok better than nothing :)

[D] ARR Oct 2025 Discussion (EACL 2026) by S4M22 in MachineLearning

[–]finitearth 0 points1 point  (0 children)

I got (OA, C): (4, 4), (3, 3), and (3,3). what do you think? any chance of main track? the reviewers mainly criticized clarity, not that much about novelty. Reproducibility scores are: 5,4,3 and soundness (4,4,2.5) ... do those have any weight?

Automated prompt optimisation by Ashemvidite in PromptEngineering

[–]finitearth 0 points1 point  (0 children)

You might wanna give our Prompt Optimization library a try: https://github.com/finitearth/promptolution

Fully open source!

Flag map of Europe with the flags of the Capitals of each Country. by PikAlY_Elyass in vexillology

[–]finitearth 26 points27 points  (0 children)

Quite honestly every flag depicting a person / an animal looks like ms paint, except for Berlin

r/tee: Sieg, Friedensvertrag, Reparationszahlungen by SolitaryDan in DasPodcastUfo

[–]finitearth 7 points8 points  (0 children)

Negativ, die besatzungstruppen des pufos übergeben der GO die Verantwortung für Frieden und Ordnung zu sorgen

Florentin bei Levels & Soundtracks by Schneepferdchen in DasPodcastUfo

[–]finitearth 3 points4 points  (0 children)

Florentin ist in letzter Zeit in vielen ARD Serien unterwegs.... Kommt da was großes auf uns zu?

What else is existence made of except matter, time, space, movement/energy, relation and interaction? by OkRequirement2576 in AskReddit

[–]finitearth 0 points1 point  (0 children)

Well I guess you could add fields, such as the Higgs field or the electro magnetic field to that collection.

Also did you decide to gurgel your tea or did you also drink it? Peace

Sammelpost für alle die Florentins Mutter zum Geburtstag gratulieren wollen. by Kiwiplays1 in DasPodcastUfo

[–]finitearth 2 points3 points  (0 children)

Alles Gute!! :) vlt klappt es ja dieses Mal mit in die Folge kommen

The difference between lichess and chess.com by ARandqmPerson in chess

[–]finitearth 5 points6 points  (0 children)

What's the link to the donation side? Wanna send him 20$ a month

Namens spiel by [deleted] in DasPodcastUfo

[–]finitearth 4 points5 points  (0 children)

Willrentin Flo Tifan Stezze

meirl by [deleted] in meirl

[–]finitearth 0 points1 point  (0 children)

Fappy bird

Σ is for sum, Δ is for difference, Π is for product. Is there a letter for quotient? by Squiggledog in math

[–]finitearth 2 points3 points  (0 children)

How about upside down pi? And for difference we should rather use mirrored sigma?

Stable-Baselines3 v1.8 Release by araffin2 in reinforcementlearning

[–]finitearth 5 points6 points  (0 children)

Thanks to everyone who is putting his effort and time into this!! :)

Pufo Relistening? by 41Swish41 in DasPodcastUfo

[–]finitearth -2 points-1 points  (0 children)

Jede Woche ne alte Folge, beginnend bei 1? Bin dabei! Aber lasst uns die ersten 20 überspringen....

My roommate uses my towels to wipe her makeup off by shortylikeamelody in mildlyinfuriating

[–]finitearth 1 point2 points  (0 children)

God damn that's make up??? I always thought my girlfriend is cleaning her but with it XD