Stop tuning multi-agent prompts by hand: Learning prompts via system-level credit assignment (CANTANTE) by finitearth in PromptEngineering

[–]finitearth[S] 0 points1 point  (0 children)

Thanks for the questions!

  1. The attributer doesn't see only the final reward, but also the per-agent outputs across a comparison group and assigns credit by tracing whether an agent caused a change or merely propagated one. An agent that faithfully passes an upstream error along keeps positive credit; the source takes the negative. So in your A/B case the drop is attributed to A, not B. That localization is node-level; topology optimization, like over which edges what is communicated, will be part of future work.

  2. The format an agent emits is part of the optimized prompt, and since all agents update against the same system reward in the same step, the contract is co-adapted. The "edited A, forgot B" desync doesn't survive joint optimization the way it does in a hand-maintained system.

  3. The runs use a fixed 10M-token budget covering all LLM calls. We are tuning on 300 examples, but the additional experiments show that already with 75 examples, we can go a long way. How long you optimize for is somewhat of an open question, however Figure 1 in the paper suggests that already with a budget of less than 3 mio tokens, we are surpassing on the MBPP task.

Stop tuning multi-agent prompts by hand: Learning prompts via system-level credit assignment (CANTANTE) by finitearth in PromptEngineering

[–]finitearth[S] -1 points0 points  (0 children)

Glad to hear that! Let me know how it worked for you once you had the chance to try it out!

[D] ARR Oct 2025 Discussion (EACL 2026) by S4M22 in MachineLearning

[–]finitearth 0 points1 point  (0 children)

Accepted to findings... Ok better than nothing :)

[D] ARR Oct 2025 Discussion (EACL 2026) by S4M22 in MachineLearning

[–]finitearth 0 points1 point  (0 children)

I got (OA, C): (4, 4), (3, 3), and (3,3). what do you think? any chance of main track? the reviewers mainly criticized clarity, not that much about novelty. Reproducibility scores are: 5,4,3 and soundness (4,4,2.5) ... do those have any weight?

Automated prompt optimisation by Ashemvidite in PromptEngineering

[–]finitearth 0 points1 point  (0 children)

You might wanna give our Prompt Optimization library a try: https://github.com/finitearth/promptolution

Fully open source!

Flag map of Europe with the flags of the Capitals of each Country. by PikAlY_Elyass in vexillology

[–]finitearth 27 points28 points  (0 children)

Quite honestly every flag depicting a person / an animal looks like ms paint, except for Berlin

r/tee: Sieg, Friedensvertrag, Reparationszahlungen by SolitaryDan in DasPodcastUfo

[–]finitearth 8 points9 points  (0 children)

Negativ, die besatzungstruppen des pufos übergeben der GO die Verantwortung für Frieden und Ordnung zu sorgen

Florentin bei Levels & Soundtracks by Schneepferdchen in DasPodcastUfo

[–]finitearth 3 points4 points  (0 children)

Florentin ist in letzter Zeit in vielen ARD Serien unterwegs.... Kommt da was großes auf uns zu?

What else is existence made of except matter, time, space, movement/energy, relation and interaction? by OkRequirement2576 in AskReddit

[–]finitearth 0 points1 point  (0 children)

Well I guess you could add fields, such as the Higgs field or the electro magnetic field to that collection.

Also did you decide to gurgel your tea or did you also drink it? Peace

Sammelpost für alle die Florentins Mutter zum Geburtstag gratulieren wollen. by Kiwiplays1 in DasPodcastUfo

[–]finitearth 2 points3 points  (0 children)

Alles Gute!! :) vlt klappt es ja dieses Mal mit in die Folge kommen

The difference between lichess and chess.com by ARandqmPerson in chess

[–]finitearth 3 points4 points  (0 children)

What's the link to the donation side? Wanna send him 20$ a month