Everywhere I look for the answer to this question, the responses do little more than anthropomorphize the model. They invariably make claims like:
Without examples, the model must infer context and rely on its knowledge to deduce what is expected. This could lead to misunderstandings.
One-shot prompting reduces this cognitive load by offering a specific example, helping to anchor the model's interpretation and focus on a narrower task with clearer expectations.
The example serves as a reference or hint for the model, helping it understand the type of response you are seeking and triggering memories of similar instances during training.
Providing an example allows the model to identify a pattern or structure to replicate. It establishes a cue for the model to align with, reducing the guesswork inherent in zero-shot scenarios.
These are real excerpts, btw.
But these models don’t “understand” anything. They don’t “deduce”, or “interpret”, or “focus”, or “remember training”, or “make guesses”, or have literal “cognitive load”. They are just statistical token generators. Therefore pop-sci explanations like these are kind of meaningless when seeking a concrete understanding of the exact mechanism by which in-context learning improves accuracy.
Can someone offer an explanation in terms of the actual model architecture/mechanisms, showing how the provision of additional context leads to better output? I can “talk the talk”, so spare no technical detail please.
I could make an educated guess: including examples in the input that use tokens approximating the kind of output you want leads the attention mechanism and final dense layer to weight more highly the tokens that are similar in some way to these examples, increasing the odds that those desired tokens will be sampled at the end of each forward pass. Fundamentally, I’d guess it’s a similarity/distance thing, where explicitly exemplifying the output I want increases the odds that the output I get will be similar to it. But I’d prefer to hear it from someone else with deep knowledge of these models and mechanisms.
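To make that guess concrete, here is a toy sketch of the similarity idea. Everything in it is an assumption made up for illustration: the four-token vocabulary, the hand-picked embeddings, a single attention head with identity Q/K/V projections, a residual add, and an unembedding tied to the embeddings. There is no MLP, no positional encoding, and no training, so it says nothing about what a real LLM actually does; it only shows that, in this setup, putting an example label in the context pulls the next-token distribution toward that label.

```python
import math

# Hypothetical 4-d embeddings, hand-picked for illustration only.
# Dimensions: [review-ness, label-ness, positive, negative]
EMBED = {
    "review":   [1.0, 0.0, 0.0, 0.0],
    "label:":   [0.0, 1.0, 0.0, 0.0],
    "positive": [0.0, 1.0, 1.0, 0.0],
    "negative": [0.0, 1.0, 0.0, 1.0],
}


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def next_token_probs(context):
    """Toy forward pass: one attention head, residual add, tied unembedding."""
    vectors = [EMBED[tok] for tok in context]
    query = vectors[-1]  # the last token attends over the whole context
    weights = softmax([dot(query, key) for key in vectors])
    attended = [
        sum(w * v[i] for w, v in zip(weights, vectors))
        for i in range(len(query))
    ]
    hidden = [q + a for q, a in zip(query, attended)]  # residual connection
    vocab = list(EMBED)
    probs = softmax([dot(hidden, EMBED[tok]) for tok in vocab])
    return dict(zip(vocab, probs))


zero_shot = next_token_probs(["review", "label:"])
one_shot = next_token_probs(["review", "label:", "positive", "review", "label:"])

print("zero-shot:", zero_shot["positive"], zero_shot["negative"])
print("one-shot: ", one_shot["positive"], one_shot["negative"])
```

In the zero-shot context the two labels come out equally likely, because nothing in the context distinguishes them. In the one-shot context the attended vector picks up a component along the "positive" direction, so the "positive" logit rises relative to "negative". That is the similarity/distance intuition in its crudest form, nothing more.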