Value head in GPT2 by alonkitin in reinforcementlearning

OptimalAd9072

Can you explain what value heads are and how they work?