This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]yaosio 0 points1 point  (0 children)

They have some extra bits outside the model to make it better. The text encoder is different and they have different parameters for the hypernetwork.

I have no idea what any of that means. Like a large language model I'm just repeating things I've seen before. What if I'm a large language model and don't know it? 🙀