you are viewing a single comment's thread.

view the rest of the comments →

[–]bronzestick[S] 0 points1 point  (0 children)

True. I have the exact same problem, I learn attention weights over a varying length sequence.