00:00Every one of these heads produces a proposed change to be added to the embedding in that position.
00:06So what you do is you sum together all of those proposed changes, one for each head,
00:12and you add the result to the original embedding of that position.
00:30This is about to go.
01:00This is about to go.
Comments