- PE(pos, 2i) = sin(pos / 10000^(2i/dim)). Why is the constant set to 10000? Does this value have a specific meaning for this task?
- I do not fully understand the positional encoding. Why are the sin and cos functions used alternately across dimensions?
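
For reference, here is a minimal sketch of how I currently understand the encoding, in case my reading is wrong. It assumes the cos counterpart PE(pos, 2i+1) = cos(pos / 10000^(2i/dim)) from the original Transformer paper. The function name `positional_encoding` and the `base` parameter are my own illustrative names, not taken from this repository.

```python
import math

def positional_encoding(max_len, dim, base=10000.0):
    """Return a max_len x dim table of sinusoidal position encodings.

    `base` is the constant asked about above (10000 in the formula).
    Hypothetical helper for illustration only.
    """
    table = []
    for pos in range(max_len):
        row = []
        for k in range(dim):
            i = k // 2  # dimensions 2i and 2i+1 share one frequency
            angle = pos / base ** (2 * i / dim)
            # even dimensions use sin, odd dimensions use cos
            row.append(math.sin(angle) if k % 2 == 0 else math.cos(angle))
        table.append(row)
    return table

pe = positional_encoding(max_len=4, dim=6)
print(pe[1])  # encoding of position 1
```

As far as I can tell, each sin/cos pair shares one frequency, and `base` only controls how slowly the frequency decreases along the dimension index.
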
Looking forward to your reply. Thank you.