Wow, a mathematical proof that attention mechanisms aren't as computationally expensive as we thought? Fascinating - this could be a game changer for neural network scaling.