umut's blog
posts
projects
archives
search
Hi there. Just figuring things out.
Notes, ideas, and experiments along the way.
Online Softmax in Attention Mechanism
Backpropagation Through Matrix Multiplication