References
https://andrewkchan.dev/posts/things-april-2024.html
https://www.nuss-and-bolts.com/p/optimizing-a-webgpu-matmul-kernel
https://siboehm.com/articles/22/CUDA-MMM
https://github.com/gpu-mode/lectures/tree/main/lecture_031
https://www.youtube.com/watch?v=h1mOkhFOSPU
https://developer.apple.com/documentation/metal/performing-calculations-on-a-gpu