Courses
-
MiniTorch Module 3 parts 3 and 4
-
LLM System lecture 1a: https://llmsystem.github.io/llmsystem2025spring/assets/files/llmsys-01-intro-494cf8038c6bbd0cbef448899ef34864.pdf
-
LLM System lecture 1b: https://llmsystem.github.io/llmsystem2025spring/assets/files/llmsys-02-gpu-programming-5fba63d213cdb0da3a74246309497470.pdf
Tutorials
- Using Shared Memory in CUDA C/C++ https://developer.nvidia.com/blog/using-shared-memory-cuda-cc/
Exploratory
- NUMBA Writing CUDA kernels https://numba.readthedocs.io/en/stable/cuda/kernels.html
- NUMBA CUDA Memory Management https://numba.readthedocs.io/en/stable/cuda/memory.html
- NUMBA CUDA examples for vector addition, sum reduction and matmul https://numba.readthedocs.io/en/stable/cuda/examples.html
Random
https://forum.obsidian.md/t/a-guide-on-links-vs-tags-in-obsidian/28231/2