1. Awesome High-Performance AI Compute
  2. Introduction
  3. High-Performance AI Computing
  4. Parallel Computing
  5. CUDA Programming
    1. CUDA Concepts
      1. Thread Coarsening
      2. Reduction
    2. CUDA Kernels
      1. Attention
      2. Encoder
      3. LayerNorm
      4. Matrix Multiplication (MatMul)
      5. Softmax
      6. Triangular Matrix Multiplication (TriMat)