Stanford CS149 I Parallel Computing I 2023 I Lecture 7 - GPU architecture and CUDA Programming

18 Sep 2024

Early GPU Development and Purpose

GPU Architecture and Processing Power

Early Attempts at General-Purpose Computing

Introduction of CUDA

CUDA Programming Concepts

CUDA Memory Model and Communication

CUDA Thread Management

Shared Memory in CUDA

CUDA Thread Block Management

CUDA Compilation and Execution

GPU Core Architecture

Warp Execution and Scheduling

Thread Blocks and Warps

Streaming Multiprocessors (SMs)

Block Size Flexibility and Optimization

Shared Memory Constraints

Partial Thread Block Execution and Synchronization
