SIMT
SIMT stands for Single Instruction, Multiple Threads. It is a parallel execution model used by many modern graphics processing units, most notably NVIDIA’s CUDA-enabled devices. Under SIMT, groups of threads—commonly called warps—execute the same instruction across multiple data elements. The hardware schedules warps on a set of execution units, enabling thousands of threads to participate in data-parallel computations.
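As a rough illustration of the grouping described above, the following Python sketch partitions thread ids into warps and applies one instruction to every lane of each warp. It is a toy model, not how any GPU is actually programmed: the names `WARP_SIZE`, `split_into_warps`, and `run_instruction` are hypothetical, and the warp width of 32 is an assumption chosen because it is a common value.

```python
WARP_SIZE = 32  # assumption: a common warp width; real hardware may differ


def split_into_warps(num_threads, warp_size=WARP_SIZE):
    """Group consecutive thread ids into warps (the last warp may be partial)."""
    return [
        list(range(start, min(start + warp_size, num_threads)))
        for start in range(0, num_threads, warp_size)
    ]


def run_instruction(warps, instruction, data):
    """Issue one instruction per warp; each lane applies it to its own element."""
    out = list(data)
    for warp in warps:        # a scheduler would pick warps one at a time
        for tid in warp:      # lanes of a warp conceptually run in lockstep
            out[tid] = instruction(data[tid])
    return out


warps = split_into_warps(num_threads=100)
result = run_instruction(warps, lambda x: x * 2, list(range(100)))
```

With 100 threads and the assumed width of 32, the sketch produces three full warps and one partial warp of four threads.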
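Unlike traditional SIMD, SIMT allows threads within a warp to follow different control paths. When a branch causes the threads of a warp to diverge, the hardware typically executes each path in turn with the non-participating threads masked off, so divergent code runs at reduced throughput until the threads reconverge.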
In practice, a warp often consists of a fixed number of threads (for example, 32 in many NVIDIA GPUs).
SIMT is a core concept in GPU programming models like CUDA and underpins scalable data-parallel computation.
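To make the handling of different control paths within a warp more concrete, here is a minimal Python sketch of one warp executing an if/else under a per-lane active mask. It is a simplified model under stated assumptions: the function name `run_divergent_branch` and its parameters are hypothetical, a warp of 32 lanes is assumed, and real hardware uses more elaborate reconvergence mechanisms that vary by architecture.

```python
def run_divergent_branch(values, predicate, then_op, else_op):
    """Simulate one warp executing `then_op if predicate(x) else else_op`."""
    mask = [predicate(v) for v in values]  # per-lane branch outcome
    out = list(values)
    passes = 0

    # Pass 1: lanes that took the branch are active; the others are masked off.
    if any(mask):
        passes += 1
        for lane, active in enumerate(mask):
            if active:
                out[lane] = then_op(values[lane])

    # Pass 2: the complementary lanes execute the other path.
    if not all(mask):
        passes += 1
        for lane, active in enumerate(mask):
            if not active:
                out[lane] = else_op(values[lane])

    return out, passes


warp_values = list(range(32))  # assumption: a 32-lane warp

# Even and odd lanes disagree, so both paths must be executed.
diverged, diverged_passes = run_divergent_branch(
    warp_values, lambda x: x % 2 == 0, lambda x: x + 100, lambda x: -x
)

# Every lane agrees, so only one path is executed.
uniform, uniform_passes = run_divergent_branch(
    warp_values, lambda x: x >= 0, lambda x: x + 100, lambda x: -x
)
```

In this model the diverged case takes two passes over the warp while the uniform case takes one, which is the intuition behind keeping the threads of a warp on the same control path where possible.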