SIMT

SIMT stands for Single Instruction, Multiple Threads. It is a parallel execution model used by many modern graphics processing units, most notably NVIDIA’s CUDA-enabled devices. Under SIMT, groups of threads—commonly called warps—execute the same instruction across multiple data elements. The hardware schedules warps on a set of execution units, enabling thousands of threads to participate in data-parallel computations.

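As a minimal sketch of the model (the kernel name vecAdd and the sizes here are illustrative, not taken from any particular codebase), the CUDA program below launches one thread per array element. The hardware groups consecutive threads into warps, and every thread in a warp executes the same add instruction on its own element.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread computes one output element. Within a warp, all threads
// execute this same instruction stream on different data elements.
__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        c[i] = a[i] + b[i];
    }
}

int main() {
    const int n = 1 << 20;
    float *a, *b, *c;
    cudaMallocManaged(&a, n * sizeof(float));
    cudaMallocManaged(&b, n * sizeof(float));
    cudaMallocManaged(&c, n * sizeof(float));
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    // 256-thread blocks; on hardware with 32-thread warps, each block
    // is executed as eight warps.
    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    vecAdd<<<blocks, threads>>>(a, b, c, n);
    cudaDeviceSynchronize();

    printf("c[0] = %f\n", c[0]);  // expect 3.0
    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```
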
Unlike traditional SIMD, SIMT allows threads within a warp to follow different control paths. When a branch occurs, some threads may take the true path while others take the false path. The hardware uses masking to disable lanes not on a given path and re-enables them when threads reconverge. This divergence can reduce efficiency, because differing paths cannot be executed identically in parallel and may require serialized handling within a warp.

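To make the masking behavior concrete, here is a sketch with two hypothetical kernels (neither comes from the source): divergent branches on the parity of threadIdx.x, so every warp is split across both paths, while uniform branches at warp granularity, so each warp stays on a single path.

```cuda
#include <cuda_runtime.h>

// Divergent: even and odd lanes of the same warp take different paths.
// The hardware runs the two paths one after the other, masking off the
// lanes that are not on the path currently being executed.
__global__ void divergent(float* out, const float* in, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    if (threadIdx.x % 2 == 0) {
        out[i] = in[i] * 2.0f;
    } else {
        out[i] = in[i] + 1.0f;
    }
}

// Uniform: the branch condition is constant across each warp, so no
// lane masking is required within a warp; different warps simply take
// different paths independently.
__global__ void uniform(float* out, const float* in, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    if ((threadIdx.x / warpSize) % 2 == 0) {
        out[i] = in[i] * 2.0f;
    } else {
        out[i] = in[i] + 1.0f;
    }
}
```

The two kernels differ only in how the branch condition lines up with warp boundaries, which is what determines whether masking and serialization occur.
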
In practice, a warp often consists of a fixed number of threads (for example, 32 in many NVIDIA GPUs), while other architectures use different sizes, such as 64-thread wavefronts. The scheduler interleaves many warps to hide memory latency and keep execution units busy. Performance is strongly influenced by memory access patterns: coalesced global memory accesses, effective use of shared memory, and caching are important for throughput and latency hiding. Occupancy, the number of active warps per streaming multiprocessor, also affects performance, but it is balanced against resource limits per multiprocessor.

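The following sketch (kernel names hypothetical) contrasts a coalesced access pattern with a strided one, and queries occupancy with the CUDA runtime's cudaOccupancyMaxActiveBlocksPerMultiprocessor, which reports how many blocks of a given kernel can be resident on one streaming multiprocessor.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Coalesced: consecutive threads in a warp touch consecutive addresses,
// so the warp's accesses combine into a few wide memory transactions.
__global__ void copyCoalesced(float* dst, const float* src, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) dst[i] = src[i];
}

// Strided: adjacent threads touch addresses `stride` elements apart,
// scattering each warp's accesses across many transactions.
__global__ void copyStrided(float* dst, const float* src, int n, int stride) {
    int i = (blockIdx.x * blockDim.x + threadIdx.x) * stride;
    if (i < n) dst[i] = src[i];
}

int main() {
    // Ask the runtime how many 256-thread blocks of copyCoalesced can
    // be resident on one streaming multiprocessor, given the kernel's
    // register and shared-memory usage (0 bytes of dynamic shared memory).
    int blocksPerSM = 0;
    cudaOccupancyMaxActiveBlocksPerMultiprocessor(
        &blocksPerSM, copyCoalesced, 256, 0);
    printf("Resident 256-thread blocks per SM: %d\n", blocksPerSM);
    return 0;
}
```

On most NVIDIA GPUs the strided copy runs markedly slower than the coalesced one at larger strides, since each warp's loads and stores spread across many memory transactions.
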
SIMT is a core concept in GPU programming models like CUDA and underpins scalable data-parallel computation. It is well suited to workloads such as linear algebra, image processing, and simulations, where many threads operate on large data sets concurrently.
