Prefetching

Prefetching is a performance optimization in which data or instructions are moved to a faster storage layer before they are needed, with the goal of hiding memory latency and increasing system throughput. Prefetching can occur at multiple levels, from processor caches to storage subsystems and even networked resources.

In CPUs, prefetching can be hardware-driven, where the memory subsystem detects access patterns and issues prefetch requests ahead of demand, or software-driven, where compilers or programmers insert prefetch hints or use special intrinsics. Hardware prefetchers typically target regular, stride-based access, while software prefetching can handle irregular patterns but risks misprediction and cache pollution if mis-tuned.
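
As a rough sketch of the software-driven case, the following C fragment uses the GCC/Clang __builtin_prefetch builtin to prefetch irregular, index-driven accesses a fixed distance ahead of the loop. The function name, the prefetch distance, and the locality hint are illustrative assumptions, not recommended settings; the right distance depends on the machine and workload.

    #include <stddef.h>

    /* Gather values through an index array. The data accesses are irregular,
     * so a stride-based hardware prefetcher cannot predict them; a software
     * prefetch issued a fixed distance ahead can hide part of the miss
     * latency instead. PREFETCH_DISTANCE is a tunable, workload-dependent
     * assumption: too small hides little latency, too large risks evicting
     * the data before it is used. */
    #define PREFETCH_DISTANCE 16

    double gather_sum(const double *data, const size_t *idx, size_t n) {
        double total = 0.0;
        for (size_t i = 0; i < n; i++) {
            if (i + PREFETCH_DISTANCE < n) {
                /* GCC/Clang builtin: prefetch for read (0), moderate
                 * temporal locality (1); a no-op where unsupported. */
                __builtin_prefetch(&data[idx[i + PREFETCH_DISTANCE]], 0, 1);
            }
            total += data[idx[i]];
        }
        return total;
    }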

Operating systems and storage subsystems implement prefetching through read-ahead or prefetch algorithms that fetch data blocks before the application requests them, improving performance for sequential or predictable workloads. Disk and memory prefetchers rely on historical access patterns and can adapt to changing workloads, trading off bandwidth against latency.
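
A minimal sketch of how an application can cooperate with the operating system's read-ahead, assuming a POSIX system: posix_fadvise passes access-pattern hints that the kernel is free to act on or ignore. The file name and region size below are illustrative placeholders.

    #define _POSIX_C_SOURCE 200112L
    #include <fcntl.h>
    #include <stdio.h>
    #include <unistd.h>

    /* Open a file and hint the kernel about the upcoming access pattern so
     * its read-ahead logic can fetch blocks before the application asks for
     * them. Error handling is kept minimal. */
    int main(void) {
        int fd = open("large_input.dat", O_RDONLY);
        if (fd < 0) {
            perror("open");
            return 1;
        }

        /* Expect a sequential scan: the kernel may enlarge its read-ahead
         * window for this file descriptor. */
        posix_fadvise(fd, 0, 0, POSIX_FADV_SEQUENTIAL);

        /* Ask for the first 1 MiB to be read into the page cache ahead of
         * use. */
        posix_fadvise(fd, 0, 1 << 20, POSIX_FADV_WILLNEED);

        /* ... read and process the file as usual ... */

        close(fd);
        return 0;
    }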

In web environments, browsers may prefetch or prerender resources associated with a page, or perform DNS prefetching to reduce connection setup delays. While these techniques can improve perceived responsiveness, they increase bandwidth consumption and may raise privacy concerns by accessing resources users have not yet requested.

Effectiveness depends on workload characteristics, data locality, and system resources. Poorly tuned prefetching can evict useful data or saturate bandwidth, while well-tuned prefetchers can yield significant latency reductions and smoother performance.

See also caching and speculative execution.
