memoryjoin
Memoryjoin is a term sometimes used in data processing to describe a join operation where the participating datasets are kept resident in main memory during the computation. In such setups, the join is performed with minimal disk I/O, allowing faster access to records and better cache locality. The concept applies to both batch processing and streaming analytics, where at least one of the inputs remains in memory for the duration of the join.
Common implementations of memoryjoin rely on established join algorithms adapted for in-memory operation. A memory-based hash
Usage scenarios include in-memory databases, real-time analytics, and streaming pipelines where a dimension table is cached
Although "memoryjoin" is not a universally standardized term, it is closely related to in-memory join techniques