Sparkssmall
Sparkssmall is an open-source, lightweight data processing framework designed for small-scale clusters, edge environments, and educational use. It provides a familiar API for batch and streaming workloads while emphasizing low memory usage and a small runtime footprint. The project aims to offer the core programming model of larger systems in a compact package suitable for devices with limited resources.
Origin and development: Sparkssmall originated as a community-driven project forked from larger distributed data frameworks to
Features: Core engine supports batch processing and micro-batch streaming, a DataFrame-like API, and SQL querying. It
Architecture and API: Sparkssmall uses driver and worker components; tasks are scheduled to workers; built-in fault
Reception and status: The project is maintained by a growing, volunteer community and released under an open-source